Author: Gosavi Abhijit
Publisher: Springer Publishing Company
ISSN: 0885-6125
Source: Machine Learning, Vol.55, Iss.1, 2004-04, pp. : 5-29
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
Related content
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms
By Singh S.
Machine Learning, Vol. 38, Iss. 3, 2000-03 ,pp. :
On Average Versus Discounted Reward Temporal-Difference Learning
Machine Learning, Vol. 49, Iss. 2-3, 2002-11 ,pp. :
Kernel-Based Reinforcement Learning
By Ormoneit D.
Machine Learning, Vol. 49, Iss. 2-3, 2002-11 ,pp. :