

Author: Nouri Ali
Publisher: Springer Publishing Company
ISSN: 0885-6125
Source: Machine Learning, Vol. 81, Iss. 1, 2010-10, pp. 85-98
Abstract
The sample complexity of a reinforcement-learning algorithm is highly coupled to how proficiently it explores, which in turn depends critically on the effective size of its state space. This paper proposes a new exploration mechanism for model-based algorithms in continuous state spaces that automatically discovers the relevant dimensions of the environment. We show that this information can be used to dramatically decrease the sample complexity of the algorithm over conventional exploration techniques. This improvement is achieved by maintaining a low-dimensional representation of the transition function. Empirical evaluations in several environments, including simulation benchmarks and a real robotics domain, suggest that the new method outperforms state-of-the-art algorithms and that the behavior is robust and stable.