On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning

Author: de Farias D.P.   van Roy B.  

Publisher: Springer Publishing Company

ISSN: 0022-3239

Source: Journal of Optimization Theory and Applications, Vol.105, Iss.3, 2000-06, pp. : 589-608

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Previous Menu Next

Abstract