Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques

Author: Gosavi, Abhijit

Publisher: Taylor & Francis Ltd

ISSN: 0308-1079

Source: International Journal of General Systems, Vol. 43, Iss. 6, 2014-08, pp. 649-669
