A dynamic programming strategy to balance exploration and exploitation in the bandit problem

Author: Caelen Olivier   Bontempi Gianluca  

Publisher: Springer Publishing Company

ISSN: 1012-2443

Source: Annals of Mathematics and Artificial Intelligence, Vol.60, Iss.1-2, 2010-10, pp. : 3-24

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Previous Menu Next

Abstract