UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem

Author: Auer Peter  

Publisher: Springer Publishing Company

ISSN: 0031-5303

Source: Periodica Mathematica Hungarica, Vol.61, Iss.1-2, 2010-09, pp. : 55-65

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Previous Menu Next

Abstract