An asymptotically optimal policy for finite support models in the multiarmed bandit problem

Author: Honda Junya   Takemura Akimichi  

Publisher: Springer Publishing Company

ISSN: 0885-6125

Source: Machine Learning, Vol.85, Iss.3, 2011-12, pp. : 361-391

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Previous Menu Next

Abstract