ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS

Publisher: Cambridge University Press

E-ISSN: 1469-8951

ISSN: 0269-9648

Source: Probability in the Engineering and Informational Sciences, Vol. 31, Iss. 2, 2016-09, pp. 239-263

Abstract