19 items in total
- [3] Adaptive Treatment Allocation and the Multiarmed Bandit Problem. Annals of Statistics, 1987, 15(3): 1091-1114
- [5] Bandit Problems in Networks: Asymptotically Efficient Distributed Allocation Rules. 2011 50th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC), 2011: 1771-1778
- [7] An asymptotically optimal policy for finite support models in the multiarmed bandit problem. Machine Learning, 2011, 85: 361-391