50 items in total
- [21] GAUSSIAN PROCESS MODELLING OF DEPENDENCIES IN MULTI-ARMED BANDIT PROBLEMS. PROCEEDINGS OF THE 10TH INTERNATIONAL SYMPOSIUM ON OPERATIONAL RESEARCH SOR 09, 2009: 77-84
- [22] Time-Varying Stochastic Multi-Armed Bandit Problems. CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014: 2103-2107
- [24] Synchronization and optimality for multi-armed bandit problems in continuous time. COMPUTATIONAL & APPLIED MATHEMATICS, 1997, 16 (02): 117-151
- [26] The Effect of Communication on Noncooperative Multiplayer Multi-Armed Bandit Problems. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017: 331-336
- [27] An asymptotically optimal strategy for constrained multi-armed bandit problems. Mathematical Methods of Operations Research, 2020, 91: 545-557
- [28] On the Optimality of Perturbations in Stochastic and Adversarial Multi-armed Bandit Problems. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [29] Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 5 (01): 1-122
- [30] Mean Field Equilibrium in Multi-Armed Bandit Game with Continuous Reward. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021: 3118-3124