共 50 条
- [21] No Regret Bound for Extreme Bandits ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 259 - 267
- [23] The Pareto Regret Frontier for Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [24] Regret Bounds for Batched Bandits THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7340 - 7348
- [25] Dueling Bandits with Weak Regret INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [26] On Logarithmic Regret for Bandits with Knapsacks 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
- [27] UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem Periodica Mathematica Hungarica, 2010, 61 : 55 - 65
- [29] On the Sublinear Regret of GP-UCB ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [30] Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202