共 50 条
- [12] Thresholding Bandits with Augmented UCB PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2515 - 2521
- [14] CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
- [15] Regret of Queueing Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [17] Efficient Kernel UCB for Contextual Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5689 - 5720
- [18] Nearly Optimal Regret for Stochastic Linear Bandits with Heavy-Tailed Payoffs PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2936 - 2942