共 50 条
- [21] Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [22] Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [23] No-Regret Algorithms for Heavy-Tailed Linear Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [26] Instance-optimal PAC Algorithms for Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [27] Provably Optimal Algorithms for Generalized Linear Contextual Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [28] Regret of Queueing Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [30] Constrained regret minimization for multi-criterion multi-armed bandits Machine Learning, 2023, 112 : 431 - 458