共 50 条
- [21] Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [22] A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
- [23] EFFICIENT ALGORITHMS FOR LINEAR POLYHEDRAL BANDITS 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4796 - 4800
- [24] Sublinear Optimal Policy Value Estimation in Contextual Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4377 - 4386
- [25] Best-of-Both-Worlds Algorithms for Linear Contextual Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [27] Communication Efficient Distributed Learning for Kernelized Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [28] Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202 : 691 - 717
- [29] Optimal and Adaptive Off-policy Evaluation in Contextual Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [30] Optimal Baseline Corrections for Off-Policy Contextual Bandits PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, : 722 - 732