共 50 条
- [1] Adversarially Robust Multi-Armed Bandit Algorithm with Variance-Dependent Regret Bounds CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
- [2] Optimal Regret Bounds for Collaborative Learning in Bandits INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
- [3] Hybrid Regret Bounds for Combinatorial Semi-Bandits and Adversarial Linear Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [4] Minimax Regret Bounds for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [5] Variational Regret Bounds for Reinforcement Learning 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 81 - 90
- [6] Tight Regret Bounds for Infinite-armed Linear Contextual Bandits 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 370 - 378
- [7] Regret Bounds for Learning State Representations in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [8] Variational Bayesian Reinforcement Learning with Regret Bounds ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
- [9] Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design STOC '21: PROCEEDINGS OF THE 53RD ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2021, : 74 - 87
- [10] Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,