共 50 条
- [31] Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
- [32] Online Regret Bounds for Markov Decision Processes with Deterministic Transitions ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2008, 5254 : 123 - 137
- [34] A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020, : 51 - 55
- [35] Regret Analysis in Deterministic Reinforcement Learning 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2246 - 2251
- [36] Optimal Regret Bounds for Collaborative Learning in Bandits INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
- [37] Regret Bounds for Transfer Learning in Bayesian Optimisation ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 307 - 315
- [38] Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [40] Reinforcement Learning with Logarithmic Regret and Policy Switches ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,