共 50 条
- [1] Regret bounds for reinforcement learning via markov chain concentration Journal of Artificial Intelligence Research, 2020, 67 : 115 - 128
- [2] Minimax Regret Bounds for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [3] Variational Regret Bounds for Reinforcement Learning 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 81 - 90
- [4] Regret Bounds for Learning State Representations in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [5] Variational Bayesian Reinforcement Learning with Regret Bounds ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
- [7] Regret Bounds for Information-Directed Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [8] Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 524 - 532
- [9] Regret Bounds for Risk-Sensitive Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [10] Kernelized Reinforcement Learning with Order Optimal Regret Bounds ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,