共 50 条
- [2] The Bayesian Prophet: A Low-Regret Framework for Online Decision Making Performance Evaluation Review, 2019, 47 (01): : 81 - 82
- [3] Dynamic Regret of Online Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [4] Minimax Regret Optimisation for Robust Planning in Uncertain Markov Decision Processes THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11930 - 11938
- [5] Online Regret Bounds for Markov Decision Processes with Deterministic Transitions ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2008, 5254 : 123 - 137
- [7] Simple Regret Optimization in Online Planning for Markov Decision Processes JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 51 : 165 - 205
- [8] Contextual Recommendations and Low-Regret Cutting-Plane Algorithms ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [9] Reinforcement Learning Algorithms for Regret Minimization in Structured Markov Decision Processes AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1289 - 1290