共 50 条
- [21] An Analytic Characterization of Model Minimization in Factored Markov Decision Processes PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1077 - 1082
- [25] Pure Exploration in Episodic Fixed-Horizon Markov Decision Processes AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1703 - 1704
- [26] IMED-RL: Regret optimal learning of ergodic Markov decision processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [27] Online Learning for Markov Decision Processes in Nonstationary Environments: A Dynamic Regret Analysis 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 1232 - 1237
- [28] Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs) JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 59 : 229 - 264
- [29] Differentially Private Online Submodular Minimization 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89