共 50 条
- [1] Differentially Private Reward Functions for Markov Decision Processes 2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024, 2024, : 631 - 636
- [2] Reinforcement Learning Algorithms for Regret Minimization in Structured Markov Decision Processes AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1289 - 1290
- [4] A Duality Approach for Regret Minimization in Average-Reward Ergodic Markov Decision Processes LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 862 - 883
- [6] Dynamic Regret of Online Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [7] Parametric Regret in Uncertain Markov Decision Processes PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 3606 - 3613
- [8] Episodic task learning in Markov decision processes Artificial Intelligence Review, 2011, 36 : 87 - 98
- [10] Variance minimization of parameterized Markov decision processes DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2018, 28 (01): : 63 - 81