50 entries in total
- [1] A Novel Q-learning Algorithm with Function Approximation for Constrained Markov Decision Processes. 2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012: 400-405
- [3] Model-free Safe Reinforcement Learning Method Based on Constrained Markov Decision Processes. Ruan Jian Xue Bao/Journal of Software, 2022, 33(08): 3086-3102
- [4] Risk-aware Q-Learning for Markov Decision Processes. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017
- [5] On Q-learning Convergence for Non-Markov Decision Processes. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018: 2546-2552
- [9] Learning in Constrained Markov Decision Processes. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10(01): 441-453