共 50 条
- [21] Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [23] Discounted Markov decision processes with fuzzy costs Annals of Operations Research, 2020, 295 : 769 - 786