共 50 条
- [23] An optimistic value iteration for mean-variance optimization in discounted Markov decision processes RESULTS IN CONTROL AND OPTIMIZATION, 2022, 8
- [26] The complexity of Policy Iteration is exponential for discounted Markov Decision Processes 2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5997 - 6002
- [29] A SECONDARY APPROACH TO THE DISCOUNTED MODEL IN SEMI-MARKOV DECISION-PROCESSES KEXUE TONGBAO, 1988, 33 (06): : 448 - 454