50 items in total
- [32] Optimal policies for constrained average-cost Markov decision processes TOP, 2011, 19 : 107 - 120
- [33] From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning Discrete Event Dynamic Systems, 2003, 13 : 9 - 39
- [35] Reinforcement learning algorithm for partially observable Markov decision processes Kongzhi yu Juece/Control and Decision, 2004, 19 (11): 1263 - 1266
- [36] From perturbation analysis to Markov decision processes and reinforcement learning DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (1-2): 9 - 39
- [38] Toward an Optimized Value Iteration Algorithm for Average Cost Markov Decision Processes 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010: 930 - 934