共 50 条
- [21] Average Reward Reinforcement Learning for Semi-Markov Decision Processes NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
- [25] Fast Approximate Dynamic Programming for Infinite-Horizon Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [29] Incremental Improvements of Heuristic Policies for Average-Reward Markov Decision Processes IFAC PAPERSONLINE, 2020, 53 (02): : 1721 - 1728
- [30] RVI Reinforcement Learning for Semi-Markov Decision Processes with Average Reward 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 1674 - 1679