共 50 条
- [2] SEMI-MARKOV DECISION PROCESSES WITH UNBOUNDED REWARDS MANAGEMENT SCIENCE SERIES A-THEORY, 1973, 19 (07): : 717 - 731
- [5] Constrained discounted semi-Markov decision processes MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244
- [7] Average Reward Reinforcement Learning for Semi-Markov Decision Processes NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
- [9] RVI Reinforcement Learning for Semi-Markov Decision Processes with Average Reward 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 1674 - 1679