共 50 条
- [31] Average Reward Reinforcement Learning for Semi-Markov Decision Processes NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
- [37] Risk Aversion in Finite Markov Decision Processes Using Total Cost Criteria and Average Value at Risk 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 335 - 342
- [39] CONTINUITY OF MEAN RECURRENCE TIMES IN DENUMERABLE SEMI-MARKOV PROCESSES ZEITSCHRIFT FUR WAHRSCHEINLICHKEITSTHEORIE UND VERWANDTE GEBIETE, 1985, 69 (04): : 581 - 592