50 items in total
- [21] A sensitivity view of Markov decision processes and reinforcement learning MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
- [23] On the convergence of projective-simulation–based reinforcement learning in Markov decision processes Quantum Machine Intelligence, 2020, 2
- [24] Verification of Markov Decision Processes Using Learning Algorithms AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, ATVA 2014, 2014, 8837 : 98 - 114
- [25] Combining Learning Algorithms: An Approach to Markov Decision Processes ENTERPRISE INFORMATION SYSTEMS, ICEIS 2012, 2013, 141 : 172 - 188
- [26] Learning and Planning in Average-Reward Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7665 - 7676
- [27] BATCH POLICY LEARNING IN AVERAGE REWARD MARKOV DECISION PROCESSES ANNALS OF STATISTICS, 2022, 50 (06) : 3364 - 3387
- [28] ON PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES WITH AN AVERAGE COST CRITERION PROCEEDINGS OF THE 28TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-3, 1989 : 1267 - 1273