50 items in total
- [42] Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes. Doklady Mathematics, 2023, 108: S382-S392
- [43] Policy gradient stochastic approximation algorithms for adaptive control of constrained time varying Markov decision processes. 42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003: 2823-2828
- [45] Verification of Markov Decision Processes Using Learning Algorithms. AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, ATVA 2014, 2014, 8837: 98-114
- [48] Improved Algorithms for Misspecified Linear Markov Decision Processes. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [50] Combining Learning Algorithms: An Approach to Markov Decision Processes. ENTERPRISE INFORMATION SYSTEMS, ICEIS 2012, 2013, 141: 172-188