50 entries in total
- [32] Learning Adversarial Markov Decision Processes with Delayed Feedback. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022 : 7281 - 7289
- [33] A reinforcement learning based algorithm for Markov decision processes. 2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005 : 199 - 204
- [34] Verification of Markov Decision Processes Using Learning Algorithms. AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, ATVA 2014, 2014, 8837 : 98 - 114
- [36] Learning and Planning with Timing Information in Markov Decision Processes. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015 : 111 - 120
- [38] Combining Learning Algorithms: An Approach to Markov Decision Processes. ENTERPRISE INFORMATION SYSTEMS, ICEIS 2012, 2013, 141 : 172 - 188
- [39] A sensitivity view of Markov decision processes and reinforcement learning. MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
- [40] Online Learning in Markov Decision Processes with Continuous Actions. ALGORITHMIC LEARNING THEORY, ALT 2015, 2015, 9355 : 302 - 316