共 50 条
- [42] A sensitivity view of Markov decision processes and reinforcement learning MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283
- [43] Online Learning in Markov Decision Processes with Continuous Actions ALGORITHMIC LEARNING THEORY, ALT 2015, 2015, 9355 : 302 - 316
- [47] An ε-Greedy Multiarmed Bandit Approach to Markov Decision Processes STATS, 2023, 6 (01): : 99 - 112
- [48] A Learning Based Approach to Control Synthesis of Markov Decision Processes for Linear Temporal Logic Specifications 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1091 - 1096
- [49] Efficient qualitative analysis of classes of recursive Markov decision processes and simple Stochastic games STACS 2006, PROCEEDINGS, 2006, 3884 : 634 - 645
- [50] RECURSIVE ADAPTIVE-CONTROL OF MARKOV DECISION-PROCESSES WITH THE AVERAGE REWARD CRITERION APPLIED MATHEMATICS AND OPTIMIZATION, 1991, 23 (02): : 193 - 207