50 entries in total
- [41] Optimal Policies for Quantum Markov Decision Processes. International Journal of Automation and Computing, 2021, 18: 410-421
- [45] Learning Policies for Markov Decision Processes in Continuous Spaces. 2018 IEEE Conference on Decision and Control (CDC), 2018: 4751-4758
- [48] Nonexistence of Epsilon-Optimal Randomized Stationary Policies in Average Cost Markov Decision Models. Annals of Mathematical Statistics, 1971, 42(05): 1767-&
- [49] Existence of Optimal Stationary Policies in Average Reward Markov Decision-Processes with a Recurrent State. Applied Mathematics and Optimization, 1992, 26(02): 171-194