共 50 条
- [28] Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (10): : 6375 - 6387
- [29] The complexity of Policy Iteration is exponential for discounted Markov Decision Processes 2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5997 - 6002
- [30] A policy iteration heuristic for constrained discounted controlled Markov Chains Optimization Letters, 2012, 6 : 1573 - 1577