共 50 条
- [23] Optimal Policies for Quantum Markov Decision Processes International Journal of Automation and Computing, 2021, 18 : 410 - 421
- [24] EXISTENCE OF AN OPTIMAL STATIONARY POLICY IN A MARKOV DECISION PROCESS THEORY OF PROBILITY AND ITS APPLICATIONS,USSR, 1965, 10 (01): : 120 - +
- [26] Efficient Policy Iteration for Periodic Markov Decision Processes 21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 1167 - 1172
- [28] MONOTONE OPTIMAL POLICIES FOR MARKOV DECISION-PROCESSES MATHEMATICAL PROGRAMMING STUDY, 1976, 6 (DEC): : 202 - 215
- [29] Policy iteration for robust nonstationary Markov decision processes Optimization Letters, 2016, 10 : 1613 - 1628
- [30] Policy Gradient for Rectangular Robust Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,