共 50 条
- [31] An optimal policy for partially observable markov decision processes with non-independent monitors ADVANCED RELIABILITY MODELING, 2004, : 213 - 220
- [32] Navigating to the Best Policy in Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [34] Geometric Policy Iteration for Markov Decision Processes PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2070 - 2078
- [36] Efficient Policy Representation for Markov Decision Processes SMART TECHNOLOGIES IN URBAN ENGINEERING, STUE-2022, 2023, 536 : 151 - 162