50 entries in total
- [31] Policy Iteration for Decentralized Control of Markov Decision Processes. Journal of Artificial Intelligence Research, 2009, 34: 89-132
- [32] Optimal Decision Tree Policies for Markov Decision Processes. Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023, 2023: 5457-5465
- [36] Policy Gradient Stochastic Approximation Algorithms for Adaptive Control of Constrained Time Varying Markov Decision Processes. 42nd IEEE Conference on Decision and Control, Vols 1-6, Proceedings, 2003: 2823-2828
- [38] Existence of Optimal Stationary Policies in Average Reward Markov Decision-Processes with a Recurrent State. Applied Mathematics and Optimization, 1992, 26(2): 171-194
- [40] Computing Semi-Stationary Optimal Policies for Multichain Semi-Markov Decision Processes. Annals of Operations Research, 2020, 287: 843-865