50 items in total
- [41] Learning and optimal control of imprecise Markov decision processes by dynamic programming using the imprecise Dirichlet model. SOFT METHODOLOGY AND RANDOM INFORMATION SYSTEMS, 2004: 141-148
- [42] Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes. 2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024: 960-967
- [43] A performance gradient perspective on approximate dynamic programming and its application to partially observable Markov decision processes. PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL, 2006: 87-+
- [44] Topological Value Iteration Algorithm for Markov Decision Processes. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007: 1860-1865
- [46] A reinforcement learning based algorithm for Markov decision processes. 2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005: 199-204
- [48] Algorithm of discounted model of partially observable Markov decision programming. Hunan Daxue Xuebao, 5 (16):
- [49] Dynamic workflow composition using Markov decision processes. IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2004: 576-582
- [50] DYNAMIC-PROGRAMMING RECURSIONS FOR MULTIPLICATIVE MARKOV DECISION CHAINS. MATHEMATICAL PROGRAMMING STUDY, 1976, 6 (DEC): 216-226