共 50 条
- [33] Value iteration and approximately optimal stationary policies in finite-state average Markov decision chains Mathematical Methods of Operations Research, 2002, 56 : 181 - 196
- [34] SEMI-MARKOV DECISION PROCESSES WITH UNBOUNDED REWARDS MANAGEMENT SCIENCE SERIES A-THEORY, 1973, 19 (07): : 717 - 731
- [37] Optimal policies for constrained average-cost Markov decision processes TOP, 2011, 19 : 107 - 120