Dynamic programming in constrained Markov decision processes

Citations: 0
Author
Piunovskiy, A. B. [1]
Affiliation
[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
Source
CONTROL AND CYBERNETICS | 2006, Vol. 35, No. 3
Keywords
Markov decision process (MDP); constraints; optimization; dynamic programming; myopic control strategy; queuing system
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that another discounted loss must not exceed a specified value, almost surely. We show that the problem can be reformulated as a standard MDP and solved using the Dynamic Programming approach. An example on a controlled queue is presented. In the last section, we briefly reinforce the connection of the Dynamic Programming approach to another closely related problem statement and present the corresponding example. Several other types of constraints are also discussed.
Pages: 645-660
Page count: 16