Dynamic programming in constrained Markov decision processes

被引：0

作者：

Piunovskiy, A. B. ^{[1
]}

机构：

[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

来源：

CONTROL AND CYBERNETICS | 2006年 / 35卷 / 03期

关键词：

Markov decision process (MDP); constraints; optimization; dynamic programming; myopic control strategy; queuing system;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that another discounted loss must not exceed a specified value, almost surely. We show that the problem can be reformulated as a standard MDP and solved using the Dynamic Programming approach. An example on a controlled queue is presented. In the last section, we briefly reinforce the connection of the Dynamic Programming approach to another close problem statement and present the corresponding example. Several other types of constraints are discussed, as well.

引用

页码：645 / 660

页数：16

共 50 条

[1] Constrained Markovian decision processes: the dynamic programming approach
Piunovskiy, AB
Mao, X
OPERATIONS RESEARCH LETTERS, 2000, 27 (03) : 119 - 126
[2] FINITE LINEAR PROGRAMMING APPROXIMATIONS OF CONSTRAINED DISCOUNTED MARKOV DECISION PROCESSES
Dufour, Francois
Prieto-Rumeau, Tomas
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2013, 51 (02) : 1298 - 1324
[3] Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes
Poupart, Pascal
Malhotra, Aarti
Pei, Pei
Kim, Kee-Eung
Goh, Bongseok
Bowling, Michael
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3342 - 3348
[4] On constrained Markov decision processes
Department of Econometrics, University of Sydney, Sydney, NSW 2006, Australia
不详
Oper Res Lett, 1 (25-28):
[5] On constrained Markov decision processes
Haviv, M
OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
[6] Risk-averse dynamic programming for Markov decision processes
Ruszczynski, Andrzej
MATHEMATICAL PROGRAMMING, 2010, 125 (02) : 235 - 261
[7] Risk-averse dynamic programming for Markov decision processes
Andrzej Ruszczyński
Mathematical Programming, 2010, 125 : 235 - 261
[8] Conditions for the Solvability of the Linear Programming Formulation for Constrained Discounted Markov Decision Processes
Dufour, F.
Prieto-Rumeau, T.
APPLIED MATHEMATICS AND OPTIMIZATION, 2016, 74 (01): : 27 - 51
[9] Conditions for the Solvability of the Linear Programming Formulation for Constrained Discounted Markov Decision Processes
F. Dufour
T. Prieto-Rumeau
Applied Mathematics & Optimization, 2016, 74 : 27 - 51
[10] Learning in Constrained Markov Decision Processes
Singh, Rahul
Gupta, Abhishek
Shroff, Ness B.
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): : 441 - 453

← 1 2 3 4 5 →