Dynamic programming in constrained Markov decision processes

被引：0

作者：

Piunovskiy, A. B. ^{[1
]}

机构：

[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

来源：

CONTROL AND CYBERNETICS | 2006年 / 35卷 / 03期

关键词：

Markov decision process (MDP); constraints; optimization; dynamic programming; myopic control strategy; queuing system;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that another discounted loss must not exceed a specified value, almost surely. We show that the problem can be reformulated as a standard MDP and solved using the Dynamic Programming approach. An example on a controlled queue is presented. In the last section, we briefly reinforce the connection of the Dynamic Programming approach to another close problem statement and present the corresponding example. Several other types of constraints are discussed, as well.

引用

页码：645 / 660

页数：16

共 50 条

[21] Risk-constrained Markov Decision Processes
Borkar, Vivek
Jain, Rahul
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 2664 - 2669
[22] Risk-Constrained Markov Decision Processes
Borkar, Vivek
Jain, Rahul
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (09) : 2574 - 2579
[23] Constrained Markov Decision Processes for Intelligent Traffic
Singh, Tripty
2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
[24] Entropy Maximization for Constrained Markov Decision Processes
Savas, Yagiz
Ornik, Melkior
Cubuktepe, Murat
Topcu, Ufuk
2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 911 - 918
[25] Real-time dynamic programming for Markov decision processes with imprecise probabilities
Delgado, Karina V.
de Barros, Leliane N.
Dias, Daniel B.
Sanner, Scott
ARTIFICIAL INTELLIGENCE, 2016, 230 : 192 - 223
[26] Dominance-constrained Markov decision processes
Haskell, William B.
Jain, Rahul
2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5991 - 5996
[27] Constrained Markov decision processes with uncertain costs
Varagapriya, V.
Singh, Vikas Vikram
Lisser, Abdel
OPERATIONS RESEARCH LETTERS, 2022, 50 (02) : 218 - 223
[28] Fast Approximate Dynamic Programming for Infinite-Horizon Markov Decision Processes
Kolarijani, M. A. S.
Max, G. F.
Esfahani, P. Mohajerin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[29] Towards Dynamic Pricing for Shared Mobility on Demand using Markov Decision Processes and Dynamic Programming
Guan, Yue
Annaswamy, Anuradha M.
Tseng, H. Eric
2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
[30] Linear programming-based solution methods for constrained partially observable Markov decision processes
Robert K. Helmeczi
Can Kavaklioglu
Mucahit Cevik
Applied Intelligence, 2023, 53 : 21743 - 21769

← 1 2 3 4 5 →