Dynamic programming in constrained Markov decision processes

Cited by: 0
Author
Piunovskiy, A. B. [1]
Affiliation
[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
Source
CONTROL AND CYBERNETICS | 2006 / Vol. 35 / No. 03
Keywords
Markov decision process (MDP); constraints; optimization; dynamic programming; myopic control strategy; queuing system;
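DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract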
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that another discounted loss must not exceed a specified value, almost surely. We show that the problem can be reformulated as a standard MDP and solved using the Dynamic Programming approach. An example on a controlled queue is presented. In the last section, we briefly reinforce the connection of the Dynamic Programming approach to another close problem statement and present the corresponding example. Several other types of constraints are discussed, as well.
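
The reformulation mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes a finite state and action space, nonnegative per-step losses, and a finite-horizon truncation, and all names (`solve_constrained_mdp`, `P`, `cost`, `loss`) are hypothetical. The state is extended with the remaining loss budget `w`, so the almost-sure constraint becomes a restriction on admissible actions, and ordinary backward dynamic programming applies to the extended state.

```python
from functools import lru_cache
from math import inf


def solve_constrained_mdp(P, cost, loss, beta, horizon):
    """Return V(t, x, w): the minimal expected discounted cost from time t in
    state x when the remaining loss budget, in time-t units, is w.
    V is inf when no action sequence keeps the loss within budget on every path.

    P[x][a] is a dict {next_state: probability}; cost[x][a] and loss[x][a]
    are per-step values. Losses are assumed nonnegative (illustrative setup).
    """
    n_actions = len(cost[0])

    @lru_cache(maxsize=None)
    def V(t, x, w):
        if t == horizon:
            return 0.0  # truncated horizon: nothing left to pay
        best = inf
        for a in range(n_actions):
            rest = w - loss[x][a]
            if rest < 0:
                continue  # this action alone would exceed the budget
            # Budget carried to the next step, rescaled by the discount factor.
            w_next = rest / beta
            # Any reachable successor with V == inf makes the sum inf, so the
            # action is rejected unless the constraint holds on every path.
            future = sum(p * V(t + 1, y, w_next)
                         for y, p in P[x][a].items() if p > 0)
            best = min(best, cost[x][a] + beta * future)
        return best

    return V


# Toy instance: one state, action 0 is free but incurs loss 1,
# action 1 costs 1 and incurs no loss.
P = [[{0: 1.0}, {0: 1.0}]]
cost = [[0.0, 1.0]]
loss = [[1.0, 0.0]]
V = solve_constrained_mdp(P, cost, loss, beta=0.5, horizon=2)
print(V(0, 0, 1.0))
```

In an infinite-horizon setting the budget component is continuous, so a practical implementation would need a discretization or a structural argument about the value function; the memoized recursion above only covers the truncated case.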
Pages: 645 - 660
Number of pages: 16
Related papers
50 in total
  • [21] Risk-constrained Markov Decision Processes
    Borkar, Vivek
    Jain, Rahul
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 2664 - 2669
  • [22] Risk-Constrained Markov Decision Processes
    Borkar, Vivek
    Jain, Rahul
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (09) : 2574 - 2579
  • [23] Constrained Markov Decision Processes for Intelligent Traffic
    Singh, Tripty
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [24] Entropy Maximization for Constrained Markov Decision Processes
    Savas, Yagiz
    Ornik, Melkior
    Cubuktepe, Murat
    Topcu, Ufuk
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 911 - 918
  • [25] Real-time dynamic programming for Markov decision processes with imprecise probabilities
    Delgado, Karina V.
    de Barros, Leliane N.
    Dias, Daniel B.
    Sanner, Scott
    ARTIFICIAL INTELLIGENCE, 2016, 230 : 192 - 223
  • [26] Dominance-constrained Markov decision processes
    Haskell, William B.
    Jain, Rahul
    2012 IEEE 51ST ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2012, : 5991 - 5996
  • [27] Constrained Markov decision processes with uncertain costs
    Varagapriya, V.
    Singh, Vikas Vikram
    Lisser, Abdel
    OPERATIONS RESEARCH LETTERS, 2022, 50 (02) : 218 - 223
  • [28] Fast Approximate Dynamic Programming for Infinite-Horizon Markov Decision Processes
    Kolarijani, M. A. S.
    Max, G. F.
    Esfahani, P. Mohajerin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [29] Towards Dynamic Pricing for Shared Mobility on Demand using Markov Decision Processes and Dynamic Programming
    Guan, Yue
    Annaswamy, Anuradha M.
    Tseng, H. Eric
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [30] Linear programming-based solution methods for constrained partially observable Markov decision processes
    Robert K. Helmeczi
    Can Kavaklioglu
    Mucahit Cevik
    Applied Intelligence, 2023, 53 : 21743 - 21769