REALIZABLE STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES

被引:2
|
作者
Piunovskiy, Alexey [1 ]
机构
[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England
关键词
continuous-time Markov decision process; total cost; discounted cost; relaxed strategy; randomized strategy; MODELS;
D O I
10.1137/17M1138959
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the Borel model of the continuous-time Markov decision process, we introduce a wide class of control strategies. In a particular case, such strategies transform to the standard relaxed strategies, intensively studied in the last decade. In another special case, if one restricts to another special subclass of the general strategies, the model transforms to the semi-Markov decision process. Further, we show that the relaxed strategies are not realizable. For the constrained optimal control problem with total expected costs, we describe the sufficient class of realizable strategies, the so-called Poisson-related strategies. Finally, we show that, for solving the formulated optimal control problems, one can use all the tools developed earlier for the classical discrete-time Markov decision processes.
引用
收藏
页码:473 / 495
页数:23
相关论文
共 50 条
  • [21] A characterization of meaningful schedulers for continuous-time Markov decision processes
    Wolovick, Nicolas
    Johr, Sven
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, 2006, 4202 : 352 - 367
  • [22] Policy learning in continuous-time Markov decision processes using Gaussian Processes
    Bartocci, Ezio
    Bortolussi, Luca
    Brazdil, Tomas
    Milios, Dimitrios
    Sanguinetti, Guido
    PERFORMANCE EVALUATION, 2017, 116 : 84 - 100
  • [23] Discounted optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 1785 - 1787
  • [24] Average optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    Rieder, Ulrich
    ANNALS OF APPLIED PROBABILITY, 2006, 16 (02): : 730 - 756
  • [25] On continuous-time Markov processes in bargaining
    Houba, Harold
    ECONOMICS LETTERS, 2008, 100 (02) : 280 - 283
  • [26] Variance minimization for continuous-time Markov decision processes: two approaches
    Quan-xin Zhu
    Applied Mathematics-A Journal of Chinese Universities, 2010, 25 : 400 - 410
  • [27] Denumerable continuous-time Markov decision processes with multiconstraints on average costs
    Liu, Qiuli
    Tan, Hangsheng
    Guo, Xianping
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2012, 43 (03) : 576 - 585
  • [28] DISCOUNTED CONTINUOUS-TIME CONSTRAINED MARKOV DECISION PROCESSES IN POLISH SPACES
    Guo, Xianping
    Song, Xinyuan
    ANNALS OF APPLIED PROBABILITY, 2011, 21 (05): : 2016 - 2049
  • [29] ABSORBING CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH TOTAL COST CRITERIA
    Guo, Xianping
    Vykertas, Mantas
    Zhang, Yi
    ADVANCES IN APPLIED PROBABILITY, 2013, 45 (02) : 490 - 519
  • [30] A survey of recent results on continuous-time Markov decision processes - Discussion
    Hu, Qiying
    TOP, 2006, 14 (02) : 248 - 251