REALIZABLE STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES

Cited by: 2

Authors:

Piunovskiy, Alexey [1]

Affiliations:

[1] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

Source:

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 2018 / Vol. 56 / Issue 01

Keywords:

continuous-time Markov decision process; total cost; discounted cost; relaxed strategy; randomized strategy; MODELS;

DOI:

10.1137/17M1138959

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

For the Borel model of the continuous-time Markov decision process, we introduce a wide class of control strategies. In a particular case, such strategies transform to the standard relaxed strategies, intensively studied in the last decade. In another special case, if one restricts to another special subclass of the general strategies, the model transforms to the semi-Markov decision process. Further, we show that the relaxed strategies are not realizable. For the constrained optimal control problem with total expected costs, we describe the sufficient class of realizable strategies, the so-called Poisson-related strategies. Finally, we show that, for solving the formulated optimal control problems, one can use all the tools developed earlier for the classical discrete-time Markov decision processes.

Pages: 473 - 495

Page count: 23

50 items in total

[21] A characterization of meaningful schedulers for continuous-time Markov decision processes
Wolovick, Nicolas
Johr, Sven
FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, 2006, 4202 : 352 - 367
[22] Policy learning in continuous-time Markov decision processes using Gaussian Processes
Bartocci, Ezio
Bortolussi, Luca
Brazdil, Tomas
Milios, Dimitrios
Sanguinetti, Guido
PERFORMANCE EVALUATION, 2017, 116 : 84 - 100
[23] Discounted optimality for continuous-time Markov decision processes in Polish spaces
Guo, Xianping
2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006 : 1785 - 1787
[24] Average optimality for continuous-time Markov decision processes in Polish spaces
Guo, Xianping
Rieder, Ulrich
ANNALS OF APPLIED PROBABILITY, 2006, 16 (02) : 730 - 756
[25] On continuous-time Markov processes in bargaining
Houba, Harold
ECONOMICS LETTERS, 2008, 100 (02) : 280 - 283
[26] Variance minimization for continuous-time Markov decision processes: two approaches
Zhu, Quan-xin
Applied Mathematics-A Journal of Chinese Universities, 2010, 25 : 400 - 410
[27] Denumerable continuous-time Markov decision processes with multiconstraints on average costs
Liu, Qiuli
Tan, Hangsheng
Guo, Xianping
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2012, 43 (03) : 576 - 585
[28] DISCOUNTED CONTINUOUS-TIME CONSTRAINED MARKOV DECISION PROCESSES IN POLISH SPACES
Guo, Xianping
Song, Xinyuan
ANNALS OF APPLIED PROBABILITY, 2011, 21 (05) : 2016 - 2049
[29] ABSORBING CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH TOTAL COST CRITERIA
Guo, Xianping
Vykertas, Mantas
Zhang, Yi
ADVANCES IN APPLIED PROBABILITY, 2013, 45 (02) : 490 - 519
[30] A survey of recent results on continuous-time Markov decision processes - Discussion
Hu, Qiying
TOP, 2006, 14 (02) : 248 - 251