Optimal time-abstract schedulers for CTMDPs and continuous-time Markov games

被引：7

作者：

Rabe, Markus N. ^{[1
]}

Schewe, Sven ^{[1
]}

机构：

[1] Univ Liverpool, Liverpool L69 3BX, Merseyside, England

来源：

THEORETICAL COMPUTER SCIENCE | 2013年 / 467卷

基金：

英国工程与自然科学研究理事会;

关键词：

Continuous-time Markov decision processes; Continuous-time Markov games; Optimal control; Time-bounded reachability; BOUNDED REACHABILITY;

D O I：

10.1016/j.tcs.2012.10.001

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We study time-bounded reachability in continuous-time Markov decision processes (CTMDPs) and games (CTGs) for time-abstract scheduler classes. Reachability problems play a paramount role in probabilistic model checking. Consequently, their analysis has been studied intensively, and approximation techniques are well understood. From a mathematical point of view, however, the question of approximation is secondary compared to the fundamental question whether or not optimal control exists. In this article, we demonstrate the existence of optimal schedulers for the time-abstract scheduler classes for CTMDPs. For CTGs, we distinguish two cases: the simple case where both players face the same restriction to use time-abstract strategies (symmetric CTGs) and the case where one player is a completely informed adversary (asymmetric CTGs). While for the former case optimal strategies exist, we prove that for asymmetric CTGs there is not necessarily a scheduler that attains the optimum. It turns out that for CTMDPs and symmetric CTGs optimal time-abstract schedulers have an amazingly simple structure: they converge to a memoryless scheduling policy after a finite number of steps. This allows us to compute time-abstract strategies with finite memory. (C) 2012 Elsevier B.V. All rights reserved.

引用

页码：53 / 67

页数：15

共 50 条

[21] Perturbations of continuous-time Markov chains
Li, Pei-Sen
STATISTICS & PROBABILITY LETTERS, 2017, 125 : 17 - 24
[22] Extremal Shift Rule for Continuous-Time Zero-Sum Markov Games
Averboukh, Yurii
DYNAMIC GAMES AND APPLICATIONS, 2017, 7 (01) : 1 - 20
[23] Neural Continuous-Time Markov Models
Reeves, Majerle
Bhat, Harish S.
2023 SICE INTERNATIONAL SYMPOSIUM ON CONTROL SYSTEMS, SICE ISCS, 2023, : 76 - 83
[24] Imprecise continuous-time Markov chains
Krak, Thomas
De Bock, Jasper
Siebes, Arno
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 88 : 452 - 528
[25] Filtering of continuous-time Markov chains
Aggoun, L
Benkherouf, L
Tadj, L
MATHEMATICAL AND COMPUTER MODELLING, 1997, 26 (12) : 73 - 83
[26] Optimal Control of Probability on a Target Set for Continuous-Time Markov Chains
Ma, Chenglin
Zhao, Huaizhong
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 1202 - 1209
[27] OPTIMAL CONTROL OF A CONTINUOUS-TIME MARKOV CHAIN WITH PERIODIC TRANSITION PROBABILITIES
MARTINLOF, A
OPERATIONS RESEARCH, 1967, 15 (05) : 872 - +
[28] Continuous-time controlled Markov chains
Guo, XP
Hernández-Lerma, O
ANNALS OF APPLIED PROBABILITY, 2003, 13 (01): : 363 - 388
[29] Integrals for continuous-time Markov chains
Pollett, PK
MATHEMATICAL BIOSCIENCES, 2003, 182 (02) : 213 - 225
[30] Multimarket contact in continuous-time games
Kobayashi, Hajime
Ohta, Katsunori
ECONOMICS LETTERS, 2008, 101 (01) : 4 - 5

← 1 2 3 4 5 →