Optimal time-abstract schedulers for CTMDPs and continuous-time Markov games

被引:7
|
作者
Rabe, Markus N. [1 ]
Schewe, Sven [1 ]
机构
[1] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
基金
英国工程与自然科学研究理事会;
关键词
Continuous-time Markov decision processes; Continuous-time Markov games; Optimal control; Time-bounded reachability; BOUNDED REACHABILITY;
D O I
10.1016/j.tcs.2012.10.001
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study time-bounded reachability in continuous-time Markov decision processes (CTMDPs) and games (CTGs) for time-abstract scheduler classes. Reachability problems play a paramount role in probabilistic model checking. Consequently, their analysis has been studied intensively, and approximation techniques are well understood. From a mathematical point of view, however, the question of approximation is secondary compared to the fundamental question whether or not optimal control exists. In this article, we demonstrate the existence of optimal schedulers for the time-abstract scheduler classes for CTMDPs. For CTGs, we distinguish two cases: the simple case where both players face the same restriction to use time-abstract strategies (symmetric CTGs) and the case where one player is a completely informed adversary (asymmetric CTGs). While for the former case optimal strategies exist, we prove that for asymmetric CTGs there is not necessarily a scheduler that attains the optimum. It turns out that for CTMDPs and symmetric CTGs optimal time-abstract schedulers have an amazingly simple structure: they converge to a memoryless scheduling policy after a finite number of steps. This allows us to compute time-abstract strategies with finite memory. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 67
页数:15
相关论文
共 50 条
  • [21] Perturbations of continuous-time Markov chains
    Li, Pei-Sen
    STATISTICS & PROBABILITY LETTERS, 2017, 125 : 17 - 24
  • [22] Extremal Shift Rule for Continuous-Time Zero-Sum Markov Games
    Averboukh, Yurii
    DYNAMIC GAMES AND APPLICATIONS, 2017, 7 (01) : 1 - 20
  • [23] Neural Continuous-Time Markov Models
    Reeves, Majerle
    Bhat, Harish S.
    2023 SICE INTERNATIONAL SYMPOSIUM ON CONTROL SYSTEMS, SICE ISCS, 2023, : 76 - 83
  • [24] Imprecise continuous-time Markov chains
    Krak, Thomas
    De Bock, Jasper
    Siebes, Arno
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 88 : 452 - 528
  • [25] Filtering of continuous-time Markov chains
    Aggoun, L
    Benkherouf, L
    Tadj, L
    MATHEMATICAL AND COMPUTER MODELLING, 1997, 26 (12) : 73 - 83
  • [26] Optimal Control of Probability on a Target Set for Continuous-Time Markov Chains
    Ma, Chenglin
    Zhao, Huaizhong
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 1202 - 1209
  • [28] Continuous-time controlled Markov chains
    Guo, XP
    Hernández-Lerma, O
    ANNALS OF APPLIED PROBABILITY, 2003, 13 (01): : 363 - 388
  • [29] Integrals for continuous-time Markov chains
    Pollett, PK
    MATHEMATICAL BIOSCIENCES, 2003, 182 (02) : 213 - 225
  • [30] Multimarket contact in continuous-time games
    Kobayashi, Hajime
    Ohta, Katsunori
    ECONOMICS LETTERS, 2008, 101 (01) : 4 - 5