Optimal time-abstract schedulers for CTMDPs and continuous-time Markov games

Cited by: 7
Authors
Rabe, Markus N. [1 ]
Schewe, Sven [1 ]
Affiliations
[1] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Continuous-time Markov decision processes; Continuous-time Markov games; Optimal control; Time-bounded reachability; Bounded reachability
DOI
10.1016/j.tcs.2012.10.001
Chinese Library Classification
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
We study time-bounded reachability in continuous-time Markov decision processes (CTMDPs) and games (CTGs) for time-abstract scheduler classes. Reachability problems play a paramount role in probabilistic model checking. Consequently, their analysis has been studied intensively, and approximation techniques are well understood. From a mathematical point of view, however, the question of approximation is secondary compared to the fundamental question whether or not optimal control exists. In this article, we demonstrate the existence of optimal schedulers for the time-abstract scheduler classes for CTMDPs. For CTGs, we distinguish two cases: the simple case where both players face the same restriction to use time-abstract strategies (symmetric CTGs) and the case where one player is a completely informed adversary (asymmetric CTGs). While for the former case optimal strategies exist, we prove that for asymmetric CTGs there is not necessarily a scheduler that attains the optimum. It turns out that for CTMDPs and symmetric CTGs optimal time-abstract schedulers have an amazingly simple structure: they converge to a memoryless scheduling policy after a finite number of steps. This allows us to compute time-abstract strategies with finite memory. (C) 2012 Elsevier B.V. All rights reserved.
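The step-dependent behaviour described in the abstract can be illustrated with a small sketch. It is not the authors' construction: it is a truncated backward induction over Poisson-weighted step counts, which only approximates the optimum, and it assumes a uniform CTMDP (one exit rate for all states) and schedulers that observe the number of steps taken but not the time. The function names, the truncation depth `n_max`, and the three-state toy model are all hypothetical.

```python
import math


def poisson_weights(rate, horizon, n_max):
    """Poisson probabilities psi(n), n = 0..n_max, with mean rate * horizon."""
    mean = rate * horizon
    weights = [math.exp(-mean)]
    for n in range(1, n_max + 1):
        weights.append(weights[-1] * mean / n)
    return weights


def bounded_reachability(transitions, goal, rate, horizon, n_max):
    """Approximate the best probability of reaching `goal` within `horizon`.

    transitions[s][a] maps successor states to probabilities.
    Goal states are treated as absorbing. Steps beyond n_max are ignored,
    so the result is a lower approximation (assumption of this sketch).
    Returns (value at step 0 per state, list of per-step action choices).
    """
    psi = poisson_weights(rate, horizon, n_max)
    # tail[k] = psi[k] + ... + psi[n_max]: weight of being in the goal from step k on
    tail = [0.0] * (n_max + 2)
    for k in range(n_max, -1, -1):
        tail[k] = tail[k + 1] + psi[k]

    states = list(transitions)
    value = {s: (tail[n_max] if s in goal else 0.0) for s in states}
    policy = []
    for k in range(n_max - 1, -1, -1):
        new_value, choice = {}, {}
        for s in states:
            if s in goal:
                new_value[s] = tail[k]
                continue
            best_action, best_value = None, -1.0
            for action, successors in transitions[s].items():
                v = sum(p * value[t] for t, p in successors.items())
                if v > best_value:
                    best_action, best_value = action, v
            new_value[s], choice[s] = best_value, best_action
        value = new_value
        policy.append(choice)
    policy.reverse()  # policy[k] is the choice after k steps
    return value, policy


# Hypothetical toy model: "safe" may loop, "risky" may fail for good.
model = {
    "s": {"safe": {"goal": 0.5, "s": 0.5}, "risky": {"goal": 0.7, "fail": 0.3}},
    "fail": {"stay": {"fail": 1.0}},
    "goal": {},
}
value, policy = bounded_reachability(model, {"goal"}, rate=1.0, horizon=10.0, n_max=60)
print(value["s"], policy[0]["s"], policy[-1]["s"])
```

In this toy model the computed choice in state `s` is `safe` at the first step and `risky` at the last computed step, which is the kind of step-dependent decision a time-abstract scheduler can make.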
Pages: 53-67
Page count: 15
Related papers
50 in total
  • [1] Optimal Time-Abstract Schedulers for CTMDPs and Markov Games
    Rabe, Markus
    Schewe, Sven
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2010, (28): : 144 - 158
  • [2] Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games
    Markus N. Rabe
    Sven Schewe
    Acta Informatica, 2011, 48
  • [3] Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games
    Rabe, Markus N.
    Schewe, Sven
    ACTA INFORMATICA, 2011, 48 (5-6) : 291 - 315
  • [4] Efficient approximation of optimal control for continuous-time Markov games
    Fearnley, John
    Rabe, Markus N.
    Schewe, Sven
    Zhang, Lijun
    INFORMATION AND COMPUTATION, 2016, 247 : 106 - 129
  • [5] Efficient Approximation of Optimal Control for Continuous-Time Markov Games
    Fearnley, John
    Rabe, Markus
    Schewe, Sven
    Zhang, Lijun
    IARCS ANNUAL CONFERENCE ON FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE (FSTTCS 2011), 2011, 13 : 399 - 410
  • [6] A characterization of meaningful schedulers for continuous-time Markov decision processes
    Wolovick, Nicolas
    Johr, Sven
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, 2006, 4202 : 352 - 367
  • [7] Continuous-Time Markov Games with Asymmetric Information
    Gensbittel, Fabien
    DYNAMIC GAMES AND APPLICATIONS, 2019, 9 (03) : 671 - 699
  • [8] Continuous-Time Markov Games with Asymmetric Information
    Fabien Gensbittel
    Dynamic Games and Applications, 2019, 9 : 671 - 699
  • [9] Optimal control for continuous-time Markov jump systems
    Engineering College, Air Force Engineering University, Xi'an 710038, China
    Kongzhi yu Juece/Control and Decision, 2013, (3): 396 - 401
  • [10] Continuous-time stochastic games
    Neyman, Abraham
    GAMES AND ECONOMIC BEHAVIOR, 2017, 104 : 92 - 130