Maximal Cost-Bounded Reachability Probability on Continuous-Time Markov Decision Processes

被引:0
|
作者
Fu, Hongfei [1 ]
机构
[1] Rhein Westfal TH Aachen, Lehrstuhl Informat 2, Aachen, Germany
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we consider multi-dimensional maximal costbounded reachability probability over continuous-time Markov decision processes (CTMDPs). Our major contributions are as follows. Firstly, we derive an integral characterization which states that the maximal cost-bounded reachability probability function is the least fixed-point of a system of integral equations. Secondly, we prove that the maximal cost-bounded reachability probability can be attained by a measurable deterministic cost-positional scheduler. Thirdly, we provide a numerical approximation algorithm for maximal cost-bounded reachability probability. We present these results under the setting of both early and late schedulers. Besides, we correct a fundamental proof error in the PhD Thesis by Martin Neuhaufier on maximal time-bounded reachability probability by completely new proofs for the more general case of multi-dimensional maximal cost-bounded reachability probability.
引用
收藏
页码:73 / 87
页数:15
相关论文
共 50 条
  • [1] Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes
    Baier, C
    Haverkort, B
    Hermanns, H
    Katoen, JP
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2004, 2988 : 61 - 76
  • [2] Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes
    Baier, C
    Hermanns, H
    Katoen, JP
    Haverkort, BR
    THEORETICAL COMPUTER SCIENCE, 2005, 345 (01) : 2 - 26
  • [3] The risk probability criterion for discounted continuous-time Markov decision processes
    Huo, Haifeng
    Zou, Xiaolong
    Guo, Xianping
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2017, 27 (04): : 675 - 699
  • [4] The risk probability criterion for discounted continuous-time Markov decision processes
    Haifeng Huo
    Xiaolong Zou
    Xianping Guo
    Discrete Event Dynamic Systems, 2017, 27 : 675 - 699
  • [5] Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes
    Wu, Bo
    Ahmadi, Mohamadreza
    Bharadwaj, Suda
    Topcu, Ufuk
    2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 1216 - 1223
  • [6] Policy Learning for Time-Bounded Reachability in Continuous-Time Markov Decision Processes via Doubly-Stochastic Gradient Ascent
    Bartocci, Ezio
    Bortolussi, Luca
    Brazdil, Tomas
    Milios, Dimitrios
    Sanguinetti, Guido
    QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2016, 2016, 9826 : 244 - 259
  • [7] ABSORBING CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH TOTAL COST CRITERIA
    Guo, Xianping
    Vykertas, Mantas
    Zhang, Yi
    ADVANCES IN APPLIED PROBABILITY, 2013, 45 (02) : 490 - 519
  • [8] Constrained optimization for average cost continuous-time markov decision processes
    Guo, Xianping
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (06) : 1139 - 1143
  • [9] Risk Probability Minimization Problems for Continuous-Time Markov Decision Processes on Finite Horizon
    Huo, Haifeng
    Guo, Xianping
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (07) : 3199 - 3206
  • [10] The Transformation Method for Continuous-Time Markov Decision Processes
    Piunovskiy, Alexey
    Zhang, Yi
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (02) : 691 - 712