Maximal Cost-Bounded Reachability Probability on Continuous-Time Markov Decision Processes

被引：0

作者：

Fu, Hongfei ^{[1
]}

机构：

[1] Rhein Westfal TH Aachen, Lehrstuhl Informat 2, Aachen, Germany

来源：

FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATION STRUCTURES | 2014年 / 8412卷

关键词：

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we consider multi-dimensional maximal costbounded reachability probability over continuous-time Markov decision processes (CTMDPs). Our major contributions are as follows. Firstly, we derive an integral characterization which states that the maximal cost-bounded reachability probability function is the least fixed-point of a system of integral equations. Secondly, we prove that the maximal cost-bounded reachability probability can be attained by a measurable deterministic cost-positional scheduler. Thirdly, we provide a numerical approximation algorithm for maximal cost-bounded reachability probability. We present these results under the setting of both early and late schedulers. Besides, we correct a fundamental proof error in the PhD Thesis by Martin Neuhaufier on maximal time-bounded reachability probability by completely new proofs for the more general case of multi-dimensional maximal cost-bounded reachability probability.

引用

页码：73 / 87

页数：15

共 50 条

[1] Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes
Baier, C
Haverkort, B
Hermanns, H
Katoen, JP
TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2004, 2988 : 61 - 76
[2] Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes
Baier, C
Hermanns, H
Katoen, JP
Haverkort, BR
THEORETICAL COMPUTER SCIENCE, 2005, 345 (01) : 2 - 26
[3] The risk probability criterion for discounted continuous-time Markov decision processes
Huo, Haifeng
Zou, Xiaolong
Guo, Xianping
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2017, 27 (04): : 675 - 699
[4] The risk probability criterion for discounted continuous-time Markov decision processes
Haifeng Huo
Xiaolong Zou
Xianping Guo
Discrete Event Dynamic Systems, 2017, 27 : 675 - 699
[5] Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes
Wu, Bo
Ahmadi, Mohamadreza
Bharadwaj, Suda
Topcu, Ufuk
2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 1216 - 1223
[6] Policy Learning for Time-Bounded Reachability in Continuous-Time Markov Decision Processes via Doubly-Stochastic Gradient Ascent
Bartocci, Ezio
Bortolussi, Luca
Brazdil, Tomas
Milios, Dimitrios
Sanguinetti, Guido
QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2016, 2016, 9826 : 244 - 259
[7] ABSORBING CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH TOTAL COST CRITERIA
Guo, Xianping
Vykertas, Mantas
Zhang, Yi
ADVANCES IN APPLIED PROBABILITY, 2013, 45 (02) : 490 - 519
[8] Constrained optimization for average cost continuous-time markov decision processes
Guo, Xianping
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (06) : 1139 - 1143
[9] Risk Probability Minimization Problems for Continuous-Time Markov Decision Processes on Finite Horizon
Huo, Haifeng
Guo, Xianping
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (07) : 3199 - 3206
[10] The Transformation Method for Continuous-Time Markov Decision Processes
Piunovskiy, Alexey
Zhang, Yi
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (02) : 691 - 712

← 1 2 3 4 5 →