Optimal stopping time on discounted semi-Markov processes

被引:0
|
作者
Fang Chen
Xianping Guo
Zhong-Wei Liao
机构
[1] Sun Yat-Sen University,School of Mathematics
[2] Beijing Normal University,College of Education for the Future
来源
关键词
Optimal stopping time; semi-Markov processes (SMPs); value function; semi-Markov decision processes (SMDPs); optimal policy; iterative algorithm; 90C40; 93E20; 60G40;
D O I
暂无
中图分类号
学科分类号
摘要
This paper attempts to study the optimal stopping time for semi-Markov processes (SMPs) under the discount optimization criteria with unbounded cost rates. In our work, we introduce an explicit construction of the equivalent semi-Markov decision processes (SMDPs). The equivalence is embodied in the expected discounted cost functions of SMPs and SMDPs, that is, every stopping time of SMPs can induce a policy of SMDPs such that the value functions are equal, and vice versa. The existence of the optimal stopping time of SMPs is proved by this equivalence relation. Next, we give the optimality equation of the value function and develop an effective iterative algorithm for computing it. Moreover, we show that the optimal and ε-optimal stopping time can be characterized by the hitting time of the special sets. Finally, to illustrate the validity of our results, an example of a maintenance system is presented in the end.
引用
收藏
页码:303 / 324
页数:21
相关论文
共 50 条
  • [21] RANDOM TIME TRANSFORMATIONS OF SEMI-MARKOV PROCESSES
    SERFOZO, RF
    ANNALS OF MATHEMATICAL STATISTICS, 1971, 42 (01): : 176 - &
  • [22] STRUCTURE OF OPTIMAL POLICIES FOR DISCOUNTED SEMI-MARKOV DECISION PROGRAMMING WITH UNBOUNDED REWARDS
    董泽清
    刘克
    数学进展, 1985, (01) : 68 - 69
  • [23] STRUCTURE OF OPTIMAL POLICIES FOR DISCOUNTED SEMI-MARKOV DECISION PROGRAMING WITH UNBOUNDED REWARDS
    董泽清
    刘克
    Science China Mathematics, 1986, (04) : 337 - 349
  • [24] Relations between discounted models and average models for semi-Markov decision processes
    Yin, Bao-Qun
    Li, Yan-Jie
    Tang, Hao
    Dai, Gui-Ping
    Xi, Hong-Sheng
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2006, 23 (01): : 65 - 68
  • [25] OPTIMAL REPLACEMENT WITH SEMI-MARKOV SHOCK MODELS USING DISCOUNTED COSTS.
    Feldman, Richard M.
    Mathematics of Operations Research, 1977, 2 (01) : 78 - 90
  • [26] ITERATIVE AGGREGATION-DISAGGREGATION PROCEDURES FOR DISCOUNTED SEMI-MARKOV REWARD PROCESSES
    SCHWEITZER, PJ
    PUTERMAN, ML
    KINDLE, KW
    OPERATIONS RESEARCH, 1985, 33 (03) : 589 - 605
  • [27] ON DISCRETE-TIME SEMI-MARKOV PROCESSES
    Pachon, Angelica
    Polito, Federico
    Ricciuti, Costantino
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES B, 2021, 26 (03): : 1499 - 1529
  • [28] STRUCTURE OF OPTIMAL POLICIES FOR DISCOUNTED SEMI-MARKOV DECISION PROGRAMMING WITH UNBOUNDED REWARDS
    DONG, ZQ
    LIU, K
    SCIENTIA SINICA SERIES A-MATHEMATICAL PHYSICAL ASTRONOMICAL & TECHNICAL SCIENCES, 1986, 29 (04): : 337 - 349
  • [29] TIME-AVERAGE OPTIMAL CONSTRAINED SEMI-MARKOV DECISION-PROCESSES
    BEUTLER, FJ
    ROSS, KW
    ADVANCES IN APPLIED PROBABILITY, 1986, 18 (02) : 341 - 359
  • [30] Multi-objective discounted semi-Markov decision processes with multiple constraints
    Wang, YH
    Zhang, S
    Zhang, JH
    PROCEEDINGS OF THE SECOND ASIAN MATHEMATICAL CONFERENCE 1995, 1998, : 551 - 555