First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs

被引:0
|
作者
Yong-hui Huang
Guo Xian-ping
机构
[1] Sun Yat-Sen University,School of Mathematics and Computational Science
关键词
Semi-Markov decision processes; target set; first passage time; discounted cost; optimal policy; 90C40; 93E20;
D O I
暂无
中图分类号
学科分类号
摘要
This paper considers a first passage model for discounted semi-Markov decision processes with denumerable states and nonnegative costs. The criterion to be optimized is the expected discounted cost incurred during a first passage time to a given target set. We first construct a semi-Markov decision process under a given semi-Markov decision kernel and a policy. Then, we prove that the value function satisfies the optimality equation and there exists an optimal (or ɛ-optimal) stationary policy under suitable conditions by using a minimum nonnegative solution approach. Further we give some properties of optimal policies. In addition, a value iteration algorithm for computing the value function and optimal policies is developed and an example is given. Finally, it is showed that our model is an extension of the first passage models for both discrete-time and continuous-time Markov decision processes.
引用
收藏
页码:177 / 190
页数:13
相关论文
共 50 条
  • [1] First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
    Huang, Yong-hui
    Guo, Xian-ping
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2011, 27 (02): : 177 - 190
  • [2] Constrained discounted semi-Markov decision processes
    Feinberg, EA
    MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244
  • [3] OPTIMIZATION OF DENUMERABLE SEMI-MARKOV DECISION PROCESSES.
    Staniewski, Piotr
    Weinfeld, Roman
    Systems Science, 1980, 6 (02): : 129 - 141
  • [4] Optimal risk probability for first passage models in semi-Markov decision processes
    Huang, Yonghui
    Guo, Xianping
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2009, 359 (01) : 404 - 420
  • [5] Relations between discounted models and average models for semi-Markov decision processes
    Yin, Bao-Qun
    Li, Yan-Jie
    Tang, Hao
    Dai, Gui-Ping
    Xi, Hong-Sheng
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2006, 23 (01): : 65 - 68
  • [6] Minimizing Risk Models in Denumerable Semi-Markov Decision Processes with a Target Set
    Huang Yonghui
    Guo Xianping
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 1576 - 1581
  • [7] Mixed Markov decision processes in a semi-Markov environment with discounted criterion
    Hu, QY
    Wang, JL
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1998, 219 (01) : 1 - 20
  • [8] A SECONDARY APPROACH TO THE DISCOUNTED MODEL IN SEMI-MARKOV DECISION PROCESSES
    董泽清
    宋京生
    ScienceBulletin, 1988, (06) : 448 - 454
  • [9] DENUMERABLE UNDISCOUNTED SEMI-MARKOV DECISION-PROCESSES WITH UNBOUNDED REWARDS
    FEDERGRUEN, A
    SCHWEITZER, PJ
    TIJMS, HC
    MATHEMATICS OF OPERATIONS RESEARCH, 1983, 8 (02) : 298 - 313
  • [10] A SECONDARY APPROACH TO THE DISCOUNTED MODEL IN SEMI-MARKOV DECISION-PROCESSES
    DONG, ZQ
    SONG, JS
    KEXUE TONGBAO, 1988, 33 (06): : 448 - 454