First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs

Cited by: 0
Authors
Yong-hui Huang
Guo Xian-ping
Affiliation
[1] Sun Yat-Sen University, School of Mathematics and Computational Science
Keywords
Semi-Markov decision processes; target set; first passage time; discounted cost; optimal policy; MSC 90C40; MSC 93E20
DOI
Not available
Abstract
This paper considers a first passage model for discounted semi-Markov decision processes with denumerable states and nonnegative costs. The criterion to be optimized is the expected discounted cost incurred during a first passage time to a given target set. We first construct a semi-Markov decision process under a given semi-Markov decision kernel and a policy. Then, using a minimum nonnegative solution approach, we prove that the value function satisfies the optimality equation and that an optimal (or ε-optimal) stationary policy exists under suitable conditions. Further, we give some properties of optimal policies. In addition, a value iteration algorithm for computing the value function and optimal policies is developed, and an example is given. Finally, it is shown that our model is an extension of the first passage models for both discrete-time and continuous-time Markov decision processes.
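To make the value iteration idea in the abstract concrete, here is a minimal Python sketch on a hypothetical finite toy model. It is not the paper's algorithm or example. The exponential sojourn times, the constant cost rates, the function name `first_passage_value_iteration`, and all numbers are assumptions chosen for illustration. The iteration starts from zero, so for nonnegative costs the iterates increase toward the smallest nonnegative fixed point, in the spirit of the minimum nonnegative solution approach.

```python
import numpy as np

def first_passage_value_iteration(P, rate, cost, target, alpha,
                                  tol=1e-10, max_iter=10_000):
    """Value iteration for a toy first passage discounted-cost model.

    Assumptions (illustrative only, not taken from the paper):
      P[a, i, j] : embedded transition probability i -> j under action a
      rate[a, i] : rate of an exponential sojourn time in state i under a
      cost[a, i] : nonnegative cost rate paid while sojourning in state i
      target     : set of target states, where the cost accumulation stops
      alpha      : discount rate (> 0)
    """
    n_states = P.shape[1]
    outside = np.ones(n_states, dtype=bool)
    outside[list(target)] = False            # False on the target set

    # For T ~ Exp(rate): E[exp(-alpha*T)] = rate / (rate + alpha), and the
    # expected discounted cost over one sojourn is cost / (rate + alpha).
    discount = rate / (rate + alpha)
    stage_cost = cost / (rate + alpha)

    V = np.zeros(n_states)                   # start from the zero function
    for _ in range(max_iter):
        Q = stage_cost + discount * (P @ V)  # shape (n_actions, n_states)
        V_new = np.where(outside, Q.min(axis=0), 0.0)  # zero on the target
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break

    Q = stage_cost + discount * (P @ V)
    policy = Q.argmin(axis=0)                # greedy stationary policy
    return V, policy

# Hypothetical 3-state chain 0 -> 1 -> 2 with target set {2} and two actions:
# action 0 is slow (rate 1, cost rate 1), action 1 is fast (rate 3, cost rate 1.2).
step = np.array([[0.0, 1.0, 0.0],
                 [0.0, 0.0, 1.0],
                 [0.0, 0.0, 1.0]])
P = np.stack([step, step])
rate = np.array([[1.0, 1.0, 1.0],
                 [3.0, 3.0, 3.0]])
cost = np.array([[1.0, 1.0, 1.0],
                 [1.2, 1.2, 1.2]])

V, policy = first_passage_value_iteration(P, rate, cost, target={2}, alpha=1.0)
print(V, policy)
```

The entries of `policy` at target states carry no meaning, because the value there is fixed at zero. A denumerable state space would need truncation or a different representation, which this sketch does not attempt.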
Pages: 177-190
Number of pages: 13
Related papers
50 in total
  • [21] MEAN-VARIANCE OPTIMALITY FOR SEMI-MARKOV DECISION PROCESSES UNDER FIRST PASSAGE CRITERIA
    Huang, Xiangxiang
    Huang, Yonghui
    KYBERNETIKA, 2017, 53 (01) : 59 - 81
  • [22] A minimization problem of the risk probability in first passage semi-Markov decision processes with loss rates
    HUANG XiangXiang
    ZOU XiaoLong
    GUO XianPing
    Science China (Mathematics), 2015, 58 (09) : 1923 - 1938
  • [23] A minimization problem of the risk probability in first passage semi-Markov decision processes with loss rates
    XiangXiang Huang
    XiaoLong Zou
    XianPing Guo
    Science China Mathematics, 2015, 58 : 1923 - 1938
  • [24] CONTINUITY OF MEAN RECURRENCE TIMES IN DENUMERABLE SEMI-MARKOV PROCESSES
    DEPPE, H
    ZEITSCHRIFT FUR WAHRSCHEINLICHKEITSTHEORIE UND VERWANDTE GEBIETE, 1985, 69 (04) : 581 - 592
  • [25] TRUNCATION APPROXIMATION OF LIMIT PROBABILITIES FOR DENUMERABLE SEMI-MARKOV PROCESSES
    TWEEDIE, RL
    JOURNAL OF APPLIED PROBABILITY, 1975, 12 (01) : 161 - 163
  • [26] Optimal stopping time on discounted semi-Markov processes
    Chen, Fang
    Guo, Xianping
    Liao, Zhong-Wei
    FRONTIERS OF MATHEMATICS IN CHINA, 2021, 16 (02) : 303 - 324
  • [27] Optimal stopping time on discounted semi-Markov processes
    Fang Chen
    Xianping Guo
    Zhong-Wei Liao
    Frontiers of Mathematics in China, 2021, 16 : 303 - 324
  • [28] Bayesian nonparametric estimation of first passage distributions in semi-Markov processes
    Warr, Richard L.
    Woodfield, Travis B.
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2020, 36 (02) : 237 - 250
  • [29] Discounted Markov decision processes with fuzzy costs
    Abdellatif Semmouri
    Mostafa Jourhmane
    Zineb Belhallaj
    Annals of Operations Research, 2020, 295 : 769 - 786
  • [30] Discounted Markov decision processes with fuzzy costs
    Semmouri, Abdellatif
    Jourhmane, Mostafa
    Belhallaj, Zineb
    ANNALS OF OPERATIONS RESEARCH, 2020, 295 (02) : 769 - 786