First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs

被引：0

作者：

Yong-hui Huang

Guo Xian-ping

机构：

[1] Sun Yat-Sen University,School of Mathematics and Computational Science

来源：

Acta Mathematicae Applicatae Sinica, English Series | 2011年 / 27卷

关键词：

Semi-Markov decision processes; target set; first passage time; discounted cost; optimal policy; 90C40; 93E20;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper considers a first passage model for discounted semi-Markov decision processes with denumerable states and nonnegative costs. The criterion to be optimized is the expected discounted cost incurred during a first passage time to a given target set. We first construct a semi-Markov decision process under a given semi-Markov decision kernel and a policy. Then, we prove that the value function satisfies the optimality equation and there exists an optimal (or ɛ-optimal) stationary policy under suitable conditions by using a minimum nonnegative solution approach. Further we give some properties of optimal policies. In addition, a value iteration algorithm for computing the value function and optimal policies is developed and an example is given. Finally, it is showed that our model is an extension of the first passage models for both discrete-time and continuous-time Markov decision processes.

引用

页码：177 / 190

页数：13

共 50 条

[41] SEMI-MARKOV DECISION PROCESSES WITH UNBOUNDED REWARDS
LIPPMAN, SA
MANAGEMENT SCIENCE SERIES A-THEORY, 1973, 19 (07): : 717 - 731
[42] AVERAGE COST SEMI-MARKOV DECISION PROCESSES
ROSS, SM
JOURNAL OF APPLIED PROBABILITY, 1970, 7 (03) : 649 - &
[43] FINITE-STATE APPROXIMATIONS FOR DENUMERABLE STATE DISCOUNTED MARKOV DECISION-PROCESSES
CAVAZOSCADENA, R
APPLIED MATHEMATICS AND OPTIMIZATION, 1986, 14 (01): : 1 - 26
[44] Towards Analysis of Semi-Markov Decision Processes
Chen, Taolue
Lu, Jian
ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2010, 6319 : 41 - +
[45] Semi-markov decision processes nonstandard criteria
Baykal-Guersoy, M.
Guersoy, K.
PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2007, 21 (04) : 635 - 657
[46] Comparing first-passage times for semi-Markov skip-free processes
Stat Probab Lett, 3 (247):
[47] Comparing first-passage times for semi-Markov skip-free processes
DiCrescenzo, A
Ricciardi, LM
STATISTICS & PROBABILITY LETTERS, 1996, 30 (03) : 247 - 256
[48] COMPUTING THE DISCOUNTED RETURN IN MARKOV AND SEMI-MARKOV CHAINS
PORTEUS, EL
NAVAL RESEARCH LOGISTICS, 1981, 28 (04) : 567 - 577
[49] Multistate models, flowgraph models, and semi-Markov processes
Huzurbazar, AV
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2004, 33 (03) : 457 - 474
[50] Second Order Optimality in Markov and Semi-Markov Decision Processes
Sladky, Karel
37TH INTERNATIONAL CONFERENCE ON MATHEMATICAL METHODS IN ECONOMICS 2019, 2019, : 338 - 343

← 1 2 3 4 5 →