Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon

被引：6

作者：

Cruz-Suarez, Hugo ^{[1
]}

Ilhuicatzi-Roldan, Rocio ^{[1
]}

Montes-de-Oca, Raul ^{[2
]}

机构：

[1] Benemerita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Puebla, Mexico

[2] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Mexico City 09340, DF, Mexico

来源：

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS | 2014年 / 162卷 / 01期

关键词：

Markov decision process; Total cost; Random horizon; Varying-time discount factor;

D O I：

10.1007/s10957-012-0262-8

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

This paper deals with Markov Decision Processes (MDPs) on Borel spaces with possibly unbounded costs. The criterion to be optimized is the expected total cost with a random horizon of infinite support. In this paper, it is observed that this performance criterion is equivalent to the expected total discounted cost with an infinite horizon and a varying-time discount factor. Then, the optimal value function and the optimal policy are characterized through some suitable versions of the Dynamic Programming Equation. Moreover, it is proved that the optimal value function of the optimal control problem with a random horizon can be bounded from above by the optimal value function of a discounted optimal control problem with a fixed discount factor. In this case, the discount factor is defined in an adequate way by the parameters introduced for the study of the optimal control problem with a random horizon. To illustrate the theory developed, a version of the Linear-Quadratic model with a random horizon and a Logarithm Consumption-Investment model are presented.

引用

页码：329 / 346

页数：18

共 50 条

[1] Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon
Hugo Cruz-Suárez
Rocio Ilhuicatzi-Roldán
Raúl Montes-de-Oca
Journal of Optimization Theory and Applications, 2014, 162 : 329 - 346
[2] Value Iteration for Average Cost Markov Decision Processes in Borel Spaces
Zhu, Quanxin
Guo, Xianping
APPLIED MATHEMATICS RESEARCH EXPRESS, 2005, (02) : 61 - 76
[3] Markov decision processes with random horizon
Iida, T
Mori, M
JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 1996, 39 (04) : 592 - 603
[4] Constrained discounted Markov decision processes with Borel state spaces
Feinberg, Eugene A.
Jaskiewicz, Anna
Nowak, Andrzej S.
AUTOMATICA, 2020, 111
[5] Weakly Coupled Constrained Markov Decision Processes in Borel Spaces
Gagrani, Mukul
Nayyar, Ashutosh
2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2790 - 2795
[6] AVERAGE COST OPTIMALITY INEQUALITY FOR MARKOV DECISION PROCESSES WITH BOREL SPACES AND UNIVERSALLY MEASURABLE POLICIES
Yu, Huizhen
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2020, 58 (04) : 2469 - 2502
[7] DISCOUNTED COST MARKOV DECISION-PROCESSES ON BOREL SPACES - THE LINEAR-PROGRAMMING FORMULATION
HERNANDEZLERMA, O
HERNANDEZHERNANDEZ, D
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1994, 183 (02) : 335 - 351
[8] Constrained average cost Markov control processes in Borel spaces
Hernández-Lerma, O
González-Hernández, J
López-Martínez, RR
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 42 (02) : 442 - 468
[9] On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces
Saldi, Naci
Yuksel, Serdar
Linder, Tamas
MATHEMATICS OF OPERATIONS RESEARCH, 2017, 42 (04) : 945 - 978
[10] Policy iteration for average cost Markov control processes on Borel spaces
HernandezLerma, O
Lasserre, JB
ACTA APPLICANDAE MATHEMATICAE, 1997, 47 (02) : 125 - 154

← 1 2 3 4 5 →