Markov Decision Processes on Borel Spaces with Total Cost and Random Horizon

被引:6
|
作者
Cruz-Suarez, Hugo [1 ]
Ilhuicatzi-Roldan, Rocio [1 ]
Montes-de-Oca, Raul [2 ]
机构
[1] Benemerita Univ Autonoma Puebla, Fac Ciencias Fis Matemat, Puebla, Mexico
[2] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Mexico City 09340, DF, Mexico
关键词
Markov decision process; Total cost; Random horizon; Varying-time discount factor;
D O I
10.1007/s10957-012-0262-8
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper deals with Markov Decision Processes (MDPs) on Borel spaces with possibly unbounded costs. The criterion to be optimized is the expected total cost with a random horizon of infinite support. In this paper, it is observed that this performance criterion is equivalent to the expected total discounted cost with an infinite horizon and a varying-time discount factor. Then, the optimal value function and the optimal policy are characterized through some suitable versions of the Dynamic Programming Equation. Moreover, it is proved that the optimal value function of the optimal control problem with a random horizon can be bounded from above by the optimal value function of a discounted optimal control problem with a fixed discount factor. In this case, the discount factor is defined in an adequate way by the parameters introduced for the study of the optimal control problem with a random horizon. To illustrate the theory developed, a version of the Linear-Quadratic model with a random horizon and a Logarithm Consumption-Investment model are presented.
引用
收藏
页码:329 / 346
页数:18
相关论文
共 50 条
  • [21] ANOTHER SET OF VERIFIABLE CONDITIONS FOR AVERAGE MARKOV DECISION PROCESSES WITH BOREL SPACES
    Zou, Xiaolong
    Guo, Xianping
    KYBERNETIKA, 2015, 51 (02) : 276 - 292
  • [22] Constrained Markov decision processes in Borel spaces: from discounted to average optimality
    Armando F. Mendoza-Pérez
    Héctor Jasso-Fuentes
    Omar A. De-la-Cruz Courtois
    Mathematical Methods of Operations Research, 2016, 84 : 489 - 525
  • [24] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
    Qingda Wei
    Xianping Guo
    Journal of Optimization Theory and Applications, 2012, 153 : 709 - 732
  • [25] Finite-State Approximation of Markov Decision Processes with Unbounded Costs and Borel Spaces
    Saldi, Naci
    Yuksel, Serdar
    Linder, Tumas
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 5085 - 5090
  • [26] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
    Wei, Qingda
    Guo, Xianping
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 153 (03) : 709 - 732
  • [27] CONSTRAINED OPTIMALITY PROBLEM OF MARKOV DECISION PROCESSES WITH BOREL SPACES AND VARYING DISCOUNT FACTORS
    Wu, Xiao
    Tang, Yanqiu
    KYBERNETIKA, 2021, 57 (02) : 295 - 311
  • [28] MARKOV DECISION PROCESSES ON FINITE SPACES WITH FUZZY TOTAL REWARDS
    Carrero-Vera, Karla
    Cruz-Suarez, Hugo
    Montes-de-Oca, Raul
    KYBERNETIKA, 2022, 58 (02) : 180 - 199
  • [29] A Consumption and Investment Problem via a Markov Decision Processes Approach with Random Horizon
    Perez, Octavio Paredes
    Guevara, Victor HugoVazquez
    Cruz-Suarez, Hugo
    ADVANCES IN OPERATIONS RESEARCH, 2022, 2022
  • [30] MARKOV DECISION PROCESSES WITH TIME-VARYING DISCOUNT FACTORS AND RANDOM HORIZON
    Ilhuicatzi-Roldan, Rocio
    Cruz-Suarez, Hugo
    Chavez-Rodriguez, Selene
    KYBERNETIKA, 2017, 53 (01) : 82 - 98