Markov decision processes with random horizon

被引:4
|
作者
Iida, T [1 ]
Mori, M [1 ]
机构
[1] TOKYO INST TECHNOL,GRAD SCH DECIS SCI & TECHNOL,DEPT IND ENGN & MANAGEMENT,MEGURO KU,TOKYO 152,JAPAN
关键词
D O I
10.15807/jorsj.39.592
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper we formulate Markov Decision Processes with Random Horizon (MDPRH). We show the optimality equation for the MDPRH, however there may not exist optimal stationary strategies, or E-optimal stationary strategies for the processes. When the MDPRH has the probability distribution for the planning horizon with infinite support, we show Turnpike Planning Horizon Theorem. Then we evaluate rolling strategies and develop an algorithm obtaining an optimal first stage decision. Finally, some numerical experiments on a simple inventory model are done to understand the phenomena.
引用
收藏
页码:592 / 603
页数:12
相关论文
共 50 条
  • [21] Lexicographic refinements in possibilistic decision trees and finite-horizon Markov decision processes
    Ben Amor, Nahla
    El Khalfi, Zeineb
    Fargier, Helene
    Sabbadin, Regis
    FUZZY SETS AND SYSTEMS, 2019, 366 : 85 - 109
  • [22] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
    Ichihara, Naoyuki
    APPLIED MATHEMATICS AND OPTIMIZATION, 2021, 84 (02): : 2177 - 2220
  • [23] A reinforcement learning based algorithm for finite horizon Markov decision processes
    Bhatnagar, Shalabh
    Abdulla, Mohammed Shahid
    PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5519 - 5524
  • [24] An Approximate Stochastic Annealing Algorithm for Finite Horizon Markov Decision Processes
    Hu, Jiaqiao
    Chang, Hyeong Soo
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 5338 - 5343
  • [25] Poisoning finite-horizon Markov decision processes at design time
    Caballero, William N.
    Jenkins, Phillip R.
    Keith, Andrew J.
    COMPUTERS & OPERATIONS RESEARCH, 2021, 129
  • [26] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
    Naoyuki Ichihara
    Applied Mathematics & Optimization, 2021, 84 : 2177 - 2220
  • [27] Pure Exploration in Episodic Fixed-Horizon Markov Decision Processes
    Putta, Sudeep Raja
    Tulabandhula, Theja
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1703 - 1704
  • [28] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
    Xianping Guo
    Yonghui Huang
    Yi Zhang
    Applied Mathematics & Optimization, 2017, 75 : 317 - 341
  • [29] A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes
    Guin, Soumyajit
    Bhatnagar, Shalabh
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3353 - 3359
  • [30] A FORECAST HORIZON AND A STOPPING RULE FOR GENERAL MARKOV DECISION-PROCESSES
    HERNANDEZLERMA, O
    LASSERRE, JB
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1988, 132 (02) : 388 - 400