Markov decision processes with random horizon

Cited by: 4

Authors:

Iida, T [1]

Mori, M [1]

Affiliations:

[1] TOKYO INST TECHNOL,GRAD SCH DECIS SCI & TECHNOL,DEPT IND ENGN & MANAGEMENT,MEGURO KU,TOKYO 152,JAPAN

Source:

JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN | 1996 / Vol. 39 / No. 04

Keywords:

DOI:

10.15807/jorsj.39.592

Chinese Library Classification (CLC) number:

C93 [Management science]; O22 [Operations research];

Subject classification code:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

In this paper we formulate Markov Decision Processes with Random Horizon (MDPRH). We present the optimality equation for the MDPRH; however, optimal stationary strategies, or ε-optimal stationary strategies, may not exist for these processes. When the probability distribution of the planning horizon has infinite support, we show a Turnpike Planning Horizon Theorem. We then evaluate rolling strategies and develop an algorithm for obtaining an optimal first-stage decision. Finally, we report numerical experiments on a simple inventory model to illustrate these phenomena.

Pages: 592 - 603

Number of pages: 12

50 items in total

[21] Lexicographic refinements in possibilistic decision trees and finite-horizon Markov decision processes
Ben Amor, Nahla
El Khalfi, Zeineb
Fargier, Helene
Sabbadin, Regis
FUZZY SETS AND SYSTEMS, 2019, 366 : 85 - 109
[22] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
Ichihara, Naoyuki
APPLIED MATHEMATICS AND OPTIMIZATION, 2021, 84 (02) : 2177 - 2220
[23] A reinforcement learning based algorithm for finite horizon Markov decision processes
Bhatnagar, Shalabh
Abdulla, Mohammed Shahid
PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5519 - 5524
[24] An Approximate Stochastic Annealing Algorithm for Finite Horizon Markov Decision Processes
Hu, Jiaqiao
Chang, Hyeong Soo
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 5338 - 5343
[25] Poisoning finite-horizon Markov decision processes at design time
Caballero, William N.
Jenkins, Phillip R.
Keith, Andrew J.
COMPUTERS & OPERATIONS RESEARCH, 2021, 129
[26] Convergence of Value Functions for Finite Horizon Markov Decision Processes with Constraints
Naoyuki Ichihara
Applied Mathematics & Optimization, 2021, 84 : 2177 - 2220
[27] Pure Exploration in Episodic Fixed-Horizon Markov Decision Processes
Putta, Sudeep Raja
Tulabandhula, Theja
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1703 - 1704
[28] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
Xianping Guo
Yonghui Huang
Yi Zhang
Applied Mathematics & Optimization, 2017, 75 : 317 - 341
[29] A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes
Guin, Soumyajit
Bhatnagar, Shalabh
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3353 - 3359
[30] A FORECAST HORIZON AND A STOPPING RULE FOR GENERAL MARKOV DECISION-PROCESSES
HERNANDEZLERMA, O
LASSERRE, JB
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1988, 132 (02) : 388 - 400
