Markov decision processes with random horizon

被引：4

作者：

Iida, T ^{[1
]}

Mori, M ^{[1
]}

机构：

[1] TOKYO INST TECHNOL,GRAD SCH DECIS SCI & TECHNOL,DEPT IND ENGN & MANAGEMENT,MEGURO KU,TOKYO 152,JAPAN

来源：

JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN | 1996年 / 39卷 / 04期

关键词：

D O I：

10.15807/jorsj.39.592

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

In this paper we formulate Markov Decision Processes with Random Horizon (MDPRH). We show the optimality equation for the MDPRH, however there may not exist optimal stationary strategies, or E-optimal stationary strategies for the processes. When the MDPRH has the probability distribution for the planning horizon with infinite support, we show Turnpike Planning Horizon Theorem. Then we evaluate rolling strategies and develop an algorithm obtaining an optimal first stage decision. Finally, some numerical experiments on a simple inventory model are done to understand the phenomena.

引用

页码：592 / 603

页数：12

共 50 条

[31] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
Guo, Xianping
Huang, Yonghui
Zhang, Yi
APPLIED MATHEMATICS AND OPTIMIZATION, 2017, 75 (02): : 317 - 341
[32] Finite horizon semi-Markov decision processes with multiple constraints
Huang, Yonghui
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1761 - 1768
[33] LIFE IS RANDOM, TIME IS NOT: MARKOV DECISION PROCESSES WITH WINDOW OBJECTIVES
Brihaye, Thomas
Delgrange, Florent
Randour, Mickael
Oualhadj, Youssouf
LOGICAL METHODS IN COMPUTER SCIENCE, 2020, 16 (04) : 1 - 13
[34] On Supervised Online Rolling-Horizon Control for Infinite-Horizon Discounted Markov Decision Processes
Chang, Hyeong Soo
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 1060 - 1065
[35] INFINITE HORIZON MARKOV DECISION-PROCESSES WITH UNKNOWN OR VARIABLE DISCOUNT FACTORS
WHITE, DJ
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1987, 28 (01) : 96 - 100
[36] Fast Approximate Dynamic Programming for Infinite-Horizon Markov Decision Processes
Kolarijani, M. A. S.
Max, G. F.
Esfahani, P. Mohajerin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[37] FINITE STATE CONTINUOUS TIME MARKOV DECISION PROCESSES WITH AN INFINITE PLANNING HORIZON
MILLER, BL
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1968, 22 (03) : 552 - &
[38] Approximate stochastic annealing for online control of infinite horizon Markov decision processes
Hu, Jiaqiao
Chang, Hyeong Soo
AUTOMATICA, 2012, 48 (09) : 2182 - 2188
[39] Non-Stationary Semi-Markov Decision Processes on a Finite Horizon
Ghosh, Mrinal K.
Saha, Subhamay
STOCHASTIC ANALYSIS AND APPLICATIONS, 2013, 31 (01) : 183 - 190
[40] Risk probability optimization of finite horizon piecewise deterministic Markov decision processes
Huo, Haifeng
Wen, Xian
OPTIMIZATION, 2024,

← 1 2 3 4 5 →