Markov decision processes with random horizon

Cited by: 4
Authors
Iida, T [1]
Mori, M [1]
Affiliation
[1] TOKYO INST TECHNOL, GRAD SCH DECIS SCI & TECHNOL, DEPT IND ENGN & MANAGEMENT, MEGURO KU, TOKYO 152, JAPAN
DOI
10.15807/jorsj.39.592
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Discipline classification code
070105; 12; 1201; 1202; 120202
Abstract
In this paper we formulate Markov Decision Processes with Random Horizon (MDPRH). We derive the optimality equation for the MDPRH; however, neither optimal stationary strategies nor ε-optimal stationary strategies need exist for these processes. When the probability distribution of the planning horizon has infinite support, we prove a Turnpike Planning Horizon Theorem. We then evaluate rolling strategies and develop an algorithm for obtaining an optimal first-stage decision. Finally, numerical experiments on a simple inventory model illustrate these phenomena.
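
To make the idea of a random planning horizon concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes a finite state and action space, a horizon N that is independent of the process, and a horizon distribution with finite support (the abstract's turnpike result concerns the infinite-support case, which this sketch does not cover). Under those assumptions, backward induction weights the continuation value at each stage by the conditional probability that the process survives one more stage. The function name `solve_random_horizon_mdp` and its arguments are hypothetical.

```python
def solve_random_horizon_mdp(rewards, transitions, horizon_pmf):
    """Backward induction for a finite MDP with a random, finite-support horizon.

    Assumptions (illustrative, not taken from the paper):
      rewards[s][a]          immediate reward for action a in state s
      transitions[s][a][s2]  probability of moving from s to s2 under a
      horizon_pmf[n - 1]     P(N = n) for n = 1..T, with N independent of the process

    Returns (values, policy): values[t][s] is the optimal expected remaining
    reward at stage t (0-indexed), given that stage t is reached.
    """
    T = len(horizon_pmf)
    n_states = len(rewards)
    # survival[t] = P(N >= t + 1): probability that stage t is played at all.
    survival = [sum(horizon_pmf[t:]) for t in range(T)]
    values = [[0.0] * n_states for _ in range(T + 1)]
    policy = [[0] * n_states for _ in range(T)]

    for t in range(T - 1, -1, -1):
        # Conditional probability of reaching stage t + 1 given stage t was reached.
        if t + 1 < T and survival[t] > 0:
            cont = survival[t + 1] / survival[t]
        else:
            cont = 0.0
        for s in range(n_states):
            best_a, best_q = 0, float("-inf")
            for a in range(len(rewards[s])):
                expected_next = sum(
                    p * values[t + 1][s2]
                    for s2, p in enumerate(transitions[s][a])
                )
                q = rewards[s][a] + cont * expected_next
                if q > best_q:
                    best_a, best_q = a, q
            values[t][s] = best_q
            policy[t][s] = best_a
    return values, policy
```

Note that the resulting policy generally depends on the stage t, because the continuation weight changes from stage to stage; this is consistent with the abstract's remark that optimal stationary strategies may fail to exist.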
Pages: 592-603
Number of pages: 12