MEAN-VARIANCE CRITERIA IN AN UNDISCOUNTED MARKOV DECISION-PROCESS

被引:1
|
作者
CHUNG, KJ
机构
[1] Department of Industrial Management, National Taiwan Institute of Technology, Taipei
关键词
MARKOV DECISION; PARAMETRIC MARKOV DECISION; MEAN; VARIANCE; RANDOMIZED POLICY;
D O I
10.1016/0377-2217(93)90170-R
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In the steady state of an undiscounted Markov decision process, we consider the problem of finding an optimal stationary probability distribution that minimizes the variance of the reward in a transition among the stationary probability distributions which give a mean not less than a specified value. The problem consists of a mathematical program with linear constraints and a non-linear objective. The solution technique replaces the non-linear part of the objective with a constant, inserts the constant as a constraint, and then parametrically analyzes the resulting linear program. Three numerical examples are discussed.
引用
收藏
页码:265 / 276
页数:12
相关论文
共 50 条