共 50 条
MEAN-VARIANCE CRITERIA IN AN UNDISCOUNTED MARKOV DECISION-PROCESS
被引:1
|作者:
CHUNG, KJ
机构:
[1] Department of Industrial Management, National Taiwan Institute of Technology, Taipei
关键词:
MARKOV DECISION;
PARAMETRIC MARKOV DECISION;
MEAN;
VARIANCE;
RANDOMIZED POLICY;
D O I:
10.1016/0377-2217(93)90170-R
中图分类号:
C93 [管理学];
学科分类号:
12 ;
1201 ;
1202 ;
120202 ;
摘要:
In the steady state of an undiscounted Markov decision process, we consider the problem of finding an optimal stationary probability distribution that minimizes the variance of the reward in a transition among the stationary probability distributions which give a mean not less than a specified value. The problem consists of a mathematical program with linear constraints and a non-linear objective. The solution technique replaces the non-linear part of the objective with a constant, inserts the constant as a constraint, and then parametrically analyzes the resulting linear program. Three numerical examples are discussed.
引用
收藏
页码:265 / 276
页数:12
相关论文