Nonstationary Policies and Average Optimality in Multichain Markov Decision Processes with a General Action Space

Cited by: 0
Author
A. Y. Golubin
Affiliation
[1] Institute of Electronics and Mathematics,
Keywords
Decision Process; General Action; Action Space; Markov Decision Process; Average Optimality;
DOI
10.1023/B:JOTH.0000036314.29733.3d
Pages: 3733-3740
Page count: 7