Constrained semi-markov decision processes with average rewards

被引:2
|
作者
Feinberg, E.A.
机构
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] Semi-Markov decision processes with limiting ratio average rewards
    Sinha, Sagnik
    Mondal, Prasenjit
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2017, 455 (01) : 864 - 871
  • [2] SEMI-MARKOV DECISION PROCESSES WITH UNBOUNDED REWARDS
    LIPPMAN, SA
    MANAGEMENT SCIENCE SERIES A-THEORY, 1973, 19 (07): : 717 - 731
  • [3] AVERAGE COST SEMI-MARKOV DECISION PROCESSES
    ROSS, SM
    JOURNAL OF APPLIED PROBABILITY, 1970, 7 (03) : 649 - &
  • [4] TIME-AVERAGE OPTIMAL CONSTRAINED SEMI-MARKOV DECISION-PROCESSES
    BEUTLER, FJ
    ROSS, KW
    ADVANCES IN APPLIED PROBABILITY, 1986, 18 (02) : 341 - 359
  • [5] Constrained discounted semi-Markov decision processes
    Feinberg, EA
    MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244
  • [6] Constrained semi-Markov decision processes with ratio and time expected average criteria in Polish spaces
    Wei, Qingda
    Guo, Xianping
    OPTIMIZATION, 2015, 64 (07) : 1593 - 1623
  • [7] Average Reward Reinforcement Learning for Semi-Markov Decision Processes
    Yang, Jiayuan
    Li, Yanjie
    Chen, Haoyao
    Li, Jiangang
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
  • [8] DENUMERABLE UNDISCOUNTED SEMI-MARKOV DECISION-PROCESSES WITH UNBOUNDED REWARDS
    FEDERGRUEN, A
    SCHWEITZER, PJ
    TIJMS, HC
    MATHEMATICS OF OPERATIONS RESEARCH, 1983, 8 (02) : 298 - 313
  • [9] RVI Reinforcement Learning for Semi-Markov Decision Processes with Average Reward
    Li, Yanjie
    Cao, Fang
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 1674 - 1679
  • [10] On average reward semi-markov decision processes with a general multichain structure
    Jianyong, L
    Xiaobo, Z
    MATHEMATICS OF OPERATIONS RESEARCH, 2004, 29 (02) : 339 - 352