Algebraic Markov Decision Processes

被引:0
|
作者
Perny, Patrice [1 ]
Spanjaard, Olivier [1 ]
Weng, Paul [1 ]
机构
[1] Univ Paris 06, LIP6, F-75252 Paris 05, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.
引用
收藏
页码:1372 / 1377
页数:6
相关论文
共 50 条
  • [21] Robust Markov Decision Processes
    Wiesemann, Wolfram
    Kuhn, Daniel
    Rustem, Berc
    MATHEMATICS OF OPERATIONS RESEARCH, 2013, 38 (01) : 153 - 183
  • [22] Ordinal Decision Models for Markov Decision Processes
    Weng, Paul
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 828 - 833
  • [23] Markov Decision Processes with Arbitrary Reward Processes
    Yu, Jia Yuan
    Mannor, Shie
    Shimkin, Nahum
    MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 737 - 757
  • [24] Markov Decision Processes with Arbitrary Reward Processes
    Yu, Jia Yuan
    Mannor, Shie
    Shimkin, Nahum
    RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 268 - +
  • [25] Markov Chains and Markov Decision Processes in Isabelle/HOL
    Hoelzl, Johannes
    JOURNAL OF AUTOMATED REASONING, 2017, 59 (03) : 345 - 387
  • [26] Markov Chains and Markov Decision Processes in Isabelle/HOL
    Johannes Hölzl
    Journal of Automated Reasoning, 2017, 59 : 345 - 387
  • [27] APPROXIMATING THE MARKOV PROPERTY IN MARKOV DECISION-PROCESSES
    WHITE, DJ
    INFORMATION AND DECISION TECHNOLOGIES, 1989, 15 (03): : 147 - 162
  • [28] Mean Field Markov Decision Processes
    Baeuerle, Nicole
    APPLIED MATHEMATICS AND OPTIMIZATION, 2023, 88 (01):
  • [29] Solving concurrent Markov decision processes
    Weld, M
    Weld, DS
    PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 716 - 722
  • [30] Approximate equivalence of Markov decision processes
    Even-Dar, E
    Mansour, Y
    LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 581 - 594