Algebraic Markov Decision Processes

被引:0
|
作者
Perny, Patrice [1 ]
Spanjaard, Olivier [1 ]
Weng, Paul [1 ]
机构
[1] Univ Paris 06, LIP6, F-75252 Paris 05, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.
引用
收藏
页码:1372 / 1377
页数:6
相关论文
共 50 条
  • [1] An Algebraic Theory of Markov Processes
    Bacci, Giorgio
    Mardare, Radu
    Panangaden, Prakash
    Plotkin, Gordon
    LICS'18: PROCEEDINGS OF THE 33RD ANNUAL ACM/IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE, 2018, : 679 - 688
  • [2] Algebraic Hidden processes and Hidden Markov processes
    Accardi, Luigi
    Soueidi, El Gheteb
    Lu, Yun Gang
    Souissi, Abdessatar
    INFINITE DIMENSIONAL ANALYSIS QUANTUM PROBABILITY AND RELATED TOPICS, 2024,
  • [3] Markov decision processes
    White, D.J.
    Journal of the Operational Research Society, 1995, 46 (06):
  • [4] Markov Decision Processes
    Bäuerle N.
    Rieder U.
    Jahresbericht der Deutschen Mathematiker-Vereinigung, 2010, 112 (4) : 217 - 243
  • [5] ALGEBRAIC DUALITY OF MARKOV-PROCESSES
    VERVAAT, W
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 1987, 26 (02) : 185 - 186
  • [6] Online Markov Decision Processes
    Even-Dar, Eyal
    Kakade, Sham M.
    Mansour, Yishay
    MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 726 - 736
  • [7] MARKOV DECISION-PROCESSES
    SCHAL, M
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 1984, 17 (01) : 13 - 13
  • [8] A review on Markov Decision Processes
    J. A. Filar and LIU Ke Centre for Industrial and Applicable Mathematics
    Institute of Applied Mathematics
    Chinese Science Bulletin, 1999, (07) : 672 - 672
  • [9] On constrained Markov decision processes
    Haviv, M
    OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
  • [10] MARKOV DECISION-PROCESSES
    WHITE, CC
    WHITE, DJ
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1989, 39 (01) : 1 - 16