Algebraic Markov Decision Processes

Cited by: 0
Authors
Perny, Patrice [1]
Spanjaard, Olivier [1]
Weng, Paul [1]
Affiliations
[1] Univ Paris 06, LIP6, F-75252 Paris 05, France
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.
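To make the idea of a parameterized Bellman recursion concrete, the following is a minimal sketch of finite-horizon backward induction in which the operators are supplied by the caller. It is not the paper's decision rule graph formulation or its Jacobi algorithm; the class `Ops`, the function `backward_induction`, the two-state example, and the particular operator choices (max/+ with expectation for the probabilistic case, max/min for an optimistic possibilistic case) are all illustrative assumptions.

```python
import operator
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, Iterable, Tuple

Key = Tuple[Hashable, Hashable]  # (state, action)


@dataclass(frozen=True)
class Ops:
    """Hypothetical bundle of operators standing in for the two semirings."""
    choose: Callable[[float, float], float]     # picks between actions
    combine: Callable[[float, float], float]    # reward with future value
    aggregate: Callable[[float, float], float]  # over successor states
    weigh: Callable[[float, float], float]      # plausibility with value
    agg_zero: float                             # neutral element of aggregate


def backward_induction(
    states: Iterable[Hashable],
    actions: Iterable[Hashable],
    reward: Dict[Key, float],
    plaus: Dict[Key, Dict[Hashable, float]],
    terminal: Dict[Hashable, float],
    horizon: int,
    ops: Ops,
) -> Dict[Hashable, float]:
    """Apply the generalized Bellman backup `horizon` times."""
    states, actions = list(states), list(actions)
    value = dict(terminal)
    for _ in range(horizon):
        new_value = {}
        for s in states:
            best = None
            for a in actions:
                # Generalized expectation over successor states.
                expected = ops.agg_zero
                for s2, w in plaus[(s, a)].items():
                    expected = ops.aggregate(expected, ops.weigh(w, value[s2]))
                q = ops.combine(reward[(s, a)], expected)
                best = q if best is None else ops.choose(best, q)
            new_value[s] = best
        value = new_value
    return value


S, A = ["s0", "s1"], ["stay", "go"]

# Assumed instance 1: classical MDP (maximize expected total reward).
probabilistic = Ops(max, operator.add, operator.add, operator.mul, 0.0)
prob_values = backward_induction(
    S, A,
    reward={("s0", "stay"): 0.0, ("s0", "go"): 0.0,
            ("s1", "stay"): 2.0, ("s1", "go"): 0.0},
    plaus={("s0", "stay"): {"s0": 1.0}, ("s0", "go"): {"s0": 0.5, "s1": 0.5},
           ("s1", "stay"): {"s1": 1.0}, ("s1", "go"): {"s0": 1.0}},
    terminal={"s0": 0.0, "s1": 0.0},
    horizon=2,
    ops=probabilistic,
)

# Assumed instance 2: optimistic possibilistic variant on the scale [0, 1].
possibilistic = Ops(max, min, max, min, 0.0)
poss_values = backward_induction(
    S, A,
    reward={("s0", "stay"): 0.2, ("s0", "go"): 0.6,
            ("s1", "stay"): 0.9, ("s1", "go"): 0.1},
    plaus={("s0", "stay"): {"s0": 1.0}, ("s0", "go"): {"s0": 0.5, "s1": 1.0},
           ("s1", "stay"): {"s1": 1.0}, ("s1", "go"): {"s0": 1.0}},
    terminal={"s0": 1.0, "s1": 1.0},
    horizon=2,
    ops=possibilistic,
)

print(prob_values)
print(poss_values)
```

The same loop serves both instances; only the operators and the value scale change, which is the kind of unification the abstract describes. A partial preference structure would need `choose` to return a set of non-dominated values rather than a single number, which this sketch does not attempt.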
Pages: 1372-1377
Number of pages: 6