Algebraic Markov Decision Processes

Cited by: 0
Authors
Perny, Patrice [1]
Spanjaard, Olivier [1]
Weng, Paul [1]
Affiliations
[1] Univ Paris 06, LIP6, F-75252 Paris 05, France
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.
Pages: 1372-1377
Number of pages: 6