Algebraic Markov Decision Processes

被引：0

作者：

Perny, Patrice ^{[1
]}

Spanjaard, Olivier ^{[1
]}

Weng, Paul ^{[1
]}

机构：

[1] Univ Paris 06, LIP6, F-75252 Paris 05, France

来源：

19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05) | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.

引用

页码：1372 / 1377

页数：6

共 50 条

[31] Mutually Dependent Markov Decision Processes
Fujita, Toshiharu
Kira, Akifumi
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2014, 18 (06) : 992 - 998
[32] Reachability in recursive Markov decision processes
Brazdil, Tomas
Brozek, Vaclav
Forejt, Vojtech
Kucera, Antonin
CONCUR 2006 - CONCURRENCY THEORY, PROCEEDINGS, 2006, 4137 : 358 - 374
[33] An axiomatic approach to Markov decision processes
Adam Jonsson
Mathematical Methods of Operations Research, 2023, 97 : 117 - 133
[34] Learning to Collaborate in Markov Decision Processes
Radanovic, Goran
Devidze, Rati
Parkes, David C.
Singla, Adish
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[35] APPLICATIONS OF MARKOV DECISION-PROCESSES
WIJNMALEN, DJD
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1994, 45 (05) : 607 - 608
[36] Probabilistic opacity for Markov decision processes
Berard, Beatrice
Chatterjee, Krishnendu
Sznajder, Nathalie
INFORMATION PROCESSING LETTERS, 2015, 115 (01) : 52 - 59
[37] Markov Decision Processes with Applications to Finance
McAuliffe, Jon
QUANTITATIVE FINANCE, 2012, 12 (01) : 15 - 16
[38] Preference Planning for Markov Decision Processes
Li, Meilun
She, Zhikun
Turrini, Andrea
Zhang, Lijun
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3313 - 3319
[39] Probabilistic Hyperproperties of Markov Decision Processes
Dimitrova, Rayna
Finkbeiner, Bernd
Torfah, Hazem
AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2020), 2020, 12302 : 484 - 500
[40] Active Exploration in Markov Decision Processes
Tarbouriech, Jean
Lazaric, Alessandro
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89

← 1 2 3 4 5 →