Algebraic Markov Decision Processes

被引：0

作者：

Perny, Patrice ^{[1
]}

Spanjaard, Olivier ^{[1
]}

Weng, Paul ^{[1
]}

机构：

[1] Univ Paris 06, LIP6, F-75252 Paris 05, France

来源：

19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05) | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.

引用

页码：1372 / 1377

页数：6

共 50 条

[41] On Markov policies for minimax decision processes
Iwamoto, S
Tsurusaki, K
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2001, 253 (01) : 58 - 78
[42] Planning with Abstract Markov Decision Processes
Gopalan, Nakul
desJardins, Marie
Littman, Michael L.
MacGlashan, James
Squire, Shawn
Tellex, Stefanie
Winder, John
Wong, Lawson L. S.
TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 480 - 488
[43] Temporal concatenation for Markov decision processes
Song, Ruiyang
Xu, Kuang
PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2022, 36 (04) : 999 - 1026
[44] Reachability in recursive Markov decision processes
Brazdil, Tomas
Brozek, Vaclav
Forejt, Vojtech
Kucera, Antonin
INFORMATION AND COMPUTATION, 2008, 206 (05) : 520 - 537
[45] Entropic Regularization of Markov Decision Processes
Belousov, Boris
Peters, Jan
ENTROPY, 2019, 21 (07)
[46] ON THE GENERATION OF MARKOV DECISION-PROCESSES
ARCHIBALD, TW
MCKINNON, KIM
THOMAS, LC
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1995, 46 (03) : 354 - 361
[47] Quantitative Programming and Markov Decision Processes
Todoran, Eneia Nicolae
2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 117 - 124
[48] An analysis of transient Markov decision processes
James, Huw W.
Collins, E. J.
JOURNAL OF APPLIED PROBABILITY, 2006, 43 (03) : 603 - 621
[49] Multitime scale Markov decision processes
Chang, HS
Fard, PJ
Marcus, SI
Shayman, M
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (06) : 976 - 987
[50] Networked Markov Decision Processes With Delays
Adlakha, Sachin
Lall, Sanjay
Goldsmith, Andrea
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (04) : 1013 - 1018

← 1 2 3 4 5 →