Algebraic Markov Decision Processes

被引：0

作者：

Perny, Patrice ^{[1
]}

Spanjaard, Olivier ^{[1
]}

Weng, Paul ^{[1
]}

机构：

[1] Univ Paris 06, LIP6, F-75252 Paris 05, France

来源：

19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05) | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.

引用

页码：1372 / 1377

页数：6

共 50 条

[21] Robust Markov Decision Processes
Wiesemann, Wolfram
Kuhn, Daniel
Rustem, Berc
MATHEMATICS OF OPERATIONS RESEARCH, 2013, 38 (01) : 153 - 183
[22] Ordinal Decision Models for Markov Decision Processes
Weng, Paul
20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 828 - 833
[23] Markov Decision Processes with Arbitrary Reward Processes
Yu, Jia Yuan
Mannor, Shie
Shimkin, Nahum
MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 737 - 757
[24] Markov Decision Processes with Arbitrary Reward Processes
Yu, Jia Yuan
Mannor, Shie
Shimkin, Nahum
RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 268 - +
[25] Markov Chains and Markov Decision Processes in Isabelle/HOL
Hoelzl, Johannes
JOURNAL OF AUTOMATED REASONING, 2017, 59 (03) : 345 - 387
[26] Markov Chains and Markov Decision Processes in Isabelle/HOL
Johannes Hölzl
Journal of Automated Reasoning, 2017, 59 : 345 - 387
[27] APPROXIMATING THE MARKOV PROPERTY IN MARKOV DECISION-PROCESSES
WHITE, DJ
INFORMATION AND DECISION TECHNOLOGIES, 1989, 15 (03): : 147 - 162
[28] Mean Field Markov Decision Processes
Baeuerle, Nicole
APPLIED MATHEMATICS AND OPTIMIZATION, 2023, 88 (01):
[29] Solving concurrent Markov decision processes
Weld, M
Weld, DS
PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 716 - 722
[30] Approximate equivalence of Markov decision processes
Even-Dar, E
Mansour, Y
LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 581 - 594

← 1 2 3 4 5 →