Planning, learning and coordination in multiagent decision processes

被引：0

作者：

Boutilier, C ^{[1
]}

机构：

[1] UNIV BRITISH COLUMBIA,DEPT COMP SCI,VANCOUVER,BC V6T 1Z4,CANADA

来源：

THEORETICAL ASPECTS OF RATIONALITY AND KNOWLEDGE | 1996年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There has been a growing interest in AI in the design of multiagent systems, especially in multiagent cooperative planning. In this paper, we investigate the extent to which methods from single-agent planning and learning can be applied in multiagent settings. We survey a number of different techniques from decision-theoretic planning and reinforcement learning and describe a number of interesting issues that arise with regard to coordinating the policies of individual agents. To this end, we describe multiagent Markov decision processes as a general model in which to frame this discussion. These are special n-person cooperative games in which agents share the same utility function. We discuss coordination mechanisms based on imposed conventions (or social laws) as well as learning methods for coordination. Our focus is on the decomposition of sequential decision processes so that coordination can be learned (or imposed) locally, at the level of individual states. We also discuss the use of structured problem representations and their role in the generalization of learned conventions and in approximation.

引用

页码：195 / 210

页数：16

共 50 条

[21] Flexible coordination of multiagent team behavior using HTN planning
Obst, Oliver
Boedecker, Joschka
ROBOCUP 2005: ROBOT SOCCER WORLD CUP IX, 2006, 4020 : 521 - 528
[22] Anytime algorithms for multiagent decision making using coordination graphs
Vlassis, N
Elhorst, R
Kok, JR
2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 953 - 957
[23] Context-specific multiagent coordination and planning with factored MDPs
Guestrin, C
Venkataraman, S
Koller, D
EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 253 - 259
[24] Multiagent Decision Making and Learning in Urban Environments
Kumar, Akshat
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6398 - 6402
[25] Dealing with Groups of Actions in Multiagent Markov Decision Processes
Debras, Guillaume
Mouaddib, Abdel-Illah
Pierre, Laurent Jean
Le Gloannec, Simon
PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, VOL 1: ECTA, 2016, : 49 - 58
[26] A layered approach to learning coordination knowledge in multiagent environments
Erus, Guray
Polat, Faruk
APPLIED INTELLIGENCE, 2007, 27 (03) : 249 - 267
[27] Coordination for Multienergy Microgrids Using Multiagent Reinforcement Learning
Qiu, Dawei
Chen, Tianyi
Strbac, Goran
Bu, Shengrong
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (04) : 5689 - 5700
[28] A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes
Ross, Stephane
Pineau, Joelle
Chaib-draa, Brahim
Kreitmann, Pierre
JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1729 - 1770
[29] Policy Reuse for Learning and Planning in Partially Observable Markov Decision Processes
Wu, Bo
Feng, Yanpeng
2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 549 - 552
[30] The Impact of Agent Definitions and Interactions on Multiagent Learning for Coordination
Chung, Jen Jen
Miklic, Damjan
Sabattini, Lorenzo
Tumer, Kagan
Siegwart, Roland
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1752 - 1760

← 1 2 3 4 5 →