Planning, learning and coordination in multiagent decision processes

被引:0
|
作者
Boutilier, C [1 ]
机构
[1] UNIV BRITISH COLUMBIA,DEPT COMP SCI,VANCOUVER,BC V6T 1Z4,CANADA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been a growing interest in AI in the design of multiagent systems, especially in multiagent cooperative planning. In this paper, we investigate the extent to which methods from single-agent planning and learning can be applied in multiagent settings. We survey a number of different techniques from decision-theoretic planning and reinforcement learning and describe a number of interesting issues that arise with regard to coordinating the policies of individual agents. To this end, we describe multiagent Markov decision processes as a general model in which to frame this discussion. These are special n-person cooperative games in which agents share the same utility function. We discuss coordination mechanisms based on imposed conventions (or social laws) as well as learning methods for coordination. Our focus is on the decomposition of sequential decision processes so that coordination can be learned (or imposed) locally, at the level of individual states. We also discuss the use of structured problem representations and their role in the generalization of learned conventions and in approximation.
引用
收藏
页码:195 / 210
页数:16
相关论文
共 50 条
  • [21] Flexible coordination of multiagent team behavior using HTN planning
    Obst, Oliver
    Boedecker, Joschka
    ROBOCUP 2005: ROBOT SOCCER WORLD CUP IX, 2006, 4020 : 521 - 528
  • [22] Anytime algorithms for multiagent decision making using coordination graphs
    Vlassis, N
    Elhorst, R
    Kok, JR
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 953 - 957
  • [23] Context-specific multiagent coordination and planning with factored MDPs
    Guestrin, C
    Venkataraman, S
    Koller, D
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 253 - 259
  • [24] Multiagent Decision Making and Learning in Urban Environments
    Kumar, Akshat
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 6398 - 6402
  • [25] Dealing with Groups of Actions in Multiagent Markov Decision Processes
    Debras, Guillaume
    Mouaddib, Abdel-Illah
    Pierre, Laurent Jean
    Le Gloannec, Simon
    PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, VOL 1: ECTA, 2016, : 49 - 58
  • [26] A layered approach to learning coordination knowledge in multiagent environments
    Erus, Guray
    Polat, Faruk
    APPLIED INTELLIGENCE, 2007, 27 (03) : 249 - 267
  • [27] Coordination for Multienergy Microgrids Using Multiagent Reinforcement Learning
    Qiu, Dawei
    Chen, Tianyi
    Strbac, Goran
    Bu, Shengrong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (04) : 5689 - 5700
  • [28] A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes
    Ross, Stephane
    Pineau, Joelle
    Chaib-draa, Brahim
    Kreitmann, Pierre
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1729 - 1770
  • [29] Policy Reuse for Learning and Planning in Partially Observable Markov Decision Processes
    Wu, Bo
    Feng, Yanpeng
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 549 - 552
  • [30] The Impact of Agent Definitions and Interactions on Multiagent Learning for Coordination
    Chung, Jen Jen
    Miklic, Damjan
    Sabattini, Lorenzo
    Tumer, Kagan
    Siegwart, Roland
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1752 - 1760