Planning, learning and coordination in multiagent decision processes

Cited by: 0
Author
Boutilier, C [1]
Affiliation
[1] UNIV BRITISH COLUMBIA, DEPT COMP SCI, VANCOUVER, BC V6T 1Z4, CANADA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Theory of artificial intelligence];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
There has been a growing interest in AI in the design of multiagent systems, especially in multiagent cooperative planning. In this paper, we investigate the extent to which methods from single-agent planning and learning can be applied in multiagent settings. We survey a number of different techniques from decision-theoretic planning and reinforcement learning and describe a number of interesting issues that arise with regard to coordinating the policies of individual agents. To this end, we describe multiagent Markov decision processes as a general model in which to frame this discussion. These are special n-person cooperative games in which agents share the same utility function. We discuss coordination mechanisms based on imposed conventions (or social laws) as well as learning methods for coordination. Our focus is on the decomposition of sequential decision processes so that coordination can be learned (or imposed) locally, at the level of individual states. We also discuss the use of structured problem representations and their role in the generalization of learned conventions and in approximation.
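The idea of a shared utility function combined with an imposed convention can be illustrated with a minimal sketch. Everything in it is a hypothetical toy and not taken from the paper: a single state, two agents, the action names `left` and `right`, a reward of 1 when the agents match, and a lexicographic tie-breaking rule standing in for a convention. Each agent computes the set of optimal joint actions independently, orders it the same way, and plays its own component of the first entry, so the agents agree without communicating.

```python
from itertools import product

# Hypothetical single-state coordination problem with two agents.
ACTIONS = ("left", "right")  # assumed action names, for illustration only


def shared_reward(joint_action):
    """Every agent receives this same payoff: 1 if both match, else 0."""
    first, second = joint_action
    return 1.0 if first == second else 0.0


def optimal_joint_actions():
    """Return all joint actions that maximize the shared reward."""
    joint = list(product(ACTIONS, repeat=2))
    best = max(shared_reward(j) for j in joint)
    return [j for j in joint if shared_reward(j) == best]


def convention_choice(agent_index):
    """Assumed lexicographic convention: sort the optimal joint actions
    identically on every agent and play one's own component of the first."""
    chosen = sorted(optimal_joint_actions())[0]
    return chosen[agent_index]


# Each agent decides on its own, yet the resulting joint action is optimal.
joint_play = tuple(convention_choice(i) for i in range(2))
print(joint_play, shared_reward(joint_play))
```

Without the shared ordering, each agent could pick a different optimal joint action (one playing `left`, the other `right`) and the pair would earn 0, which is the coordination problem the abstract refers to.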
Pages: 195-210
Number of pages: 16