Closely Cooperative Multi-Agent Reinforcement Learning Based on Intention Sharing and Credit Assignment

被引:0
|
作者
Fu, Hao [1 ,2 ]
You, Mingyu [1 ,2 ]
Zhou, Hongjun [1 ,2 ]
He, Bin [1 ,2 ]
机构
[1] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China
[2] Frontiers Sci Ctr Intelligent Autonomous Syst, State Key Lab Intelligent Autonomous Syst, Shanghai Key Lab Intelligent Autonomous Syst, Shanghai 201203, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Collaboration; Encoding; Training; Multi-agent systems; Autonomous systems; Mutual information; Decision making; Trajectory; Synchronization; MARL; closely collaborative tasks; intention sharing; credit assignment;
D O I
10.1109/LRA.2024.3497661
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Collaborative tasks are important in multi-agent systems. Multi-agent reinforcement learning is a commonly used technique for solving multi-agent cooperative policy learning. The closely collaborative task is a special but common case within cooperative tasks, where the change in the environmental state requires multiple agents to simultaneously perform specific actions. For example, in a box-pushing task where the boxes are heavy and require multiple agents to push simultaneously. The closely cooperative task faces some unique challenges. Firstly, the completion of a closely collaborative task requires agents to synchronize their actions, necessitating a consistent intention among them. Secondly, when some agents' erroneous actions lead to task failure, it becomes a challenge to avoid incorrectly penalizing agents who performed the correct actions. These challenges make most of the existing MARL methods perform poorly on this task. In this letter, we propose a closely collaborative multi-agent reinforcement learning(CC-MARL) algorithm based on intention sharing and credit assignment. We use a two-phase training to learn intention encoding and intention sharing respectively, and decompose joint action values based on counterfactual baseline ideas. We deployed scenarios in both simulated and real environments with various sizes, numbers of boxes, and numbers of agents and compare CC-MARL with various classical MARL algorithms on box-pushing tasks of different map scales in simulation, demonstrating the state-of-the-art of our method.
引用
收藏
页码:11770 / 11777
页数:8
相关论文
共 50 条
  • [31] A Cooperative Multi-Agent Reinforcement Learning Method Based on Coordination Degree
    Cui, Haoyan
    Zhang, Zhen
    IEEE ACCESS, 2021, 9 : 123805 - 123814
  • [32] Rationality of reward sharing in multi-agent reinforcement learning
    Kazuteru Miyazaki
    Shigenobu Kobayashi
    New Generation Computing, 2001, 19 : 157 - 172
  • [33] Rationality of reward sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    NEW GENERATION COMPUTING, 2001, 19 (02) : 157 - 172
  • [34] On the rationality of Profit Sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    ICCIMA 2001: FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2001, : 123 - 127
  • [35] On the rationality of profit sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    ICCIMA 2001: FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2001, : 421 - 425
  • [36] A Multi-agent Reinforcement Learning with Weighted Experience Sharing
    Yu, Lasheng
    Abdulai, Issahaku
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 219 - 225
  • [37] Learning Cooperative Intrinsic Motivation in Multi-Agent Reinforcement Learning
    Hong, Seung-Jin
    Lee, Sang-Kwang
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1697 - 1699
  • [38] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [39] Cooperative Multi-Agent Reinforcement Learning With Approximate Model Learning
    Park, Young Joon
    Lee, Young Jae
    Kim, Seoung Bum
    IEEE ACCESS, 2020, 8 : 125389 - 125400
  • [40] Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
    Bai, Yunpeng
    Gong, Chen
    Zhang, Bin
    Fan, Guoliang
    Hou, Xinwen
    Lu, Yu
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,