Closely Cooperative Multi-Agent Reinforcement Learning Based on Intention Sharing and Credit Assignment

Authors
Fu, Hao [1, 2]
You, Mingyu [1, 2]
Zhou, Hongjun [1, 2]
He, Bin [1, 2]
Affiliations
[1] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China
[2] Frontiers Sci Ctr Intelligent Autonomous Syst, State Key Lab Intelligent Autonomous Syst, Shanghai Key Lab Intelligent Autonomous Syst, Shanghai 201203, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
Reinforcement learning; Collaboration; Encoding; Training; Multi-agent systems; Autonomous systems; Mutual information; Decision making; Trajectory; Synchronization; MARL; closely collaborative tasks; intention sharing; credit assignment;
DOI
10.1109/LRA.2024.3497661
Chinese Library Classification number
TP24 [Robotics];
Discipline classification code
080202; 1405;
Abstract
Collaborative tasks are important in multi-agent systems, and multi-agent reinforcement learning (MARL) is a commonly used technique for learning cooperative multi-agent policies. The closely collaborative task is a special but common case of cooperative tasks, in which a change in the environmental state requires multiple agents to perform specific actions simultaneously. An example is a box-pushing task in which the boxes are heavy and must be pushed by multiple agents at the same time. Closely collaborative tasks pose unique challenges. First, completing such a task requires agents to synchronize their actions, which in turn requires a consistent intention among them. Second, when erroneous actions by some agents cause the task to fail, it is difficult to avoid incorrectly penalizing the agents that performed the correct actions. These challenges cause most existing MARL methods to perform poorly on this kind of task. In this letter, we propose a closely collaborative multi-agent reinforcement learning (CC-MARL) algorithm based on intention sharing and credit assignment. We use two-phase training to learn intention encoding and intention sharing, respectively, and decompose joint action values based on the idea of a counterfactual baseline. We deploy scenarios in both simulated and real environments with various sizes, numbers of boxes, and numbers of agents, and compare CC-MARL with various classical MARL algorithms on box-pushing tasks of different map scales in simulation, demonstrating the state-of-the-art performance of our method.
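
The credit-assignment problem described in the abstract can be illustrated with a generic counterfactual baseline of the kind used in COMA-style methods. The sketch below is not the CC-MARL decomposition itself, which this record does not specify; the function name `counterfactual_advantage`, the toy action values, and the uniform policies are all assumptions made for illustration. It shows how an agent that pushed correctly receives zero blame when its teammate's choice alone caused the failure.

```python
from typing import Sequence


def counterfactual_advantage(
    q_values: Sequence[float],
    policy: Sequence[float],
    taken_action: int,
) -> float:
    """Generic counterfactual advantage for a single agent (hypothetical helper).

    q_values[a] is the joint action value if this agent had taken action a
    while every other agent's action is held fixed.
    policy[a] is this agent's probability of choosing action a.
    """
    if len(q_values) != len(policy):
        raise ValueError("q_values and policy must have the same length")
    # Baseline: expected joint value when only this agent's action is marginalized out.
    baseline = sum(p * q for p, q in zip(policy, q_values))
    return q_values[taken_action] - baseline


# Toy step (assumed values): a heavy box moves only if both agents push.
# Actions: 0 = wait, 1 = push. Agent A pushed, agent B waited, so the box did not move.
uniform = [0.5, 0.5]

# With B fixed at "wait", nothing A does moves the box.
adv_a = counterfactual_advantage([0.0, 0.0], uniform, taken_action=1)

# With A fixed at "push", B pushing would have moved the box.
adv_b = counterfactual_advantage([0.0, 1.0], uniform, taken_action=0)

print(adv_a, adv_b)
```

In this toy case the agent that acted correctly gets an advantage of zero, while the agent whose action caused the failure gets a negative one, which is the behavior the abstract asks of a credit-assignment scheme.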
Pages: 11770-11777
Page count: 8