Closely Cooperative Multi-Agent Reinforcement Learning Based on Intention Sharing and Credit Assignment

被引:0
|
作者
Fu, Hao [1 ,2 ]
You, Mingyu [1 ,2 ]
Zhou, Hongjun [1 ,2 ]
He, Bin [1 ,2 ]
机构
[1] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Coll Elect & Informat Engn, Shanghai 200070, Peoples R China
[2] Frontiers Sci Ctr Intelligent Autonomous Syst, State Key Lab Intelligent Autonomous Syst, Shanghai Key Lab Intelligent Autonomous Syst, Shanghai 201203, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Collaboration; Encoding; Training; Multi-agent systems; Autonomous systems; Mutual information; Decision making; Trajectory; Synchronization; MARL; closely collaborative tasks; intention sharing; credit assignment;
D O I
10.1109/LRA.2024.3497661
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Collaborative tasks are important in multi-agent systems. Multi-agent reinforcement learning is a commonly used technique for solving multi-agent cooperative policy learning. The closely collaborative task is a special but common case within cooperative tasks, where the change in the environmental state requires multiple agents to simultaneously perform specific actions. For example, in a box-pushing task where the boxes are heavy and require multiple agents to push simultaneously. The closely cooperative task faces some unique challenges. Firstly, the completion of a closely collaborative task requires agents to synchronize their actions, necessitating a consistent intention among them. Secondly, when some agents' erroneous actions lead to task failure, it becomes a challenge to avoid incorrectly penalizing agents who performed the correct actions. These challenges make most of the existing MARL methods perform poorly on this task. In this letter, we propose a closely collaborative multi-agent reinforcement learning(CC-MARL) algorithm based on intention sharing and credit assignment. We use a two-phase training to learn intention encoding and intention sharing respectively, and decompose joint action values based on counterfactual baseline ideas. We deployed scenarios in both simulated and real environments with various sizes, numbers of boxes, and numbers of agents and compare CC-MARL with various classical MARL algorithms on box-pushing tasks of different map scales in simulation, demonstrating the state-of-the-art of our method.
引用
收藏
页码:11770 / 11777
页数:8
相关论文
共 50 条
  • [11] LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
    Yang, Mingyu
    Zhao, Jian
    Hu, Xunhan
    Zhou, Wengang
    Zhu, Jiangcheng
    Li, Houqiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [12] Information Sharing for Cooperative Robots via Multi-Agent Reinforcement Learning
    Siddiqua, Ayesha
    Liu, Siming
    Iqbal, Razib
    Ross, Logan
    Zweerink, Brian
    Eskridge, Ryan
    2024 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS, ROSE 2024, 2024,
  • [13] Multi-agent Cooperative Search based on Reinforcement Learning
    Sun, Yinjiang
    Zhang, Rui
    Liang, Wenbao
    Xu, Cheng
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 891 - 896
  • [14] Cooperative multi-agent game based on reinforcement learning
    Liu, Hongbo
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
  • [15] Multi-agent cooperative learning research based on reinforcement learning
    Liu, Fei
    Zeng, Guangzhou
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 1408 - 1413
  • [16] Cooperative Action Acquisition Based on Intention Estimation Method in a Multi-agent Reinforcement Learning System
    Tsubakimoto, Tatsuya
    Kobayashi, Kunikazu
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2014), 2014, : 122 - 125
  • [17] Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning
    Ba, Yanwen
    Liu, Xuan
    Chen, Xinning
    Wang, Hao
    Xu, Yang
    Li, Kenli
    Zhang, Shigeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17299 - 17307
  • [18] Multi-Agent Evolutionary Reinforcement Learning Based on Cooperative Games
    Yu, Jin
    Zhang, Ya
    Sun, Changyin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [19] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
    Lin, Jieyu
    Dzeparoska, Kristina
    Zhang, Sai Qian
    Leon-Garcia, Alberto
    Papernot, Nicolas
    2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
  • [20] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Zhang, Bin
    Li, Dapeng
    Zhang, Zeren
    Zhou, Guangchong
    Chen, Hao
    Fan, Guoliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734