Efficient Multi-agent Communication via Self-supervised Information Aggregation

被引:0
|
作者
Guan, Cong [1 ]
Chen, Feng [1 ]
Yuan, Lei [1 ,2 ]
Wang, Chenghe [1 ]
Yin, Hao [1 ]
Zhang, Zongzhang [1 ]
Yu, Yang [1 ,2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Polixir Technol, Nanjing, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Utilizing messages from teammates can improve coordination in cooperative Multi-agent Reinforcement Learning (MARL). To obtain meaningful information for decision-making, previous works typically combine raw messages generated by teammates with local information as inputs for policy. However, neglecting the aggregation of multiple messages poses great inefficiency for policy learning. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in MARL. In this paper, we propose Multi-Agent communication via Self-supervised Information Aggregation (MASIA), with which agents can aggregate the received messages into compact representations with high relevance to augment the local policy. Specifically, we design a permutation invariant message encoder to generate common information aggregated representation from raw messages and optimize it via reconstructing and shooting future information in a self-supervised manner. Each agent would utilize the most relevant parts of the aggregated representation for decision-making by a novel message extraction mechanism. Empirical results demonstrate that our method significantly outperforms strong baselines on multiple cooperative MARL tasks for various task settings.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning
    Guan, Cong
    Chen, Feng
    Yuan, Lei
    Zhang, Zongzhang
    Yu, Yang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [2] Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning
    Chen, Yinda
    Huang, Wei
    Zhou, Shenglong
    Chen, Qi
    Xiong, Zhiwei
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 609 - 617
  • [3] COMPETITIVE MULTI-AGENT REINFORCEMENT LEARNING WITH SELF-SUPERVISED REPRESENTATION
    Su, DiJia
    Lee, Jason D.
    Mulvey, John M.
    Poor, H. Vincent
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4098 - 4102
  • [4] Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck
    Ding, Shifei
    Du, Wei
    Ding, Ling
    Guo, Lili
    Zhang, Jian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17346 - 17353
  • [5] Handling Realistic Noise in Multi-Agent Systems with Self-Supervised Learning and Curiosity
    Szemenyei, Marton
    Reizinger, Patrik
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2022, 12 (02) : 135 - 148
  • [6] Efficient Multi-Agent Communication via Shapley Message Value
    Xue, Di
    Yuan, Lei
    Zhang, Zongzhang
    Yu, Yang
    PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, 2022, : 578 - 584
  • [7] Efficient DDPG via the Self-Supervised Method
    Zhang, Guanghao
    Chen, Hongliang
    Li, Jianxun
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4636 - 4642
  • [8] Efficient agent communication in multi-agent systems
    Jang, MW
    Ahmed, A
    Agha, G
    SOFTWARE ENGINEERING FOR MULTI-AGENT SYSTEMS III: RESEARCH ISSUES AND PRACTICAL APPLICATIONS, 2004, 3390 : 236 - 253
  • [9] Efficient Collaboration via Interaction Information in Multi-agent System
    Shi, Meilong
    Liu, Quan
    Huang, Zhigang
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 401 - 412
  • [10] Multi-Agent Driving Behavior Prediction across Different Scenarios with Self-Supervised Domain Knowledge
    Ma, Hengbo
    Sun, Yaofeng
    Li, Jiachen
    Tomizuka, Masayoshi
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 3122 - 3129