Efficient Multi-agent Communication via Self-supervised Information Aggregation

被引:0
|
作者
Guan, Cong [1 ]
Chen, Feng [1 ]
Yuan, Lei [1 ,2 ]
Wang, Chenghe [1 ]
Yin, Hao [1 ]
Zhang, Zongzhang [1 ]
Yu, Yang [1 ,2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Polixir Technol, Nanjing, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Utilizing messages from teammates can improve coordination in cooperative Multi-agent Reinforcement Learning (MARL). To obtain meaningful information for decision-making, previous works typically combine raw messages generated by teammates with local information as inputs for policy. However, neglecting the aggregation of multiple messages poses great inefficiency for policy learning. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in MARL. In this paper, we propose Multi-Agent communication via Self-supervised Information Aggregation (MASIA), with which agents can aggregate the received messages into compact representations with high relevance to augment the local policy. Specifically, we design a permutation invariant message encoder to generate common information aggregated representation from raw messages and optimize it via reconstructing and shooting future information in a self-supervised manner. Each agent would utilize the most relevant parts of the aggregated representation for decision-making by a novel message extraction mechanism. Empirical results demonstrate that our method significantly outperforms strong baselines on multiple cooperative MARL tasks for various task settings.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Self-Supervised Exploration via Disagreement
    Pathak, Deepak
    Gandhi, Dhiraj
    Gupta, Abhinav
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [32] Robust Multi-Agent Communication With Graph Information Bottleneck Optimization
    Ding, Shifei
    Du, Wei
    Ding, Ling
    Zhang, Jian
    Guo, Lili
    An, Bo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 3096 - 3107
  • [33] Time Preference for Information in Multi-agent Exploration with Limited Communication
    Spirin, Victor
    Cameron, Stephen
    de Hoog, Julian
    TOWARDS AUTONOMOUS ROBOTIC SYSTEMS, 2014, 8069 : 34 - 45
  • [34] Communication-Efficient and Federated Multi-Agent Reinforcement Learning
    Krouka, Mounssif
    Elgabli, Anis
    Ben Issaid, Chaouki
    Bennis, Mehdi
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2022, 8 (01) : 311 - 320
  • [35] High-speed underwater acoustic communication for multi-agent supervised autonomy
    Jarrot, Arnaud
    Gelman, Andriy
    Choi, Gloria
    Speck, Andrew
    Strunk, Gavin
    Croux, Arnaud
    Osedach, Timothy P.
    Vannuffelen, Stephane
    Ossia, Sepand
    Vincent, Jack
    Grall, Sebastien
    Eudeline, Guillaume
    2021 FIFTH UNDERWATER COMMUNICATIONS AND NETWORKING CONFERENCE (UCOMMS), 2021,
  • [36] Limited Information Aggregation for Collaborative Driving in Multi-Agent Autonomous Vehicles
    Liang, Qingyi
    Liu, Jia
    Jiang, Zhengmin
    Yin, Jianwen
    Xu, Kun
    Li, Huiyun
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6624 - 6631
  • [37] Position-Aware Communication via Self-Attention for Multi-Agent Reinforcement Learning
    Shih, Tsan-Hua
    Lin, Hsien-, I
    2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [38] Self-Supervised Neural Aggregation Networks for Human Parsing
    Zhao, Jian
    Li, Jianshu
    Nie, Xuecheng
    Zhao, Fang
    Chen, Yunpeng
    Wang, Zhecan
    Feng, Jiashi
    Yan, Shuicheng
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1595 - 1603
  • [39] Social-SSL: Self-supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction
    Tsao, Li-Wu
    Wang, Yan-Kai
    Lin, Hao-Siang
    Shuai, Hong-Han
    Wong, Lai-Kuan
    Cheng, Wen-Huang
    COMPUTER VISION, ECCV 2022, PT XXII, 2022, 13682 : 234 - 250
  • [40] Multi-Agent Consensus with Noisy Communication via Time Averaging
    Morita, Ryosuke
    Wada, Takayuki
    Masubuchi, Izumi
    Asai, Toru
    Fujisaki, Yasumasa
    2014 EUROPEAN CONTROL CONFERENCE (ECC), 2014, : 1530 - 1535