Efficient Multi-agent Communication via Self-supervised Information Aggregation

Cited by: 0
Authors
Guan, Cong [1]
Chen, Feng [1]
Yuan, Lei [1, 2]
Wang, Chenghe [1]
Yin, Hao [1]
Zhang, Zongzhang [1]
Yu, Yang [1, 2]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Polixir Technol, Nanjing, Peoples R China
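Funding
National Science Foundation (USA);
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes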
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Utilizing messages from teammates can improve coordination in cooperative Multi-agent Reinforcement Learning (MARL). To obtain meaningful information for decision-making, previous works typically combine raw messages generated by teammates with local information as inputs for policy. However, neglecting the aggregation of multiple messages poses great inefficiency for policy learning. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in MARL. In this paper, we propose Multi-Agent communication via Self-supervised Information Aggregation (MASIA), with which agents can aggregate the received messages into compact representations with high relevance to augment the local policy. Specifically, we design a permutation invariant message encoder to generate common information aggregated representation from raw messages and optimize it via reconstructing and shooting future information in a self-supervised manner. Each agent would utilize the most relevant parts of the aggregated representation for decision-making by a novel message extraction mechanism. Empirical results demonstrate that our method significantly outperforms strong baselines on multiple cooperative MARL tasks for various task settings.
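The permutation-invariance property mentioned in the abstract can be illustrated with a minimal sketch. This is not the MASIA encoder: it swaps in plain mean pooling over embedded messages, and the names `make_weights`, `embed`, and `aggregate`, the fixed random weights, and the toy message values are all hypothetical, chosen only to show that the aggregated representation does not depend on the order in which messages arrive.

```python
import math
import random


def make_weights(in_dim, out_dim, seed=0):
    """Hypothetical fixed random linear map standing in for a learned encoder."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)] for _ in range(out_dim)]


def embed(message, weights):
    """Embed one raw message: linear map followed by tanh."""
    return [math.tanh(sum(w * x for w, x in zip(row, message))) for row in weights]


def aggregate(messages, weights):
    """Mean-pool the embedded messages (an assumed, simple aggregation rule).

    The mean is symmetric in its arguments, so reordering the messages
    leaves the result unchanged up to floating-point rounding.
    """
    embedded = [embed(m, weights) for m in messages]
    n = len(embedded)
    return [sum(column) / n for column in zip(*embedded)]


# Toy messages from three teammates (illustrative values only).
messages = [[1.0, 0.0, 2.0], [0.5, -1.0, 0.0], [0.0, 3.0, 1.0]]
weights = make_weights(in_dim=3, out_dim=4)

z = aggregate(messages, weights)
z_shuffled = aggregate(list(reversed(messages)), weights)
```

Under these assumptions, `z` and `z_shuffled` agree up to rounding, which is the property a permutation invariant encoder is meant to guarantee. The self-supervised objectives and the message extraction mechanism described in the abstract are not modeled here.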
Pages: 14
Related papers
50 in total
  • [41] Evolving cooperation via communication in homogeneous multi-agent systems
    Baray, C
    INTELLIGENT INFORMATION SYSTEMS, (IIS'97) PROCEEDINGS, 1997, : 204 - 208
  • [42] Multi-Agent Incentive Communication via Decentralized Teammate Modeling
    Yuan, Lei
    Wang, Jianhao
    Zhang, Fuxiang
    Wang, Chenghe
    Zhang, Zongzhang
    Yu, Yang
    Zhang, Chongjie
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9466 - 9474
  • [43] Coordination of multi-agent systems via asynchronous cloud communication
    Bowman, Sean L.
    Nowzari, Cameron
    Pappas, George J.
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 2215 - 2220
  • [44] Masked self-supervised ECG representation learning via multiview information bottleneck
    Yang, Shunxiang
    Lian, Cheng
    Zeng, Zhigang
    Xu, Bingrong
    Su, Yixin
    Xue, Chenyang
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (14): : 7625 - 7637
  • [45] Self-supervised learning for heterogeneous graph via structure information based on metapath
    Ma, Shuai
    Liu, Jian-wei
    Zuo, Xin
    APPLIED SOFT COMPUTING, 2023, 143
  • [46] Masked self-supervised ECG representation learning via multiview information bottleneck
    Shunxiang Yang
    Cheng Lian
    Zhigang Zeng
    Bingrong Xu
    Yixin Su
    Chenyang Xue
    Neural Computing and Applications, 2024, 36 : 7625 - 7637
  • [47] Self-Supervised Contrastive Graph Clustering Network via Structural Information Fusion
    Ji, Xiaoyang
    Zhou, Yuchen
    Yang, Haofu
    Xu, Shiyue
    Li, Jiahao
    PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 254 - 259
  • [48] Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings
    Moayeri, Mazda
    Feizi, Soheil
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7657 - 7666
  • [49] Self-Supervised Information Bottleneck for Deep Multi-View Subspace Clustering
    Wang, Shiye
    Li, Changsheng
    Li, Yanming
    Yuan, Ye
    Wang, Guoren
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1555 - 1567
  • [50] Efficient Anomaly Detection Using Self-Supervised Multi-Cue Tasks
    Jezequel, Loic
    Vu, Ngoc-Son
    Beaudet, Jean
    Histace, Aymeric
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 807 - 821