GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement Learning

被引:5
|
作者
Hao, Qianyue [1 ]
Huang, Wenzhen [1 ]
Feng, Tao [1 ]
Yuan, Jian [1 ]
Li, Yong [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, BNRist, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-agent reinforcement learning; large-scale decision problem; mean field; graph attention; GAME;
D O I
10.1145/3580305.3599359
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advancements in reinforcement learning have witnessed remarkable achievements by intelligent agents ranging from game-playing to industrial applications. Of particular interest is the area of multi-agent reinforcement learning (MARL), which holds significant potential for real-world scenarios. However, typical MARL methods are limited in their ability to handle tens of agents, leaving scenarios with up to hundreds or even thousands of agents almost unexplored. The scaling up of the number of agents presents two primary challenges: (1) agent-agent interactions are crucial in multi-agent systems while the number of interactions grows quadratically with the number of agents, resulting in substantial computational complexity and difficulty in strategies-learning; (2) the strengths of interactions among agents exhibit variations both across agents and over time, making it difficult to precisely model such interactions. In this paper, we propose a novel approach named Graph Attention Mean Field (GAT-MF). By converting agent-agent interactions into interactions between each agent and a weighted mean field, we achieve a substantial reduction in computational complexity. The proposed method offers a precise modeling of interaction dynamics with mathematical proofs of its correctness. Additionally, we design a graph attention mechanism to automatically capture the diverse and time-varying strengths of interactions, ensuring an accurate representation of agent interactions. Through extensive experimentation conducted in both manual and real-world scenarios involving over 3000 agents, we validate the efficacy of our method. The results demonstrate that our method outperforms the best baseline method with a remarkable improvement of 42.7%. Furthermore, our method saves 86.4% training time and 19.2% GPU memory compared to the best baseline method.
引用
收藏
页码:685 / 697
页数:13
相关论文
共 50 条
  • [41] A Graph Attention Mechanism Based Multi-Agent Reinforcement Learning Method for Efficient Traffic Light Control
    Su, Changqing
    Yan, Yan
    Wang, Tao
    Zhang, Baoxian
    Li, Cheng
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 1332 - 1337
  • [42] Weighted Mean-Field Multi-Agent Reinforcement Learning via Reward Attribution Decomposition
    Wu, Tingyu
    Li, Wenhao
    Jin, Bo
    Zhang, Wei
    Wang, Xiangfeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2022 INTERNATIONAL WORKSHOPS, 2022, 13248 : 301 - 316
  • [43] Learning Decentralized Traffic Signal Controllers With Multi-Agent Graph Reinforcement Learning
    Zhang, Yao
    Yu, Zhiwen
    Zhang, Jun
    Wang, Liang
    Luan, Tom H.
    Guo, Bin
    Yuen, Chau
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 7180 - 7195
  • [44] Routing with Graph Convolutional Networks and Multi-Agent Deep Reinforcement Learning
    Bhavanasi, Sai Shreyas
    Pappone, Lorenzo
    Esposito, Flavio
    2022 IEEE CONFERENCE ON NETWORK FUNCTION VIRTUALIZATION AND SOFTWARE DEFINED NETWORKS (IEEE NFV-SDN), 2022, : 72 - 77
  • [45] Multi-Agent Graph Convolutional Reinforcement Learning for Intelligent Load Balancing
    Houidi, Omar
    Bakri, Sihem
    Zeghlache, Djamal
    PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
  • [46] Graph Convolutional Multi-Agent Reinforcement Learning for UAV Coverage Control
    Dai, Anna
    Li, Rongpeng
    Zhaot, Zhifeng
    Zhang, Honggang
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1106 - 1111
  • [47] Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
    Jing, Gangshan
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    Sharma, Piyush K.
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3273 - 3278
  • [48] Hierarchical graph multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    INFORMATION SCIENCES, 2023, 634 : 55 - 72
  • [49] Effective Methods for Reinforcement Learning in Large Multi-Agent Domains
    Riedmiller, Martin
    Withopf, Daniel
    IT-INFORMATION TECHNOLOGY, 2005, 47 (05): : 241 - 249
  • [50] Cooperative Multi-Agent Reinforcement Learning in a Large Stationary Environment
    Zemzem, Wiem
    Tagina, Moncef
    2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 365 - 371