GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement Learning

被引:5
|
作者
Hao, Qianyue [1 ]
Huang, Wenzhen [1 ]
Feng, Tao [1 ]
Yuan, Jian [1 ]
Li, Yong [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, BNRist, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-agent reinforcement learning; large-scale decision problem; mean field; graph attention; GAME;
D O I
10.1145/3580305.3599359
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advancements in reinforcement learning have witnessed remarkable achievements by intelligent agents ranging from game-playing to industrial applications. Of particular interest is the area of multi-agent reinforcement learning (MARL), which holds significant potential for real-world scenarios. However, typical MARL methods are limited in their ability to handle tens of agents, leaving scenarios with up to hundreds or even thousands of agents almost unexplored. The scaling up of the number of agents presents two primary challenges: (1) agent-agent interactions are crucial in multi-agent systems while the number of interactions grows quadratically with the number of agents, resulting in substantial computational complexity and difficulty in strategies-learning; (2) the strengths of interactions among agents exhibit variations both across agents and over time, making it difficult to precisely model such interactions. In this paper, we propose a novel approach named Graph Attention Mean Field (GAT-MF). By converting agent-agent interactions into interactions between each agent and a weighted mean field, we achieve a substantial reduction in computational complexity. The proposed method offers a precise modeling of interaction dynamics with mathematical proofs of its correctness. Additionally, we design a graph attention mechanism to automatically capture the diverse and time-varying strengths of interactions, ensuring an accurate representation of agent interactions. Through extensive experimentation conducted in both manual and real-world scenarios involving over 3000 agents, we validate the efficacy of our method. The results demonstrate that our method outperforms the best baseline method with a remarkable improvement of 42.7%. Furthermore, our method saves 86.4% training time and 19.2% GPU memory compared to the best baseline method.
引用
收藏
页码:685 / 697
页数:13
相关论文
共 50 条
  • [21] Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning
    Li, Minne
    Qin, Zhiwei
    Jiao, Yan
    Yang, Yaodong
    Gong, Zhichen
    Wang, Jun
    Wang, Chenxi
    Wu, Guobin
    Ye, Jieping
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 983 - 994
  • [22] Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
    Ma, Xiaobai
    Isele, David
    Gupta, Jayesh K.
    Fujimura, Kikuo
    Kochenderfer, Mykel J.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7664 - 7671
  • [23] SparseMAAC: Sparse Attention for Multi-agent Reinforcement Learning
    Li, Wenhao
    Jin, Bo
    Wang, Xiangfeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 96 - 110
  • [24] An overview: Attention mechanisms in multi-agent reinforcement learning
    Hu, Kai
    Xu, Keer
    Xia, Qingfeng
    Li, Mingyang
    Song, Zhiqiang
    Song, Lipeng
    Sun, Ning
    NEUROCOMPUTING, 2024, 598
  • [25] Decentralized Multi-agent Reinforcement Learning for Large-scale Mobile Wireless Sensor Network Control Using Mean Field Games
    Zhou, Zejian
    Qian, Lijun
    Xu, Hao
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
  • [26] Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control
    Zhao, Bocheng
    Huo, Mingying
    Li, Zheng
    Yu, Ze
    Qi, Naiming
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150
  • [27] Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems
    Fu, Qingxu
    Qiu, Tenghai
    Yi, Jianqiang
    Pu, Zhiqiang
    Wu, Shiguang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9341 - 9349
  • [28] Multi-Agent Hierarchical Graph Attention Reinforcement Learning for Grid-Aware Energy Management
    FENG Bingyi
    FENG Mingxiao
    WANG Minrui
    ZHOU Wengang
    LI Houqiang
    ZTE Communications, 2023, 21 (03) : 11 - 21
  • [29] Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
    Mondal, Washim Uddin
    Aggarwal, Vaneet
    Ukkusuri, Satish, V
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [30] Mean Field Multi-Agent Reinforcement Learning Method for Area Traffic Signal Control
    Zhang, Zundong
    Zhang, Wei
    Liu, Yuke
    Xiong, Gang
    ELECTRONICS, 2023, 12 (22)