GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement Learning

被引:5
|
作者
Hao, Qianyue [1 ]
Huang, Wenzhen [1 ]
Feng, Tao [1 ]
Yuan, Jian [1 ]
Li, Yong [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, BNRist, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-agent reinforcement learning; large-scale decision problem; mean field; graph attention; GAME;
D O I
10.1145/3580305.3599359
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advancements in reinforcement learning have witnessed remarkable achievements by intelligent agents ranging from game-playing to industrial applications. Of particular interest is the area of multi-agent reinforcement learning (MARL), which holds significant potential for real-world scenarios. However, typical MARL methods are limited in their ability to handle tens of agents, leaving scenarios with up to hundreds or even thousands of agents almost unexplored. The scaling up of the number of agents presents two primary challenges: (1) agent-agent interactions are crucial in multi-agent systems while the number of interactions grows quadratically with the number of agents, resulting in substantial computational complexity and difficulty in strategies-learning; (2) the strengths of interactions among agents exhibit variations both across agents and over time, making it difficult to precisely model such interactions. In this paper, we propose a novel approach named Graph Attention Mean Field (GAT-MF). By converting agent-agent interactions into interactions between each agent and a weighted mean field, we achieve a substantial reduction in computational complexity. The proposed method offers a precise modeling of interaction dynamics with mathematical proofs of its correctness. Additionally, we design a graph attention mechanism to automatically capture the diverse and time-varying strengths of interactions, ensuring an accurate representation of agent interactions. Through extensive experimentation conducted in both manual and real-world scenarios involving over 3000 agents, we validate the efficacy of our method. The results demonstrate that our method outperforms the best baseline method with a remarkable improvement of 42.7%. Furthermore, our method saves 86.4% training time and 19.2% GPU memory compared to the best baseline method.
引用
收藏
页码:685 / 697
页数:13
相关论文
共 50 条
  • [1] Mean Field Multi-Agent Reinforcement Learning
    Yang, Yaodong
    Luo, Rui
    Li, Minne
    Zhou, Ming
    Zhang, Weinan
    Wang, Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [2] A Weighted Mean Field Reinforcement Learning Algorithm for Large-Scale Multi-Agent Collaboration
    Xinwei Yuan
    He Wang
    Wenwu Yu
    Guidance,Navigation and Control, 2023, (02) : 42 - 60
  • [3] Partially Observable Mean Field Multi-Agent Reinforcement Learning Based on Graph Attention Network for UAV Swarms
    Yang, Min
    Liu, Guanjun
    Zhou, Ziyuan
    Wang, Jiacun
    DRONES, 2023, 7 (07)
  • [4] Adaptive mean field multi-agent reinforcement learning
    Wang, Xiaoqiang
    Ke, Liangjun
    Zhang, Gewei
    Zhu, Dapeng
    INFORMATION SCIENCES, 2024, 669
  • [5] Causal Mean Field Multi-Agent Reinforcement Learning
    Ma, Hao
    Pu, Zhiqiang
    Pan, Yi
    Liu, Boyin
    Gao, Junlong
    Guo, Zhenyu
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [6] Packet Routing with Graph Attention Multi-Agent Reinforcement Learning
    Mai, Xuan
    Fu, Quanzhi
    Chen, Yi
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [7] A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Pu, Zhiqiang
    Liu, Zhen
    Yi, Jianqiang
    Qiu, Tenghai
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 1257 - 1262
  • [8] Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds
    Hu, Zijian
    Zhuge, Chengxiang
    Ma, Wei
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 363 - 368
  • [9] Caching for Edge Inference at Scale: A Mean Field Multi-Agent Reinforcement Learning Approach
    Lu, Yanqing
    Zhang, Meng
    Tang, Ming
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 332 - 337
  • [10] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
    Chen, Haoqiang
    Liu, Yadong
    Zhou, Zongtan
    Hu, Dewen
    Zhang, Ming
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4195 - 4205