GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement Learning

被引：5

作者：

Hao, Qianyue ^{[1
]}

Huang, Wenzhen ^{[1
]}

Feng, Tao ^{[1
]}

Yuan, Jian ^{[1
]}

Li, Yong ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, BNRist, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023 | 2023年

基金：

中国国家自然科学基金;

关键词：

Multi-agent reinforcement learning; large-scale decision problem; mean field; graph attention; GAME;

D O I：

10.1145/3580305.3599359

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent advancements in reinforcement learning have witnessed remarkable achievements by intelligent agents ranging from game-playing to industrial applications. Of particular interest is the area of multi-agent reinforcement learning (MARL), which holds significant potential for real-world scenarios. However, typical MARL methods are limited in their ability to handle tens of agents, leaving scenarios with up to hundreds or even thousands of agents almost unexplored. The scaling up of the number of agents presents two primary challenges: (1) agent-agent interactions are crucial in multi-agent systems while the number of interactions grows quadratically with the number of agents, resulting in substantial computational complexity and difficulty in strategies-learning; (2) the strengths of interactions among agents exhibit variations both across agents and over time, making it difficult to precisely model such interactions. In this paper, we propose a novel approach named Graph Attention Mean Field (GAT-MF). By converting agent-agent interactions into interactions between each agent and a weighted mean field, we achieve a substantial reduction in computational complexity. The proposed method offers a precise modeling of interaction dynamics with mathematical proofs of its correctness. Additionally, we design a graph attention mechanism to automatically capture the diverse and time-varying strengths of interactions, ensuring an accurate representation of agent interactions. Through extensive experimentation conducted in both manual and real-world scenarios involving over 3000 agents, we validate the efficacy of our method. The results demonstrate that our method outperforms the best baseline method with a remarkable improvement of 42.7%. Furthermore, our method saves 86.4% training time and 19.2% GPU memory compared to the best baseline method.

引用

页码：685 / 697

页数：13

共 50 条

[1] Mean Field Multi-Agent Reinforcement Learning
Yang, Yaodong
Luo, Rui
Li, Minne
Zhou, Ming
Zhang, Weinan
Wang, Jun
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[2] A Weighted Mean Field Reinforcement Learning Algorithm for Large-Scale Multi-Agent Collaboration
Xinwei Yuan
He Wang
Wenwu Yu
Guidance,Navigation and Control, 2023, (02) : 42 - 60
[3] Partially Observable Mean Field Multi-Agent Reinforcement Learning Based on Graph Attention Network for UAV Swarms
Yang, Min
Liu, Guanjun
Zhou, Ziyuan
Wang, Jiacun
DRONES, 2023, 7 (07)
[4] Adaptive mean field multi-agent reinforcement learning
Wang, Xiaoqiang
Ke, Liangjun
Zhang, Gewei
Zhu, Dapeng
INFORMATION SCIENCES, 2024, 669
[5] Causal Mean Field Multi-Agent Reinforcement Learning
Ma, Hao
Pu, Zhiqiang
Pan, Yi
Liu, Boyin
Gao, Junlong
Guo, Zhenyu
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[6] Packet Routing with Graph Attention Multi-Agent Reinforcement Learning
Mai, Xuan
Fu, Quanzhi
Chen, Yi
2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[7] A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation
Wang, Huimu
Pu, Zhiqiang
Liu, Zhen
Yi, Jianqiang
Qiu, Tenghai
2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 1257 - 1262
[8] Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds
Hu, Zijian
Zhuge, Chengxiang
Ma, Wei
2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 363 - 368
[9] Caching for Edge Inference at Scale: A Mean Field Multi-Agent Reinforcement Learning Approach
Lu, Yanqing
Zhang, Meng
Tang, Ming
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 332 - 337
[10] GAMA: Graph Attention Multi-agent reinforcement learning algorithm for cooperation
Chen, Haoqiang
Liu, Yadong
Zhou, Zongtan
Hu, Dewen
Zhang, Ming
APPLIED INTELLIGENCE, 2020, 50 (12) : 4195 - 4205

← 1 2 3 4 5 →