A Heterogeneous Acceleration System for Attention-Based Multi-Agent Reinforcement Learning

被引：0

作者：

Wiggins, Samuel ^{[1
]}

Meng, Yuan ^{[1
]}

Iyer, Mahesh A. ^{[2
]}

Prasanna, Viktor ^{[1
]}

机构：

[1] Univ Southern Calif, Ming Hsieh Dept Elect & Comp Engn, Los Angeles, CA 90007 USA

[2] Intel Corp, Santa Clara, CA USA

来源：

2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024 | 2024年

基金：

美国国家科学基金会;

关键词：

Multi-Agent Reinforcement Learning; Hardware Accelerator; Heterogeneous Computing;

D O I：

10.1109/FPL64840.2024.00040

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-Agent Reinforcement Learning (MARL) is an emerging technology that has seen success in many AI applications. Multi-Actor-Attention-Critic (MAAC) is a state-of-the-art MARL algorithm that uses a Multi-Head Attention (MHA) mechanism to learn messages communicated among agents during the training process. Current implementations of MAAC using CPU and CPU-GPU platforms lack fine-grained parallelism among agents, sequentially executing each stage of the training loop, and their performance suffers from costly data movement involved in MHA communication learning. In this work, we develop the first high-throughput accelerator for MARL with attention-based communication on a CPU-FPGA heterogeneous system. We alleviate the limitations of existing implementations through a combination of data- and pipeline-parallel modules in our accelerator design and enable fine-grained system scheduling for exploiting concurrency among heterogeneous resources. Our design increases the overall system throughput by 4.6x and 4.1x compared to CPU and CPU-GPU implementations, respectively.

引用

页码：236 / 242

页数：7

共 50 条

[1] Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
Gu, Shanzhi
Geng, Mingyang
Lan, Long
ENTROPY, 2021, 23 (09)
[2] Multi-Agent attention-based deep reinforcement learning for demand response in grid-responsive buildings*
Xie, Jiahan
Ajagekar, Akshay
You, Fengqi
APPLIED ENERGY, 2023, 342
[3] Spatial-Temporal Graph Attention-based Multi-Agent Reinforcement Learning in Cooperative Edge Caching
Hou, Jiacheng
Nayak, Amiya
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3078 - 3083
[4] Hierarchical Attention Master-Slave for heterogeneous multi-agent reinforcement learning
Wang, Jiao
Yuan, Mingrui
Li, Yun
Zhao, Zihui
NEURAL NETWORKS, 2023, 162 : 359 - 368
[5] SparseMAAC: Sparse Attention for Multi-agent Reinforcement Learning
Li, Wenhao
Jin, Bo
Wang, Xiangfeng
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 96 - 110
[6] An overview: Attention mechanisms in multi-agent reinforcement learning
Hu, Kai
Xu, Keer
Xia, Qingfeng
Li, Mingyang
Song, Zhiqiang
Song, Lipeng
Sun, Ning
NEUROCOMPUTING, 2024, 598
[7] Evaluating Multi-Agent Reinforcement Learning on Heterogeneous Platforms
Wiggins, Samuel
Meng, Yuan
Kannan, Rajgopal
Prasanna, Viktor
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
[8] Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning
Li, Yang
Luo, Xiangfeng
Xie, Shaorong
2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 709 - 713
[9] Attention based multi-agent intrusion detection systems using reinforcement learning
Sethi, Kamalakanta
Madhav, Y. Venu
Kumar, Rahul
Bera, Padmalochan
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2021, 61
[10] Emotion-Based Heterogeneous Multi-agent Reinforcement Learning with Sparse Reward
Fang B.
Ma Y.
Wang Z.
Wang H.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (03): : 223 - 231

← 1 2 3 4 5 →