A Heterogeneous Acceleration System for Attention-Based Multi-Agent Reinforcement Learning

被引:0
|
作者
Wiggins, Samuel [1 ]
Meng, Yuan [1 ]
Iyer, Mahesh A. [2 ]
Prasanna, Viktor [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect & Comp Engn, Los Angeles, CA 90007 USA
[2] Intel Corp, Santa Clara, CA USA
基金
美国国家科学基金会;
关键词
Multi-Agent Reinforcement Learning; Hardware Accelerator; Heterogeneous Computing;
D O I
10.1109/FPL64840.2024.00040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-Agent Reinforcement Learning (MARL) is an emerging technology that has seen success in many AI applications. Multi-Actor-Attention-Critic (MAAC) is a state-of-the-art MARL algorithm that uses a Multi-Head Attention (MHA) mechanism to learn messages communicated among agents during the training process. Current implementations of MAAC using CPU and CPU-GPU platforms lack fine-grained parallelism among agents, sequentially executing each stage of the training loop, and their performance suffers from costly data movement involved in MHA communication learning. In this work, we develop the first high-throughput accelerator for MARL with attention-based communication on a CPU-FPGA heterogeneous system. We alleviate the limitations of existing implementations through a combination of data- and pipeline-parallel modules in our accelerator design and enable fine-grained system scheduling for exploiting concurrency among heterogeneous resources. Our design increases the overall system throughput by 4.6x and 4.1x compared to CPU and CPU-GPU implementations, respectively.
引用
收藏
页码:236 / 242
页数:7
相关论文
共 50 条
  • [1] Attention-Based Fault-Tolerant Approach for Multi-Agent Reinforcement Learning Systems
    Gu, Shanzhi
    Geng, Mingyang
    Lan, Long
    ENTROPY, 2021, 23 (09)
  • [2] Multi-Agent attention-based deep reinforcement learning for demand response in grid-responsive buildings*
    Xie, Jiahan
    Ajagekar, Akshay
    You, Fengqi
    APPLIED ENERGY, 2023, 342
  • [3] Spatial-Temporal Graph Attention-based Multi-Agent Reinforcement Learning in Cooperative Edge Caching
    Hou, Jiacheng
    Nayak, Amiya
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3078 - 3083
  • [4] Hierarchical Attention Master-Slave for heterogeneous multi-agent reinforcement learning
    Wang, Jiao
    Yuan, Mingrui
    Li, Yun
    Zhao, Zihui
    NEURAL NETWORKS, 2023, 162 : 359 - 368
  • [5] SparseMAAC: Sparse Attention for Multi-agent Reinforcement Learning
    Li, Wenhao
    Jin, Bo
    Wang, Xiangfeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 96 - 110
  • [6] An overview: Attention mechanisms in multi-agent reinforcement learning
    Hu, Kai
    Xu, Keer
    Xia, Qingfeng
    Li, Mingyang
    Song, Zhiqiang
    Song, Lipeng
    Sun, Ning
    NEUROCOMPUTING, 2024, 598
  • [7] Evaluating Multi-Agent Reinforcement Learning on Heterogeneous Platforms
    Wiggins, Samuel
    Meng, Yuan
    Kannan, Rajgopal
    Prasanna, Viktor
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [8] Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning
    Li, Yang
    Luo, Xiangfeng
    Xie, Shaorong
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 709 - 713
  • [9] Attention based multi-agent intrusion detection systems using reinforcement learning
    Sethi, Kamalakanta
    Madhav, Y. Venu
    Kumar, Rahul
    Bera, Padmalochan
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2021, 61
  • [10] Emotion-Based Heterogeneous Multi-agent Reinforcement Learning with Sparse Reward
    Fang B.
    Ma Y.
    Wang Z.
    Wang H.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (03): : 223 - 231