Characterizing and Optimizing the End-to-End Performance of Multi-Agent Reinforcement Learning Systems

被引:0
|
作者
Gogineni, Kailash [1 ]
Mei, Yongsheng [1 ]
Gogineni, Karthikeya
Wei, Peng [1 ]
Lan, Tian [1 ]
Venkataramani, Guru [1 ]
机构
[1] George Washington Univ, Washington, DC 20052 USA
基金
美国国家科学基金会;
关键词
Multi-Agent Systems; Performance Analysis; Reinforcement Learning; Performance Optimization;
D O I
10.1109/IISWC63097.2024.00028
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Multi-Agent Reinforcement Learning Systems (MARL) can unlock the potential to model and control multiple autonomous decision-making agents simultaneously. During online training, MARL algorithms involve performance-intensive computations, such as exploration and exploitation phases originating from a large observation-action space and a huge number of training steps. Understanding and mitigating the MARL performance limiters is key to their practical adoption. In this paper, we first present a detailed workload characterization of MARL workloads under different multi-agent settings. Our experimental analysis identifies a critical performance bottleneck that affects scaling within the mini-batch sampling on transition data. To mitigate this issue, we explore a series of optimization strategies. First, we investigate cache locality-aware sampling that prioritizes intra-agent neighbor transitions over other randomly picked transition data samples within the baseline MARL algorithms. Next, we explore importance sampling techniques that preserve the learning performance/distribution and capture the neighbors of important transitions. Finally, we design an additional algorithmic optimization that reorganizes the transition data layout to improve the cache locality between different agents during the mini-batch sampling process. We evaluate our optimizations using popular MARL workloads on multi-agent particle games. Our work highlights several opportunities for enhancing the performance of multi-agent systems, with end-to-end training time improvements ranging from 8.2% (3 agents) to 20.5% (24 agents) compared to the baseline MADDPG, affirming the usefulness of deeply understanding MARL performance bottlenecks and mitigating them effectively.
引用
收藏
页码:224 / 235
页数:12
相关论文
共 50 条
  • [1] End-to-end Deep Reinforcement Learning for Multi-agent Collaborative Exploration
    Chen, Zichen
    Subagdja, Budhitama
    Tan, Ah-Hwee
    2019 IEEE INTERNATIONAL CONFERENCE ON AGENTS (ICA), 2019, : 99 - 102
  • [2] WarpDrive: Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU
    Lan, Tian
    Srinivasa, Sunil
    Wang, Huan
    Zheng, Stephan
    Journal of Machine Learning Research, 2022, 23
  • [3] End-to-End control of USV swarm using graph centric Multi-agent Reinforcement Learning
    Lee, Kanghoon
    Ahn, Kyuree
    Park, Jinkyoo
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 925 - 929
  • [4] Multi-Agent Reinforcement Learning-Based Resource Management for End-to-End Network Slicing
    Kim, Yohan
    Lim, Hyuk
    IEEE ACCESS, 2021, 9 : 56178 - 56190
  • [5] End-to-end Control of Kart Agent with Deep Reinforcement Learning
    Zhang Ruiming
    Liu Chengju
    Chen Qijun
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1688 - 1693
  • [6] Dynamic and efficient resource allocation for 5G end-to-end network slicing: A multi-agent deep reinforcement learning approach
    Asim Ejaz, Muhammad
    Wu, Guowei
    Iqbal, Tahir
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2024, 37 (17)
  • [7] Utility-Driven End-to-End Network Slicing for Diverse IoT Users in MEC: A Multi-Agent Deep Reinforcement Learning Approach
    Ejaz, Muhammad Asim
    Wu, Guowei
    Ahmed, Adeel
    Iftikhar, Saman
    Bawazeer, Shaikhan
    SENSORS, 2024, 24 (17)
  • [8] On using multi-agent systems in end to end adaptive monitoring
    Merghem, L
    Gaiti, D
    Pujolle, G
    MANAGEMENT OF MULTIMEDIA NETWORKS AND SERVICES, PROCEEDINGS, 2003, 2839 : 422 - 435
  • [9] End-to-End Autonomous Driving With Semantic Depth Cloud Mapping and Multi-Agent
    Natan, Oskar
    Miura, Jun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 557 - 571
  • [10] Optimizing end-to-end distortion in MIMO systems
    Holliday, T
    Goldsmith, A
    2005 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), VOLS 1 AND 2, 2005, : 1671 - 1675