VMAS: A Vectorized Multi-agent Simulator for Collective Robot Learning

被引:0
|
作者
Bettini, Matteo [1 ]
Kortvelesy, Ryan [1 ]
Blumenkamp, Jan [1 ]
Prorok, Amanda [1 ]
机构
[1] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England
基金
英国工程与自然科学研究理事会; 欧洲研究理事会;
关键词
simulator; multi-robot learning; vectorization;
D O I
10.1007/978-3-031-51497-5_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While many multi-robot coordination problems can be solved optimally by exact algorithms, solutions are often not scalable in the number of robots. Multi-Agent Reinforcement Learning (MARL) is gaining increasing attention in the robotics community as a promising solution to tackle such problems. Nevertheless, we still lack the tools that allow us to quickly and efficiently find solutions to largescale collective learning tasks. In this work, we introduce the Vectorized Multi-Agent Simulator (VMAS). VMAS is an open-source framework designed for efficient MARL benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of twelve challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface. We demonstrate how vectorization enables parallel simulation on accelerated hardware without added complexity. When comparing VMAS to OpenAI MPE, we show how MPE's execution time increases linearly in the number of simulations while VMAS is able to execute 30,000 parallel simulations in under 10 s, proving more than 100x faster. Using VMAS's RLlib interface, we benchmark our multi-robot scenarios using various Proximal Policy Optimization (PPO)-based MARL algorithms. VMAS's scenarios prove challenging in orthogonal ways for state-of-the-art MARL algorithms. The VMAS framework is available at: https://github.com/proroklab/ VectorizedMultiAgentSimulator. A video of VMAS scenarios and experiments is available https://youtu.be/aaDRYfiesAY
引用
收藏
页码:42 / 56
页数:15
相关论文
共 50 条
  • [21] Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems
    Trong Nghia Hoang
    Quang Minh Hoang
    Low, Kian Hsiang
    How, Jonathan
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7850 - 7857
  • [22] Multi-agent learning
    Eduardo Alonso
    Autonomous Agents and Multi-Agent Systems, 2007, 15 : 3 - 4
  • [23] Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey
    Orr, James
    Dutta, Ayan
    SENSORS, 2023, 23 (07)
  • [24] Multi-agent learning
    Alonso, Eduardo
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2007, 15 (01) : 3 - 4
  • [25] Heterogeneous Multi-Robot Cooperation With Asynchronous Multi-Agent Reinforcement Learning
    Zhang, Han
    Zhang, Xiaohui
    Feng, Zhao
    Xiao, Xiaohui
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01): : 159 - 166
  • [26] Process mining of a multi-agent business simulator
    Sohei Ito
    Dominik Vymětal
    Roman Šperka
    Michal Halaška
    Computational and Mathematical Organization Theory, 2018, 24 : 500 - 531
  • [27] People flow prediction by multi-agent simulator
    Sato D.
    Matsubayashi T.
    Adachi T.
    Ooi S.
    Tanaka Y.
    Nagano S.
    Muto Y.
    Shiohara H.
    Miyamoto M.
    Toda H.
    Transactions of the Japanese Society for Artificial Intelligence, 2020, 35 (02)
  • [28] Process mining of a multi-agent business simulator
    Ito, Sohei
    Vymetal, Dominik
    Sperka, Roman
    Halaska, Michal
    COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2018, 24 (04) : 500 - 531
  • [29] People Flow Prediction by Multi-Agent Simulator
    Sato, Daisuke
    Matsubayashi, Tatsushi
    Nagano, Shoichi
    Toda, Hiroyuki
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 436 - 439
  • [30] A multi-agent simulator for generating novelty in monopoly
    Kejriwal, Mayank
    Thomas, Shilpa
    SIMULATION MODELLING PRACTICE AND THEORY, 2021, 112