VMAS: A Vectorized Multi-agent Simulator for Collective Robot Learning

Cited by: 0

Authors:

Bettini, Matteo [1]

Kortvelesy, Ryan [1]

Blumenkamp, Jan [1]

Prorok, Amanda [1]

Affiliations:

[1] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England

Source:

DISTRIBUTED AUTONOMOUS ROBOTIC SYSTEMS, DARS 2022 | 2024 / Vol. 28

Funding:

UK Engineering and Physical Sciences Research Council; European Research Council;

Keywords:

simulator; multi-robot learning; vectorization;

DOI:

10.1007/978-3-031-51497-5_4

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

While many multi-robot coordination problems can be solved optimally by exact algorithms, solutions are often not scalable in the number of robots. Multi-Agent Reinforcement Learning (MARL) is gaining increasing attention in the robotics community as a promising solution to tackle such problems. Nevertheless, we still lack the tools that allow us to quickly and efficiently find solutions to large-scale collective learning tasks. In this work, we introduce the Vectorized Multi-Agent Simulator (VMAS). VMAS is an open-source framework designed for efficient MARL benchmarking. It comprises a vectorized 2D physics engine written in PyTorch and a set of twelve challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface. We demonstrate how vectorization enables parallel simulation on accelerated hardware without added complexity. When comparing VMAS to OpenAI MPE, we show how MPE's execution time increases linearly in the number of simulations while VMAS is able to execute 30,000 parallel simulations in under 10 s, proving more than 100x faster. Using VMAS's RLlib interface, we benchmark our multi-robot scenarios using various Proximal Policy Optimization (PPO)-based MARL algorithms. VMAS's scenarios prove challenging in orthogonal ways for state-of-the-art MARL algorithms. The VMAS framework is available at: https://github.com/proroklab/VectorizedMultiAgentSimulator. A video of VMAS scenarios and experiments is available at: https://youtu.be/aaDRYfiesAY
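
To make the vectorization idea in the abstract concrete, the following is a minimal sketch in plain PyTorch. It is not VMAS's physics engine or API: the function names, the point-mass dynamics, and the values of `dt`, `drag`, and `mass` are all assumptions chosen for illustration. It shows one batched tensor update advancing every simulated environment at once, and checks that it matches a per-environment loop.

```python
import torch


def step_batched(pos, vel, force, mass=1.0, dt=0.1, drag=0.25):
    """Advance all environments one step with a single tensor update.

    pos, vel, force have shape (num_envs, num_agents, 2).
    The dynamics and constants are hypothetical, for illustration only.
    """
    vel = vel * (1.0 - drag) + (force / mass) * dt
    pos = pos + vel * dt
    return pos, vel


def step_looped(pos, vel, force, mass=1.0, dt=0.1, drag=0.25):
    """Reference version: the same update, one environment at a time."""
    new_pos = torch.empty_like(pos)
    new_vel = torch.empty_like(vel)
    for i in range(pos.shape[0]):
        new_vel[i] = vel[i] * (1.0 - drag) + (force[i] / mass) * dt
        new_pos[i] = pos[i] + new_vel[i] * dt
    return new_pos, new_vel


torch.manual_seed(0)
num_envs, num_agents = 1000, 4
# Use a GPU if one is present; the code is otherwise identical on CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"

pos = torch.zeros(num_envs, num_agents, 2, device=device)
vel = torch.zeros_like(pos)
force = torch.randn(num_envs, num_agents, 2, device=device)

batched_pos, batched_vel = step_batched(pos, vel, force)
looped_pos, looped_vel = step_looped(pos, vel, force)
```

In this sketch the batched function contains no Python loop over environments, so its cost in interpreter overhead does not grow with `num_envs`, while the looped version's does. This is the general reason a batched formulation can scale better than running simulations one by one.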


Pages: 42-56

Page count: 15

