Differentiable Learning of Scalable Multi-Agent Navigation Policies

Cited by: 4

Authors:

Ye, Xiaohan ^{[1,2]}

Pan, Zherong ^{[1]}

Gao, Xifeng

Wu, Kui

Ren, Bo ^{[2]}

Affiliations:

[1] Tencent, LightSpeed Studios, Shenzhen 518054, People's Republic of China

[2] Nankai University, College of Computer Science, Tianjin 300350, People's Republic of China

Source:

IEEE Robotics and Automation Letters | 2023 / Vol. 8 / No. 4

Keywords:

Navigation; Task analysis; Heuristic algorithms; Trajectory; Training; Kernel; Mathematical models; Multi-robot systems; robotics and automation; swarm robotics;

DOI:

10.1109/LRA.2023.3248440

Chinese Library Classification (CLC) number:

TP24 [Robot technology];

Discipline classification code:

080202 ; 1405 ;

Abstract:

We present an end-to-end differentiable learning algorithm for multi-agent navigation policies. Compared with prior model-free learning algorithms, our method leads to a significant speedup via the gradient information. Our key innovation lies in a novel differentiability analysis of the optimization-based crowd simulation algorithm via the implicit function theorem. Inspired by continuum multi-agent modeling techniques, we further propose a kernel-based policy parameterization, allowing our learned policy to scale up to an arbitrary number of agents without re-training. We evaluate our algorithm on two tasks in obstacle-rich environments, partially labeled navigation and evacuation, for which loss functions can be defined making the entire task learnable in an end-to-end manner. The results show that our method can achieve more than one order of magnitude speedup over model-free baselines and readily scale to unseen target configurations and agent sizes.

Pages: 2229-2236

Page count: 8
