Differentiable Learning of Scalable Multi-Agent Navigation Policies

被引:4
|
作者
Ye, Xiaohan [1 ,2 ]
Pan, Zherong [1 ]
Gao, Xifeng
Wu, Kui
Ren, Bo [2 ]
机构
[1] Tencent, LightSpeed Studios, Shenzhen 518054, Peoples R China
[2] Nankai Univ, Coll Comp Sci, Tianjin 300350, Peoples R China
关键词
Navigation; Task analysis; Heuristic algorithms; Trajectory; Training; Kernel; Mathematical models; Multi-robot systems; robotics and automation; swarm robotics;
D O I
10.1109/LRA.2023.3248440
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We present an end-to-end differentiable learning algorithm for multi-agent navigation policies. Compared with prior model-free learning algorithms, our method leads to a significant speedup via the gradient information. Our key innovation lies in a novel differentiability analysis of the optimization-based crowd simulation algorithm via the implicit function theorem. Inspired by continuum multi-agent modeling techniques, we further propose a kernel-based policy parameterization, allowing our learned policy to scale up to an arbitrary number of agents without re-training. We evaluate our algorithm on two tasks in obstacle-rich environments, partially labeled navigation and evacuation, for which loss functions can be defined making the entire task learnable in an end-to-end manner. The results show that our method can achieve more than one order of magnitude speedup over model-free baselines and readily scale to unseen target configurations and agent sizes.
引用
收藏
页码:2229 / 2236
页数:8
相关论文
共 50 条
  • [1] Scalable Reinforcement Learning Policies for Multi-Agent Control
    Hsu, Christopher D.
    Jeong, Heejin
    Pappas, George J.
    Chaudhari, Pratik
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4785 - 4791
  • [2] Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems
    Qu, Guannan
    Wierman, Adam
    Li, Na
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 256 - 266
  • [3] Adaptive Learning for Multi-Agent Navigation
    Godoy, Julio
    Karamouzas, Ioannis
    Guy, Stephen J.
    Gini, Maria
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 1577 - 1585
  • [4] Constrained Multi-Agent Reinforcement Learning Policies for Cooperative Intersection Navigation and Traffic Compliance
    Adan, Fahmy
    Feng, Yuxiang
    Angeloudis, Panagiotis
    Quddus, Mohammed
    Ochieng, Washington
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4079 - 4085
  • [5] ALAN: adaptive learning for multi-agent navigation
    Godoy, Julio
    Chen, Tiannan
    Guy, Stephen J.
    Karamouzas, Ioannis
    Gini, Maria
    AUTONOMOUS ROBOTS, 2018, 42 (08) : 1543 - 1562
  • [6] Coordinating Multi-Agent Navigation by Learning Communication
    Hildreth, Dalto N.
    Guy, Stephen J.
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2019, 2 (02)
  • [7] ALAN: adaptive learning for multi-agent navigation
    Julio Godoy
    Tiannan Chen
    Stephen J. Guy
    Ioannis Karamouzas
    Maria Gini
    Autonomous Robots, 2018, 42 : 1543 - 1562
  • [8] Scalable Multi-Agent Reinforcement Learning with General Utilities
    Ying, Donghao
    Ding, Yuhao
    Koppel, Alec
    Lavaei, Javad
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 3977 - 3982
  • [9] Reinforcement learning for multi-agent formation navigation with scalability
    Yalei Gong
    Hongyun Xiong
    MengMeng Li
    Haibo Wang
    Xiaohong Nian
    Applied Intelligence, 2023, 53 : 28207 - 28225
  • [10] Reinforcement learning for multi-agent formation navigation with scalability
    Gong, Yalei
    Xiong, Hongyun
    Li, Mengmeng
    Wang, Haibo
    Nian, Xiaohong
    APPLIED INTELLIGENCE, 2023, 53 (23) : 28207 - 28225