Reinforcement learning for multi-agent formation navigation with scalability

被引:3
|
作者
Gong, Yalei [1 ]
Xiong, Hongyun [1 ]
Li, Mengmeng [1 ]
Wang, Haibo [1 ]
Nian, Xiaohong [1 ]
机构
[1] Cent South Univ, Clustered Unmanned Syst Res Inst, Sch Automat, Changsha 410073, Hunan, Peoples R China
关键词
Deep reinforcement learning; Multi-agent formations; Collision avoidance; Scalability; AVOIDANCE;
D O I
10.1007/s10489-023-05007-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the multi-agent formation obstacle avoidance (MAFOA) problem using multi-agent deep reinforcement learning (MADRL). MAFOA control aims to achieve and maintain a desired formation while avoiding collisions among agents or with obstacles. It is a research hotspot in multi-agent cooperation due to its wide applications and challenges. However, current MADRL methods face two major difficulties in solving this problem: 1) the high complexity and uncertainty of the environment when there are many agents; 2) the lack of scalability when the number of agents varies. To overcome these difficulties, we propose: 1) A local multi-agent deep deterministic policy gradient algorithm that allows each agent to learn from its local neighbors' strategies during training and act independently during execution; 2) A reinforcement learning framework based on local information that uses partial observation as input and adapts to different numbers of agents; 3) A hybrid control method that switches between reinforcement learning and PID control to ensure formation stability. We evaluate our method on the multiagent particle environment environment and compare it with other algorithms to demonstrate its feasibility and superiority for solving the MAFOA problem.
引用
收藏
页码:28207 / 28225
页数:19
相关论文
共 50 条
  • [41] Coordinating Multi-Agent Navigation by Learning Communication
    Hildreth, Dalto N.
    Guy, Stephen J.
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2019, 2 (02)
  • [42] ALAN: adaptive learning for multi-agent navigation
    Julio Godoy
    Tiannan Chen
    Stephen J. Guy
    Ioannis Karamouzas
    Maria Gini
    Autonomous Robots, 2018, 42 : 1543 - 1562
  • [43] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
    Cassano, Lucas
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3062 - 3066
  • [44] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
    Xu, Dongsheng
    Qiao, Peng
    Dou, Yong
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551
  • [45] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
    Foerster, Jakob N.
    Assael, Yannis M.
    de Freitas, Nando
    Whiteson, Shimon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [46] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Zhang, Bin
    Li, Dapeng
    Zhang, Zeren
    Zhou, Guangchong
    Chen, Hao
    Fan, Guoliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
  • [47] Concept Learning for Interpretable Multi-Agent Reinforcement Learning
    Zabounidis, Renos
    Campbell, Joseph
    Stepputtis, Simon
    Hughes, Dana
    Sycara, Katia
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1828 - 1837
  • [48] Learning structured communication for multi-agent reinforcement learning
    Sheng, Junjie
    Wang, Xiangfeng
    Jin, Bo
    Yan, Junchi
    Li, Wenhao
    Chang, Tsung-Hui
    Wang, Jun
    Zha, Hongyuan
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
  • [49] Learning structured communication for multi-agent reinforcement learning
    Junjie Sheng
    Xiangfeng Wang
    Bo Jin
    Junchi Yan
    Wenhao Li
    Tsung-Hui Chang
    Jun Wang
    Hongyuan Zha
    Autonomous Agents and Multi-Agent Systems, 2022, 36
  • [50] Generalized learning automata for multi-agent reinforcement learning
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324