Distributed Multirobot Path Planning Based on MRDWA-MADDPG

被引:8
|
作者
Wu, Qichao [1 ]
Lin, Rui [1 ]
Ren, Ziwu [1 ]
机构
[1] Soochow Univ, Sch Mech & Elect Engn, Suzhou 215000, Peoples R China
基金
中国国家自然科学基金;
关键词
Robots; Multi-robot systems; Robot kinematics; Collision avoidance; Robot sensing systems; Training; Heuristic algorithms; Deep reinforcement learning; multirobot dynamic window approach (MRDWA); multirobot path planning; sensor application; trajectory optimization; UNKNOWN ENVIRONMENTS; REINFORCEMENT;
D O I
10.1109/JSEN.2023.3310519
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multirobot path planning in complex environments is a challenging research area. This article proposes a path planning method for multirobot systems based on distributed multiagent deep reinforcement learning. We propose a multirobot dynamic window approach (MRDWA) in which a central controller facilitates sensor information sharing among robots, enabling locally optimal path planning considering the behavior of other robots. We incorporate the output velocity information into the observation function to form an efficient and low-dimensional state representation. Additionally, we employ the multiagent deep deterministic policy gradient (MADDPG) reinforcement learning algorithm to directly map part of the observation information to motion commands for multiple robots, enabling effective obstacle avoidance strategies. An improved action module is developed by using velocity and angular velocity increments and an action selector to refine the output. Furthermore, we introduce a multirobot reward module utilizing heuristic functions to guide the robots to quickly and efficiently identify feasible paths. We also propose a multirobot dynamic constraint reward function to optimize the multirobot trajectories. The MRDWA-MADDPG algorithm is validated through simulations and real-world experiments, demonstrating its effectiveness in diverse complex multirobot path planning scenarios. Our method outperforms conventional algorithms in terms of success rate, arrival time, and overall decision making in complex scenarios. Moreover, our method has a faster computation speed and a shorter training time, produces smoother trajectories, and is easier to deploy on real robots than other learning-based methods.
引用
收藏
页码:25420 / 25432
页数:13
相关论文
共 50 条
  • [41] Deep Reinforcement Learning With Multicritic TD3 for Decentralized Multirobot Path Planning
    Yin, Heqing
    Wang, Chang
    Yan, Chao
    Xiang, Xiaojia
    Cai, Boliang
    Wei, Changyun
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (04) : 1233 - 1247
  • [42] An Application of Self-Organizing Map for Multirobot Multigoal Path Planning with Minmax Objective
    Faigl, Jan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [43] Distributed Multirobot Exploration Based on Scene Partitioning and Frontier Selection
    Lopez-Perez, Jose J.
    Hernandez-Belmonte, Uriel H.
    Ramirez-Paredes, Juan-Pablo
    Contreras-Cruz, Marco A.
    Ayala-Ramirez, Victor
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [44] Obstacle Avoidance in Distributed Optimal Coordination of Multirobot Systems: A Trajectory Planning and Tracking Strategy
    An, Liwei
    Yang, Guang-Hong
    Wasly, Saud
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2024, 11 (03): : 1335 - 1344
  • [45] Random-Finite-Set-Based Distributed Multirobot SLAM
    Gao, Lin
    Battistelli, Giorgio
    Chisci, Luigi
    IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (06) : 1758 - 1777
  • [46] Multirobot Task Planning Method Based on the Energy Penalty Strategy
    Liang, Lidong
    Zhu, Liangheng
    Jia, Wenyou
    Cheng, Xiaoliang
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [47] PARALLEL PATH PLANNING ON THE DISTRIBUTED ARRAY PROCESSOR
    SHU, C
    BUXTON, H
    PARALLEL COMPUTING, 1995, 21 (11) : 1749 - 1767
  • [48] Distributed Path Planning of Swarm Mobile Robots
    Lee, Ya-Ting
    Zeng, Song-Fung
    Chiu, Chian-Song
    2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 49 - 54
  • [49] Distributed neuro-evolutionary path planning
    Dozier, G
    Tunstel, E
    Homaifar, A
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 586 - 589
  • [50] A type of biased consensus-based distributed neural network for path planning
    Yinyan Zhang
    Shuai Li
    Hongliang Guo
    Nonlinear Dynamics, 2017, 89 : 1803 - 1815