Distributed Multirobot Path Planning Based on MRDWA-MADDPG

被引:8
|
作者
Wu, Qichao [1 ]
Lin, Rui [1 ]
Ren, Ziwu [1 ]
机构
[1] Soochow Univ, Sch Mech & Elect Engn, Suzhou 215000, Peoples R China
基金
中国国家自然科学基金;
关键词
Robots; Multi-robot systems; Robot kinematics; Collision avoidance; Robot sensing systems; Training; Heuristic algorithms; Deep reinforcement learning; multirobot dynamic window approach (MRDWA); multirobot path planning; sensor application; trajectory optimization; UNKNOWN ENVIRONMENTS; REINFORCEMENT;
D O I
10.1109/JSEN.2023.3310519
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multirobot path planning in complex environments is a challenging research area. This article proposes a path planning method for multirobot systems based on distributed multiagent deep reinforcement learning. We propose a multirobot dynamic window approach (MRDWA) in which a central controller facilitates sensor information sharing among robots, enabling locally optimal path planning considering the behavior of other robots. We incorporate the output velocity information into the observation function to form an efficient and low-dimensional state representation. Additionally, we employ the multiagent deep deterministic policy gradient (MADDPG) reinforcement learning algorithm to directly map part of the observation information to motion commands for multiple robots, enabling effective obstacle avoidance strategies. An improved action module is developed by using velocity and angular velocity increments and an action selector to refine the output. Furthermore, we introduce a multirobot reward module utilizing heuristic functions to guide the robots to quickly and efficiently identify feasible paths. We also propose a multirobot dynamic constraint reward function to optimize the multirobot trajectories. The MRDWA-MADDPG algorithm is validated through simulations and real-world experiments, demonstrating its effectiveness in diverse complex multirobot path planning scenarios. Our method outperforms conventional algorithms in terms of success rate, arrival time, and overall decision making in complex scenarios. Moreover, our method has a faster computation speed and a shorter training time, produces smoother trajectories, and is easier to deploy on real robots than other learning-based methods.
引用
收藏
页码:25420 / 25432
页数:13
相关论文
共 50 条
  • [21] Hierarchical Area-Based and Path-Based Heuristic Approaches for Multirobot Coverage Path Planning with Performance Analysis in Surveillance Systems
    Gong, Junghwan
    Lee, Seunghwan
    SENSORS, 2023, 23 (20)
  • [22] Optimal Multirobot Path Planning on Graphs: Complete Algorithms and Effective Heuristics
    Yu, Jingjin
    LaValle, Steven M.
    IEEE TRANSACTIONS ON ROBOTICS, 2016, 32 (05) : 1163 - 1177
  • [23] A Novel Cooperative Path Planning for Multirobot Persistent Coverage in Complex Environments
    Tang, Yuan
    Zhou, Rui
    Sun, Guibin
    Di, Bin
    Xiong, Rongling
    IEEE SENSORS JOURNAL, 2020, 20 (08) : 4485 - 4495
  • [24] Multirobot conflict coordination and dynamic obstacle avoidance cooperative path planning
    Liu, Yuting
    Ding, Qiangqiang
    Zou, Yunhe
    Guo, Shijie
    Tang, Shufeng
    INTELLIGENT SERVICE ROBOTICS, 2025,
  • [25] A Multirobot Path-Planning Strategy for Autonomous Wilderness Search and Rescue
    Macwan, Ashish
    Vilela, Julio
    Nejat, Goldie
    Benhabib, Beno
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (09) : 1784 - 1797
  • [26] Multirobot Simultaneous Path Planning and Task Assignment on Graphs with Stochastic Costs
    Yang, Fan
    Chakraborty, Nilanjan
    2019 INTERNATIONAL SYMPOSIUM ON MULTI-ROBOT AND MULTI-AGENT SYSTEMS (MRS 2019), 2019, : 89 - 91
  • [27] ODrM* Optimal Multirobot Path Planning in Low Dimensional Search Spaces
    Ferner, Cornelia
    Wagner, Glenn
    Choset, Howie
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 3854 - 3859
  • [28] Fire Evacuation Path Planning Based on Improved MADDPG (Multi-Agent Deep Deterministic Policy Gradient) Algorithm
    Huang, Qiong
    Si, Ying
    Wang, Haoyu
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 387 - 395
  • [29] A distributed multirobot system based on edutainment robots
    Fernandez, J
    Lopez, P
    Oliva, J
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 4260 - 4265
  • [30] Genetic algorithms based multirobot trajectory planning
    Cruz-Martín, A
    Muñoz, VF
    García-Cerezo, A
    Robotics: Trends, Principles and Applications, Vol 15, 2004, 15 : 155 - 160