Distributed Multirobot Path Planning Based on MRDWA-MADDPG

Cited by: 8

Authors:

Wu, Qichao [1]

Lin, Rui [1]

Ren, Ziwu [1]

Affiliations:

[1] Soochow Univ, Sch Mech & Elect Engn, Suzhou 215000, Peoples R China

Source:

IEEE SENSORS JOURNAL | 2023, Vol. 23, No. 20

Funding:

National Natural Science Foundation of China;

Keywords:

Robots; Multi-robot systems; Robot kinematics; Collision avoidance; Robot sensing systems; Training; Heuristic algorithms; Deep reinforcement learning; multirobot dynamic window approach (MRDWA); multirobot path planning; sensor application; trajectory optimization; UNKNOWN ENVIRONMENTS; REINFORCEMENT;

DOI:

10.1109/JSEN.2023.3310519

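Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes: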

0808 ; 0809 ;

Abstract:

Multirobot path planning in complex environments is a challenging research area. This article proposes a path planning method for multirobot systems based on distributed multiagent deep reinforcement learning. We propose a multirobot dynamic window approach (MRDWA) in which a central controller facilitates sensor information sharing among robots, enabling locally optimal path planning considering the behavior of other robots. We incorporate the output velocity information into the observation function to form an efficient and low-dimensional state representation. Additionally, we employ the multiagent deep deterministic policy gradient (MADDPG) reinforcement learning algorithm to directly map part of the observation information to motion commands for multiple robots, enabling effective obstacle avoidance strategies. An improved action module is developed by using velocity and angular velocity increments and an action selector to refine the output. Furthermore, we introduce a multirobot reward module utilizing heuristic functions to guide the robots to quickly and efficiently identify feasible paths. We also propose a multirobot dynamic constraint reward function to optimize the multirobot trajectories. The MRDWA-MADDPG algorithm is validated through simulations and real-world experiments, demonstrating its effectiveness in diverse complex multirobot path planning scenarios. Our method outperforms conventional algorithms in terms of success rate, arrival time, and overall decision making in complex scenarios. Moreover, our method has a faster computation speed and a shorter training time, produces smoother trajectories, and is easier to deploy on real robots than other learning-based methods.
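The abstract mentions an action module that works with velocity and angular-velocity increments rather than absolute commands. The record gives no further detail, so the following is only a minimal sketch of that general idea and not the authors' implementation. The names `Limits` and `apply_increment`, the normalized action range [-1, 1], and all numeric limits are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Limits:
    # All values are hypothetical; the abstract does not state robot limits.
    v_min: float = 0.0    # minimum linear velocity (m/s)
    v_max: float = 1.0    # maximum linear velocity (m/s)
    w_max: float = 1.5    # maximum absolute angular velocity (rad/s)
    dv_max: float = 0.1   # largest linear-velocity change per step (m/s)
    dw_max: float = 0.3   # largest angular-velocity change per step (rad/s)


def clamp(x: float, lo: float, hi: float) -> float:
    """Restrict x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def apply_increment(v: float, w: float, a_v: float, a_w: float,
                    lim: Limits = Limits()) -> tuple[float, float]:
    """Turn a normalized policy output (a_v, a_w) into a bounded command.

    The policy output is assumed to lie in [-1, 1]; it is scaled to a
    per-step increment and added to the current command, and the result
    is clamped to the robot's velocity limits.
    """
    dv = clamp(a_v, -1.0, 1.0) * lim.dv_max
    dw = clamp(a_w, -1.0, 1.0) * lim.dw_max
    v_new = clamp(v + dv, lim.v_min, lim.v_max)
    w_new = clamp(w + dw, -lim.w_max, lim.w_max)
    return v_new, w_new
```

Because each step can change the command by at most `dv_max` and `dw_max`, consecutive commands stay close together, which is one plausible reason an incremental action space would yield the smoother trajectories the abstract reports.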


Pages: 25420-25432

Page count: 13
