Distributed Multirobot Path Planning Based on MRDWA-MADDPG

Cited by: 8

Authors:

Wu, Qichao [1]

Lin, Rui [1]

Ren, Ziwu [1]

Affiliation:

[1] Soochow Univ, Sch Mech & Elect Engn, Suzhou 215000, Peoples R China

Source:

IEEE SENSORS JOURNAL | 2023 / Vol. 23 / Issue 20

Funding:

National Natural Science Foundation of China

Keywords:

Robots; Multi-robot systems; Robot kinematics; Collision avoidance; Robot sensing systems; Training; Heuristic algorithms; Deep reinforcement learning; multirobot dynamic window approach (MRDWA); multirobot path planning; sensor application; trajectory optimization; UNKNOWN ENVIRONMENTS; REINFORCEMENT;

DOI:

10.1109/JSEN.2023.3310519

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Multirobot path planning in complex environments is a challenging research area. This article proposes a path planning method for multirobot systems based on distributed multiagent deep reinforcement learning. We propose a multirobot dynamic window approach (MRDWA) in which a central controller facilitates sensor information sharing among robots, enabling locally optimal path planning considering the behavior of other robots. We incorporate the output velocity information into the observation function to form an efficient and low-dimensional state representation. Additionally, we employ the multiagent deep deterministic policy gradient (MADDPG) reinforcement learning algorithm to directly map part of the observation information to motion commands for multiple robots, enabling effective obstacle avoidance strategies. An improved action module is developed by using velocity and angular velocity increments and an action selector to refine the output. Furthermore, we introduce a multirobot reward module utilizing heuristic functions to guide the robots to quickly and efficiently identify feasible paths. We also propose a multirobot dynamic constraint reward function to optimize the multirobot trajectories. The MRDWA-MADDPG algorithm is validated through simulations and real-world experiments, demonstrating its effectiveness in diverse complex multirobot path planning scenarios. Our method outperforms conventional algorithms in terms of success rate, arrival time, and overall decision making in complex scenarios. Moreover, our method has a faster computation speed and a shorter training time, produces smoother trajectories, and is easier to deploy on real robots than other learning-based methods.
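
The abstract mentions an action module that works with velocity and angular velocity increments. The following is only a minimal sketch of that general idea, not the authors' implementation: the names `Limits` and `apply_increment`, all numeric limits, and the assumption that the policy emits normalized increments in [-1, 1] are hypothetical. The paper's action selector is not modeled here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Limits:
    """Hypothetical kinematic limits for a differential-drive robot."""
    v_min: float = 0.0      # minimum linear velocity (m/s)
    v_max: float = 1.0      # maximum linear velocity (m/s)
    w_max: float = 1.5      # maximum absolute angular velocity (rad/s)
    a_max: float = 0.5      # maximum linear acceleration (m/s^2)
    alpha_max: float = 2.0  # maximum angular acceleration (rad/s^2)
    dt: float = 0.1         # control period (s)


def clip(x: float, lo: float, hi: float) -> float:
    """Clamp x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def apply_increment(v: float, w: float, dv_norm: float, dw_norm: float,
                    lim: Limits = Limits()) -> tuple[float, float]:
    """Turn normalized increments (assumed to lie in [-1, 1]) into a command.

    Each increment is scaled by the largest velocity change reachable in one
    control period, so the command stays inside the dynamic window around
    (v, w); the result is then clipped to the absolute velocity limits.
    """
    dv = clip(dv_norm, -1.0, 1.0) * lim.a_max * lim.dt
    dw = clip(dw_norm, -1.0, 1.0) * lim.alpha_max * lim.dt
    v_new = clip(v + dv, lim.v_min, lim.v_max)
    w_new = clip(w + dw, -lim.w_max, lim.w_max)
    return v_new, w_new
```

Under these assumptions, calling `apply_increment(0.5, 0.0, 1.0, -1.0)` speeds the robot up by at most `a_max * dt` and turns it by at most `alpha_max * dt` in one step. Bounding the per-step change in this way is one plausible reason an incremental action space could yield smoother trajectories than emitting absolute velocities.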


Pages: 25420-25432

Page count: 13

