Deep reinforcement learning and ant colony optimization supporting multi-UGV path planning and task assignment in 3D environments

Cited by: 0

Authors:

Jin, Binghui [1]

Sun, Yang [1]

Wu, Wenjun [1]

Gao, Qiang [1]

Si, Pengbo [1]

Affiliations:

[1] Beijing Univ Sci & Technol, Sch Informat Engn, Beijing 100124, Peoples R China

Source:

IET INTELLIGENT TRANSPORT SYSTEMS | 2024 / Volume 18 / Issue 09

Keywords:

ant colony optimization; deep reinforcement learning; multiple unmanned ground vehicles; path planning; task assignment; ALGORITHM; FIELD;

DOI:

10.1049/itr2.12535

Chinese Library Classification number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

With the development of artificial intelligence, the application of unmanned ground vehicles (UGVs) in hazardous outdoor scenarios has received increasing attention. However, the terrain in these environments is often complex and undulating, which makes multi-UGV path planning and task assignment (MUPPTA) optimization more challenging. To improve multi-UGV collaboration in 3D environments, a MUPPTA method is proposed based on a double deep Q-learning network (DDQN) and ant colony optimization (ACO) to jointly optimize the path planning and task assignment decisions of multiple UGVs. The authors first comprehensively consider the characteristics of 3D environments and model the MUPPTA problem as a combinatorial optimization problem. To tackle it, the original problem is decomposed into a multi-UGV path planning sub-problem and a task assignment sub-problem, which are solved separately. First, the path planning sub-problem in 3D environments is transformed into a Markov decision process (MDP) model, and a multi-UGV path planning algorithm based on DDQN (MUPP-DDQN) is proposed to obtain the optimal paths and actual path costs between tasks through extensive offline learning and training. Based on these path costs, a multi-UGV task assignment algorithm based on ACO (MUTA-ACO) is then proposed to solve the task assignment sub-problem and obtain the optimal task assignment solution. Simulation results show that the proposed method is more cost-effective and time-saving than the comparison algorithms. This paper focuses on the multi-UGV path planning and task assignment (MUPPTA) problem in 3D environments and proposes a multi-UGV path planning and task assignment method based on double DQN and ACO.
Specifically, the algorithm takes the complex terrain and actual costs in 3D environments into consideration, and an optimization mechanism for multi-UGV path planning and task assignment is established to guide multi-UGV coordination and reduce system costs.


Pages: 1652-1664

Page count: 13
