Multi-UAV Cooperative Pursuit of a Fast-Moving Target UAV Based on the GM-TD3 Algorithm

被引:0
|
作者
Zhang, Yaozhong [1 ]
Ding, Meiyan [1 ]
Yuan, Yao [1 ]
Zhang, Jiandong [1 ]
Yang, Qiming [1 ]
Shi, Guoqing [1 ]
Jiang, Frank [2 ]
Lu, Meiqu [3 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] Deakin Univ, Fac Sci Engn & Built Environm, Melbourne 3125, Australia
[3] Guangxi Minzu Univ, Sch Artificial Intelligence, Nanning 530006, Peoples R China
关键词
UAV pursuit game; TD3; genetic algorithm; maximum mean discrepancy; evolutionary reinforcement learning;
D O I
10.3390/drones8100557
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Recently, developing multi-UAVs to cooperatively pursue a fast-moving target has become a research hotspot in the current world. Although deep reinforcement learning (DRL) has made a lot of achievements in the UAV pursuit game, there are still some problems such as high-dimensional parameter space, the ease of falling into local optimization, the long training time, and the low task success rate. To solve the above-mentioned issues, we propose an improved twin delayed deep deterministic policy gradient algorithm combining the genetic algorithm and maximum mean discrepancy method (GM-TD3) for multi-UAV cooperative pursuit of high-speed targets. Firstly, this paper combines GA-based evolutionary strategies with TD3 to generate action networks. Then, in order to avoid local optimization in the algorithm training process, the maximum mean difference (MMD) method is used to increase the diversity of the policy population in the updating process of the population parameters. Finally, by setting the sensitivity weights of the genetic memory buffer of UAV individuals, the mutation operator is improved to enhance the stability of the algorithm. In addition, this paper designs a hybrid reward function to accelerate the convergence speed of training. Through simulation experiments, we have verified that the training efficiency of the improved algorithm has been greatly improved, which can achieve faster convergence; the successful rate of the task has reached 95%, and further validated UAVs can better cooperate to complete the pursuit game task.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Multi-Conflict-Based Optimal Algorithm for Multi-UAV Cooperative Path Planning
    Liu, Xiaoxiong
    Su, Yuzhan
    Wu, Yan
    Guo, Yicong
    DRONES, 2023, 7 (03)
  • [22] Distributed Cooperative Control Algorithm for Multi-UAV Mission Rendezvous
    Liu Guoliang
    Xing Dongjing
    Hou Jianyong
    Jin Guting
    Zhen Ziyang
    Transactions of Nanjing University of Aeronautics and Astronautics, 2017, 34 (06) : 617 - 626
  • [23] Multi-UAV Cooperative Trajectory Planning Based on the Modified Cheetah Optimization Algorithm
    Fu, Yuwen
    Yang, Shuai
    Liu, Bo
    Xia, E.
    Huang, Duan
    ENTROPY, 2023, 25 (09)
  • [24] Track planning of multi-UAV cooperative reconnaissance based on improved genetic algorithm
    Li W.
    Hu Y.
    Pang Q.
    Li Y.
    Jia H.
    Hu, Yongjiang (huyongjiang_jxxy@163.com), 1600, Editorial Department of Journal of Chinese Inertial Technology (28): : 248 - 255
  • [25] Receding Horizon Multi-UAV Cooperative Tracking of Moving RF Source
    Koohifar, Farshad
    Kumbhar, Abhaykumar
    Guvenc, Ismail
    IEEE COMMUNICATIONS LETTERS, 2017, 21 (06) : 1433 - 1436
  • [26] Hierarchical probabilistic graphical models for multi-UAV cooperative pursuit in dynamic environments
    Huang, Yixin
    Xiang, Xiaojia
    Yan, Chao
    Zhou, Han
    Tang, Dengqing
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2025, 185
  • [27] Multi-UAV collaborative system with a feature fast matching algorithm
    Wang, Tian-miao
    Zhang, Yi-cheng
    Liang, Jian-hong
    Chen, Yang
    Wang, Chao-lei
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (12) : 1695 - 1712
  • [28] Multi-UAV collaborative system with a feature fast matching algorithm
    Tian-miao Wang
    Yi-cheng Zhang
    Jian-hong Liang
    Yang Chen
    Chao-lei Wang
    Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 1695 - 1712
  • [29] Research on Distributed Control-Based Multi-UAV Cooperative Target Coverage Method
    Tao, Zhonglaing
    Zhou, Yihui
    2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, : 485 - 489
  • [30] A multi-UAV fast search path planning algorithm research
    Yu, Xiang
    Wang, Binbin
    Wang, Ziyi
    Deng, Fuigui
    2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,