Multi-UAV Cooperative Pursuit of a Fast-Moving Target UAV Based on the GM-TD3 Algorithm

Cited by: 0
Authors
Zhang, Yaozhong [1]
Ding, Meiyan [1]
Yuan, Yao [1]
Zhang, Jiandong [1]
Yang, Qiming [1]
Shi, Guoqing [1]
Jiang, Frank [2]
Lu, Meiqu [3]
Affiliations
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] Deakin Univ, Fac Sci Engn & Built Environm, Melbourne 3125, Australia
[3] Guangxi Minzu Univ, Sch Artificial Intelligence, Nanning 530006, Peoples R China
Keywords
UAV pursuit game; TD3; genetic algorithm; maximum mean discrepancy; evolutionary reinforcement learning
DOI
10.3390/drones8100557
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology]
Discipline classification codes
081102; 0816; 081602; 083002; 1404
Abstract
Cooperative pursuit of a fast-moving target by multiple UAVs has recently become a research hotspot. Although deep reinforcement learning (DRL) has achieved considerable success in the UAV pursuit game, problems remain: a high-dimensional parameter space, a tendency to fall into local optima, long training times, and a low task success rate. To address these issues, we propose GM-TD3, an improved twin delayed deep deterministic policy gradient (TD3) algorithm that combines the genetic algorithm (GA) with the maximum mean discrepancy (MMD) method, for multi-UAV cooperative pursuit of high-speed targets. First, GA-based evolutionary strategies are combined with TD3 to generate action networks. Second, to avoid local optima during training, the MMD method is used to increase the diversity of the policy population when the population parameters are updated. Third, the mutation operator is improved by setting sensitivity weights for the genetic memory buffer of each UAV individual, which enhances the stability of the algorithm. In addition, a hybrid reward function is designed to accelerate training convergence. Simulation experiments show that the improved algorithm trains considerably more efficiently and converges faster, that the task success rate reaches 95%, and that the UAVs cooperate better in completing the pursuit task.
Pages: 24
Related papers
50 in total
  • [31] Bird Flocking Inspired Methods for Multi-UAV Cooperative Target Search
    Shen, Yankai
    Wei, Chen
    Sun, Yongbin
    Duan, Haibin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (02) : 702 - 706
  • [32] Multi-UAV cooperative target tracking with bounded noise for connectivity preservation
    Zhou, Rui
    Feng, Yu
    Di, Bin
    Zhao, Jiang
    Hu, Yan
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (10) : 1494 - 1503
  • [34] A New Fast Consensus Algorithm Applied in Rendezvous of Multi-UAV
    Xu Wei
    Duan Fengyang
    Zhang Qingjie
    Zhu Bing
    Sun Hongchang
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015 : 55 - 60
  • [35] Game theory based multi-UAV cooperative searching model and fast solution approach
    Du, Ji-Yong
    Zhang, Feng-Ming
    Mao, Hong-Bao
    Liu, Hua-Wei
    Yang, Ji
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2013, 47 (04) : 667 - 673
  • [36] Multi-UAV cooperative target encirclement within an annular virtual tube
    Gao, Yan
    Bai, Chenggang
    Zhang, Lei
    Quan, Quan
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 128
  • [37] Multi-UAV Information Fusion and Cooperative Trajectory Optimization in Target Search
    Yao, Peng
    Wei, Xin
    IEEE SYSTEMS JOURNAL, 2022, 16 (03) : 4325 - 4333
  • [38] A Fast Multi-UAV Cooperative Reconnaissance Method Exploiting Payload Diversity
    Ma, Yinghong
    Li, Xunan
    Jiao, Yi
    Guo, Lin
    Ren, Suping
    Zhang, Qi
    2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022
  • [39] Reinforcement-Learning-Based Multi-UAV Cooperative Search for Moving Targets in 3D Scenarios
    Liu, Yifei
    Li, Xiaoshuai
    Wang, Jian
    Wei, Feiyu
    Yang, Junan
    DRONES, 2024, 8 (08)
  • [40] A multi-objective evolutionary algorithm for multi-UAV cooperative reconnaissance problem
    Tian, Jing
    Shen, Lincheng
    NEURAL INFORMATION PROCESSING, PT 3, PROCEEDINGS, 2006, 4234 : 900 - 909