Multi-UAV Cooperative Pursuit of a Fast-Moving Target UAV Based on the GM-TD3 Algorithm

被引:0
|
作者
Zhang, Yaozhong [1 ]
Ding, Meiyan [1 ]
Yuan, Yao [1 ]
Zhang, Jiandong [1 ]
Yang, Qiming [1 ]
Shi, Guoqing [1 ]
Jiang, Frank [2 ]
Lu, Meiqu [3 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] Deakin Univ, Fac Sci Engn & Built Environm, Melbourne 3125, Australia
[3] Guangxi Minzu Univ, Sch Artificial Intelligence, Nanning 530006, Peoples R China
关键词
UAV pursuit game; TD3; genetic algorithm; maximum mean discrepancy; evolutionary reinforcement learning;
D O I
10.3390/drones8100557
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Recently, developing multi-UAVs to cooperatively pursue a fast-moving target has become a research hotspot in the current world. Although deep reinforcement learning (DRL) has made a lot of achievements in the UAV pursuit game, there are still some problems such as high-dimensional parameter space, the ease of falling into local optimization, the long training time, and the low task success rate. To solve the above-mentioned issues, we propose an improved twin delayed deep deterministic policy gradient algorithm combining the genetic algorithm and maximum mean discrepancy method (GM-TD3) for multi-UAV cooperative pursuit of high-speed targets. Firstly, this paper combines GA-based evolutionary strategies with TD3 to generate action networks. Then, in order to avoid local optimization in the algorithm training process, the maximum mean difference (MMD) method is used to increase the diversity of the policy population in the updating process of the population parameters. Finally, by setting the sensitivity weights of the genetic memory buffer of UAV individuals, the mutation operator is improved to enhance the stability of the algorithm. In addition, this paper designs a hybrid reward function to accelerate the convergence speed of training. Through simulation experiments, we have verified that the training efficiency of the improved algorithm has been greatly improved, which can achieve faster convergence; the successful rate of the task has reached 95%, and further validated UAVs can better cooperate to complete the pursuit game task.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Multi-UAV Cooperative Search method for a Moving Target on the Ground or Sea
    Qu, Yaohong
    Sun, Ying
    Wang, Kai
    Zhang, Feng
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 4049 - 4054
  • [2] Design of control method for multi-UAV cooperative detection of fast target
    Fu X.
    Chen Z.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (11): : 3295 - 3304
  • [3] Multi-UAV Cooperative Target Tracking Based on Swarm Intelligence
    Xia, Zhaoyue
    Du, Jun
    Jiang, Chunxiao
    Wang, Jingjing
    Ren, Yong
    Li, Gang
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [4] Multi-UAV Cooperative Multi-Target Allocation Method based on Differential Evolutionary Algorithm
    Song, Yuanjie
    Xi, Qingbiao
    Xing, Xiaojun
    Yang, Bing
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 1655 - 1660
  • [5] Research on Multi-uav Cooperative Tracking Target Based on PSO Predictive Control Algorithm
    Wang Xudong
    Zhou Wei
    Wang Xiaowei
    2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2019), 2019, 646
  • [6] Multi-UAV Cooperative Search Planning Algorithm Based on Dynamic Target Probability Model
    Ao, Zihang
    Zhang, Yulong
    Huang, Jing
    Lin, Yichen
    Zhou, Xiaoden
    Zhang, Youmin
    2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 543 - 548
  • [7] Multi-UAV Cooperative Target Allocation Based on AC-DSDE Evolutionary Algorithm
    Huang G.
    Li J.-H.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (01): : 173 - 184
  • [8] Multiple Moving Targets Surveillance Based on a Cooperative Network for Multi-UAV
    Gu, Jingjing
    Su, Tao
    Wang, Qiuhong
    Du, Xiaojiang
    Guizani, Mohsen
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (04) : 82 - 89
  • [9] The Multi-UAV cooperative target tracking simulation system
    Wei, Zhou
    Pei, Wang
    Chao, Wu
    GLOBAL INTELLIGENCE INDUSTRY CONFERENCE (GIIC 2018), 2018, 10835
  • [10] Multi-UAV Cooperative Target Tracking Strategy Based on Formation Control
    Wang, Duo
    Peng, Zhihong
    Ju, Xiaojie
    Yu, Tao
    Wang, Xue
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6224 - 6229