The Optimal Strategies of Maneuver Decision in Air Combat of UCAV Based on the Improved TD3 Algorithm

被引:2
|
作者
Gao, Xianzhong [1 ]
Zhang, Yue [2 ]
Wang, Baolai [3 ]
Leng, Zhihui [4 ]
Hou, Zhongxi [1 ,2 ]
机构
[1] Natl Univ Def Technol, Test Ctr, Xian 710106, Peoples R China
[2] Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Peoples R China
[3] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
[4] Jiangxi Hongdu Aviat Ind Grp Co Ltd, Nanchang 330096, Peoples R China
关键词
unmanned combat aerial vehicles (UCAVs); maneuver decision-making; autonomous air combat; deep reinforcement learning; scenario-transfer training;
D O I
10.3390/drones8090501
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Nowadays, unmanned aerial vehicles (UAVs) pose a significant challenge to air defense systems. Unmanned combat aerial vehicles (UCAVs) have been proven to be an effective method to counter the threat of UAVs in application. Therefore, maneuver decision-making has become the crucial technology to achieve autonomous air combat for UCAVs. In order to solve the problem of maneuver decision-making, an autonomous model of UCAVs based on the deep reinforcement learning method was proposed in this paper. Firstly, the six-degree-of-freedom (DoF) dynamic model was built in three-dimensional space, and the continuous actions of tangential overload, normal overload, and roll angle were selected as the maneuver inputs. Secondly, to improve the convergence speed for the deep reinforcement learning method, the idea of "scenario-transfer training" was introduced into the twin delayed deep deterministic (TD3) policy gradient algorithm, the results showing that the improved algorithm could cut off about 60% of the training time. Thirdly, for the "nose-to-nose turns", which is one of the classical maneuvers for experienced pilots, the optimal maneuver generated by the proposed method was analyzed. The results showed that the maneuver strategy obtained by the proposed method was highly consistent with that made by experienced fighter pilots. This is also the first time in a public article that compared the maneuver decisions made by the deep reinforcement learning method with experienced fighter pilots. This research can provide some meaningful references to generate autonomous decision-making strategies for UCAVs.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Multi-UAVs Air Combat Maneuver Decision based on MADDPG Algorithm with Introduced Update Switcher
    Wang, Kum
    Cai, Meng
    Li, Jianxun
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 88 - 93
  • [22] Close air combat maneuver decision based on deep stochastic game
    Ma W.
    Li H.
    Wang Z.
    Huang Z.
    Wu Z.
    Chen X.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (02): : 443 - 451
  • [23] Maneuver decision of UAV in air combat based on deterministic policy gradient
    Guo, Junxiao
    Wang, Zihan
    Lan, Jun
    Dong, Bingchen
    Li, Ran
    Yang, Qiming
    Zhang, Jiandong
    2022 IEEE 17TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA, 2022, : 243 - 248
  • [24] Air Combat Maneuver Decision Method Based on A3C Deep Reinforcement Learning
    Fan, Zihao
    Xu, Yang
    Kang, Yuhang
    Luo, Delin
    MACHINES, 2022, 10 (11)
  • [25] Moving Time UCAV Maneuver Decision Based on the Dynamic Relational Weight Algorithm and Trajectory Prediction
    Xie Lei
    Ding Dali
    Wei Zhenglei
    Xi Zhifei
    Tang Andi
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [26] Moving Time UCAV Maneuver Decision Based on the Dynamic Relational Weight Algorithm and Trajectory Prediction
    Lei, Xie
    Dali, Ding
    Zhenglei, Wei
    Zhifei, Xi
    Andi, Tang
    Mathematical Problems in Engineering, 2021, 2021
  • [27] Multi-objective solution of optimal power flow based on TD3 deep reinforcement learning algorithm
    Sun, Bowei
    Song, Minggang
    Li, Ang
    Zou, Nan
    Pan, Pengfei
    Lu, Xi
    Yang, Qun
    Zhang, Hengrui
    Kong, Xiangyu
    SUSTAINABLE ENERGY GRIDS & NETWORKS, 2023, 34
  • [28] Air Combat Maneuver Decision Based on Deep Reinforcement Learning and Game Theory
    Yin, Shuhui
    Kang, Yu
    Zhao, Yunbo
    Xue, Jian
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6939 - 6943
  • [29] Air combat maneuver decision based on deep reinforcement learning with auxiliary reward
    Zhang T.
    Wang Y.
    Sun M.
    Chen Z.
    Neural Computing and Applications, 2024, 36 (21) : 13341 - 13356
  • [30] Design of Intelligent Controller for Aero-engine Based on TD3 Algorithm
    Zhu, Jianming
    Tang, Wei
    Dong, Jianhua
    INFORMATION TECHNOLOGY AND CONTROL, 2023, 52 (04): : 1010 - 1024