A Deep Reinforcement Learning-Based Intelligent Maneuvering Strategy for the High-Speed UAV Pursuit-Evasion Game

被引:1
|
作者
Yan, Tian [1 ,2 ,3 ]
Liu, Can [1 ]
Gao, Mengjing [1 ]
Jiang, Zijian [1 ]
Li, Tong [1 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Natl Key Lab Unmanned Aerial Vehicle Technol, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Integrated Res & Dev Platform Unmanned Aerial Vehi, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
pursuit-evasion game; line-of-sight angle rate; high-speed UAV; deep reinforcement learning; PROPORTIONAL NAVIGATION;
D O I
10.3390/drones8070309
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Given the rapid advancements in kinetic pursuit technology, this paper introduces an innovative maneuvering strategy, denoted as LSRC-TD3, which integrates line-of-sight (LOS) angle rate correction with deep reinforcement learning (DRL) for high-speed unmanned aerial vehicle (UAV) pursuit-evasion (PE) game scenarios, with the aim of effectively evading high-speed and high-dynamic pursuers. In the challenging situations of the game, where both speed and maximum available overload are at a disadvantage, the playing field of UAVs is severely compressed, and the difficulty of evasion is significantly increased, placing higher demands on the strategy and timing of maneuvering to change orbit. While considering evasion, trajectory constraint, and energy consumption, we formulated the reward function by combining "terminal" and "process" rewards, as well as "strong" and "weak" incentive guidance to reduce pre-exploration difficulty and accelerate convergence of the game network. Additionally, this paper presents a correction factor for LOS angle rate into the double-delay deterministic gradient strategy (TD3), thereby enhancing the sensitivity of high-speed UAVs to changes in LOS rate, as well as the accuracy of evasion timing, which improves the effectiveness and adaptive capability of the intelligent maneuvering strategy. The Monte Carlo simulation results demonstrate that the proposed method achieves a high level of evasion performance-integrating energy optimization with the requisite miss distance for high-speed UAVs-and accomplishes efficient evasion under highly challenging PE game scenarios.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
    Liu, Jie
    Liu, Shuhua
    Wu, Hongyan
    Zhang, Yu
    2009 INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, VOL II, 2009, : 482 - 486
  • [22] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
    Liu, Shuhua
    Liu, Jie
    Cheng, Yu
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (03): : 635 - 645
  • [23] Game of Drones: UAV Pursuit-Evasion Game With Type-2 Fuzzy Logic Controllers Tuned by Reinforcement Learning
    Camci, Efe
    Kayacan, Erdal
    2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 618 - 625
  • [24] An open loop Stackelberg solution to optimal strategy for UAV pursuit-evasion game
    Zhang, Yiqun
    Zhang, Pengfei
    Wang, Xiaodong
    Song, Feng
    Li, Chaoyong
    Hao, Junhong
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 129
  • [25] Escape Strategy Based on Apollonius Circles in the Pursuit-Evasion Game
    Huang, Yuting
    Luo, Yifan
    Nie, Yuhan
    Hou, Tianle
    Fu, Xiaowei
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2143 - 2153
  • [26] Near-optimal interception strategy for orbital pursuit-evasion using deep reinforcement learning
    Zhang, Jingrui
    Zhang, Kunpeng
    Zhang, Yao
    Shi, Heng
    Tang, Liang
    Li, Mou
    ACTA ASTRONAUTICA, 2022, 198 : 9 - 25
  • [27] Terminal-guidance Based Reinforcement-learning for Orbital Pursuit-evasion Game of the Spacecraft
    Geng Y.-Z.
    Yuan L.
    Huang H.
    Tang L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (05): : 974 - 984
  • [28] Integral reinforcement learning based dynamic stackelberg pursuit-evasion game for unmanned surface vehicles
    Hu, Xiaoxiang
    Liu, Shuaizheng
    Xu, Jingwen
    Xiao, Bing
    Guo, Chenguang
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 108 : 428 - 435
  • [29] Optimization of high-speed fixed-wing UAV penetration strategy based on deep reinforcement learning
    Zhuang, Xing
    Li, Dongguang
    Wang, Yue
    Liu, Xinyu
    Li, Hanyu
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 148
  • [30] Reinforcement learning-based formation-surrounding control for multiple quadrotor UAVs pursuit-evasion games
    Xiong, Hang
    Zhang, Ying
    ISA TRANSACTIONS, 2024, 145 : 205 - 224