A Deep Reinforcement Learning-Based Intelligent Maneuvering Strategy for the High-Speed UAV Pursuit-Evasion Game

被引：1

作者：

Yan, Tian ^{[1
,2
,3
]}

Liu, Can ^{[1
]}

Gao, Mengjing ^{[1
]}

Jiang, Zijian ^{[1
]}

Li, Tong ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China

[2] Northwestern Polytech Univ, Natl Key Lab Unmanned Aerial Vehicle Technol, Xian 710072, Peoples R China

[3] Northwestern Polytech Univ, Integrated Res & Dev Platform Unmanned Aerial Vehi, Xian 710072, Peoples R China

来源：

DRONES | 2024年 / 8卷 / 07期

基金：

中国国家自然科学基金;

关键词：

pursuit-evasion game; line-of-sight angle rate; high-speed UAV; deep reinforcement learning; PROPORTIONAL NAVIGATION;

D O I：

10.3390/drones8070309

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

Given the rapid advancements in kinetic pursuit technology, this paper introduces an innovative maneuvering strategy, denoted as LSRC-TD3, which integrates line-of-sight (LOS) angle rate correction with deep reinforcement learning (DRL) for high-speed unmanned aerial vehicle (UAV) pursuit-evasion (PE) game scenarios, with the aim of effectively evading high-speed and high-dynamic pursuers. In the challenging situations of the game, where both speed and maximum available overload are at a disadvantage, the playing field of UAVs is severely compressed, and the difficulty of evasion is significantly increased, placing higher demands on the strategy and timing of maneuvering to change orbit. While considering evasion, trajectory constraint, and energy consumption, we formulated the reward function by combining "terminal" and "process" rewards, as well as "strong" and "weak" incentive guidance to reduce pre-exploration difficulty and accelerate convergence of the game network. Additionally, this paper presents a correction factor for LOS angle rate into the double-delay deterministic gradient strategy (TD3), thereby enhancing the sensitivity of high-speed UAVs to changes in LOS rate, as well as the accuracy of evasion timing, which improves the effectiveness and adaptive capability of the intelligent maneuvering strategy. The Monte Carlo simulation results demonstrate that the proposed method achieves a high level of evasion performance-integrating energy optimization with the requisite miss distance for high-speed UAVs-and accomplishes efficient evasion under highly challenging PE game scenarios.

引用

页数：20

共 50 条

[31] PRD-MADDPG: An efficient learning-based algorithm for orbital pursuit-evasion game with impulsive maneuvers
Zhao, Liran
Zhang, Yulin
Dang, Zhaohui
ADVANCES IN SPACE RESEARCH, 2023, 72 (02) : 211 - 230
[32] A Deep Reinforcement Learning Approach for the Pursuit Evasion Game in the Presence of Obstacles
Qi, Qi
Zhang, Xuebo
Guo, Xian
2020 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE-RCAR 2020), 2020, : 68 - 73
[33] Large Scale Pursuit-Evasion Under Collision Avoidance Using Deep Reinforcement Learning
Yang, Helei
Ge, Peng
Cao, Junjie
Yang, Yifan
Liu, Yong
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 2232 - 2239
[34] Optimal Guidance Laws for a Hypersonic Multiplayer Pursuit-Evasion Game Based on a Differential Game Strategy
Liang, Haizhao
Li, Zhi
Wu, Jinze
Zheng, Yu
Chu, Hongyu
Wang, Jianying
AEROSPACE, 2022, 9 (02)
[35] Pursuit-Evasion Games for Multi-agent Based on Reinforcement Learning with Obstacles
Hu, Penglin
Guo, Yaning
Hu, Jinwen
Pan, Quan
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1015 - 1024
[36] High-dynamic intelligent maneuvering guidance strategy via deep reinforcement learning
Zhao, Sibo
Zhu, Jianwen
Bao, Weimin
Li, Xiaoping
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2023, 237 (11) : 2617 - 2631
[37] Deep Reinforcement Learning-Based Differential Game Guidance Law against Maneuvering Evaders
Xi, Axing
Cai, Yuanli
AEROSPACE, 2024, 11 (07)
[38] Maneuvering Decision Making Based on Cloud Modeling Algorithm for UAV Evasion-Pursuit Game
Huang, Hanqiao
Weng, Weiye
Zhou, Huan
Jiang, Zijian
Dong, Yue
AEROSPACE, 2024, 11 (03)
[39] Hierarchical Maneuver Decision Method Based on PG-Option for UAV Pursuit-Evasion Game
Li, Bo
Zhang, Haohui
He, Pingkuan
Wang, Geng
Yue, Kaiqiang
Neretin, Evgeny
DRONES, 2023, 7 (07)
[40] Intelligent Beam Management Based on Deep Reinforcement Learning in High-Speed Railway Scenarios
Qiao, Yuanyuan
Niu, Yong
Zhang, Xiangfei
Chen, Sheng
Zhong, Zhangdui
Wang, Ning
Ai, Bo
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (03) : 3917 - 3931

← 1 2 3 4 5 →