Guidance law of interceptors against a high-speed maneuvering target based on deep Q-Network

被引：10

作者：

Wu, Ming-yu ^{[1
]}

He, Xian-jun ^{[1
]}

Qiu, Zhi-ming ^{[2
]}

Chen, Zhi-hua ^{[1
]}

机构：

[1] Nanjing Univ Sci & Technol, Natl Key Lab Transient Phys, Nanjing 210094, Peoples R China

[2] Naval Res Acad, Shanghai, Peoples R China

来源：

TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL | 2022年 / 44卷 / 07期

关键词：

High-speed maneuvering target; guidance law; convergence of LOS rate; deep reinforcement learning; deep Q-Network; prioritized experience replay; PROPORTIONAL-NAVIGATION;

D O I：

10.1177/01423312211052742

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a novel guidance law for intercepting a high-speed maneuvering target based on deep reinforcement learning, which mainly includes the interceptor-target relative motion model and value function approximation model based on deep Q-Network (DQN) with prioritized experience replay. First, a method called prioritized experience replay is applied to extract more efficient samples and reduce the training time. Second, to cope with the discrete action space of DQN, a normal acceleration is introduced to the state space, and the normal acceleration rate is chosen as the action. Then, the continuous normal acceleration command is obtained using numerical integral method. Third, to make the line-of-sight (LOS) rate converge rapidly, the reward function whose absolute value tends to zero has been constructed. Finally, compared with proportional navigation guidance (PNG) and the Q-Learning-based guidance law (QLG), the simulation experiments are implemented to intercept high-speed maneuvering targets at different acceleration policies. Simulation results demonstrate that the proposed DQN-based guidance law (DQNG) can obtain continuous acceleration command, make the LOS rate converge to zero rapidly, and hit the maneuvering targets using only the LOS rate. It also confirms that DQNG can realize the parallel-like approach and improve the interception performance of the interceptor to high-speed maneuvering targets. The proposed DQNG also has the advantages of avoiding the complicated formula derivation of traditional guidance law and eliminates the acceleration buffeting.

引用

页码：1373 / 1387

页数：15

共 50 条

[41] High-speed maneuvering target detection approach based on joint RFT and keystone transform
Tian Jing
Cui Wei
Shen Qing
Wei ZiXiang
Wu SiLiang
SCIENCE CHINA-INFORMATION SCIENCES, 2013, 56 (06) : 1 - 13
[42] Distributed observer-based fixed-time cooperative guidance law against maneuvering target
He, Zhichuan
Fan, Shipeng
Wang, Jiang
Wang, Peng
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 27 - 53
[43] High-speed maneuvering target detection approach based on joint RFT and keystone transform
TIAN Jing
CUI Wei
SHEN Qing
WEI ZiXiang
WU SiLiang
ScienceChina(InformationSciences), 2013, 56 (06) : 93 - 105
[44] High-speed maneuvering target detection approach based on joint RFT and keystone transform
Jing Tian
Wei Cui
Qing Shen
ZiXiang Wei
SiLiang Wu
Science China Information Sciences, 2013, 56 : 1 - 13
[45] Combined Proportional Navigation Law for Intercepting High-speed Maneuvering Targets Based on MPSP
Shen, Lianjie
Wang, Yongzhou
Hao, Feng
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8049 - 8054
[46] The X-Layer Optimization in CRN Using Deep Q-Network for Secure High Speed Communication
Islam, Chowdhury Sajadul
Mollah, Md. Sarwar Hossain
2019 11TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE 2019), 2019,
[47] Supervised contrastive deep Q-Network for imbalanced radar automatic target recognition
Liu, Guanliang
Chen, Wenchao
Chen, Bo
Feng, Bo
Wang, Penghui
Liu, Hongwei
PATTERN RECOGNITION, 2025, 161
[48] Dynamic constrained evolutionary optimization based on deep Q-network
Liang, Zhengping
Yang, Ruitai
Wang, Jigang
Liu, Ling
Ma, Xiaoliang
Zhu, Zexuan
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
[49] Deep Q-network learning-based active speed management under autonomous driving environments
Kang, Kawon
Park, Nuri
Park, Juneyoung
Abdel-Aty, Mohamed
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (21) : 3225 - 3242
[50] High-Speed Maneuvering and Spread Target Detection in High-Resolution Radar
Tian, Yunlian
Li, Wujun
Cao, Xi
Yi, Wei
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 709 - 714

← 1 2 3 4 5 →