Guidance law of interceptors against a high-speed maneuvering target based on deep Q-Network

Cited by: 10
Authors
Wu, Ming-yu [1]
He, Xian-jun [1]
Qiu, Zhi-ming [2]
Chen, Zhi-hua [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Natl Key Lab Transient Phys, Nanjing 210094, Peoples R China
[2] Naval Res Acad, Shanghai, Peoples R China
Keywords
High-speed maneuvering target; guidance law; convergence of LOS rate; deep reinforcement learning; deep Q-Network; prioritized experience replay; proportional navigation
DOI
10.1177/01423312211052742
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes a novel guidance law, based on deep reinforcement learning, for intercepting a high-speed maneuvering target. The approach consists mainly of an interceptor-target relative motion model and a value function approximation model based on a deep Q-Network (DQN) with prioritized experience replay. First, prioritized experience replay is applied to extract more efficient samples and reduce the training time. Second, to cope with the discrete action space of DQN, the normal acceleration is introduced into the state space and the normal acceleration rate is chosen as the action; the continuous normal acceleration command is then obtained by numerical integration. Third, to make the line-of-sight (LOS) rate converge rapidly, a reward function whose absolute value tends to zero is constructed. Finally, simulation experiments compare the proposed law with proportional navigation guidance (PNG) and the Q-Learning-based guidance law (QLG) in intercepting high-speed maneuvering targets under different acceleration policies. The simulation results demonstrate that the proposed DQN-based guidance law (DQNG) can obtain a continuous acceleration command, make the LOS rate converge to zero rapidly, and hit the maneuvering targets using only the LOS rate. The results also confirm that DQNG can realize a parallel-like approach and improve the interception performance of the interceptor against high-speed maneuvering targets. The proposed DQNG further avoids the complicated formula derivation of traditional guidance laws and eliminates acceleration buffeting.
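The three ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the action set, acceleration limit, time step, the reward form -|LOS rate|, and the proportional prioritization formula P(i) = p_i^alpha / sum_k p_k^alpha are all assumptions chosen for illustration, and every name below is hypothetical.

```python
# Hypothetical values throughout; none are taken from the paper.
JERK_ACTIONS = [-50.0, 0.0, 50.0]  # discrete normal-acceleration rates, m/s^3 (assumed)
A_MAX = 200.0                      # normal-acceleration limit, m/s^2 (assumed)
DT = 0.01                          # integration step, s (assumed)


def integrate_command(a_prev, action_index, dt=DT):
    """Euler-integrate the chosen acceleration rate into the acceleration command.

    Because the discrete action is a rate, the resulting command changes by at
    most max(|rate|) * dt per step, so it varies continuously over time.
    """
    a_next = a_prev + JERK_ACTIONS[action_index] * dt
    return max(-A_MAX, min(A_MAX, a_next))  # clip to the assumed limit


def reward(los_rate):
    """Assumed reward: non-positive, with magnitude going to zero as the LOS rate does."""
    return -abs(los_rate)


def per_probabilities(td_errors, alpha=0.6, eps=1e-6):
    """Proportional prioritized replay: sampling probability grows with |TD error|."""
    priorities = [(abs(d) + eps) ** alpha for d in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]


if __name__ == "__main__":
    a = 0.0
    for idx in (2, 2, 1, 0):  # an arbitrary action sequence
        a = integrate_command(a, idx)
        print(f"command = {a:.2f} m/s^2")
    print("reward at LOS rate 0.02 rad/s:", reward(0.02))
    print("sampling probabilities:", per_probabilities([0.1, 1.0, 0.5]))
```

Choosing the rate rather than the acceleration itself as the action is what lets a small discrete action set produce a smooth command, which matches the abstract's claim about avoiding acceleration buffeting.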
Pages: 1373-1387
Page count: 15
Related papers
50 in total
  • [41] High-speed maneuvering target detection approach based on joint RFT and keystone transform
    Tian Jing
    Cui Wei
    Shen Qing
    Wei ZiXiang
    Wu SiLiang
    SCIENCE CHINA-INFORMATION SCIENCES, 2013, 56 (06) : 1 - 13
  • [42] Distributed observer-based fixed-time cooperative guidance law against maneuvering target
    He, Zhichuan
    Fan, Shipeng
    Wang, Jiang
    Wang, Peng
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 27 - 53
  • [43] High-speed maneuvering target detection approach based on joint RFT and keystone transform
    TIAN Jing
    CUI Wei
    SHEN Qing
    WEI ZiXiang
    WU SiLiang
    Science China (Information Sciences), 2013, 56 (06) : 93 - 105
  • [44] High-speed maneuvering target detection approach based on joint RFT and keystone transform
    Jing Tian
    Wei Cui
    Qing Shen
    ZiXiang Wei
    SiLiang Wu
    Science China Information Sciences, 2013, 56 : 1 - 13
  • [45] Combined Proportional Navigation Law for Intercepting High-speed Maneuvering Targets Based on MPSP
    Shen, Lianjie
    Wang, Yongzhou
    Hao, Feng
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8049 - 8054
  • [46] The X-Layer Optimization in CRN Using Deep Q-Network for Secure High Speed Communication
    Islam, Chowdhury Sajadul
    Mollah, Md. Sarwar Hossain
    2019 11TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE 2019), 2019,
  • [47] Supervised contrastive deep Q-Network for imbalanced radar automatic target recognition
    Liu, Guanliang
    Chen, Wenchao
    Chen, Bo
    Feng, Bo
    Wang, Penghui
    Liu, Hongwei
    PATTERN RECOGNITION, 2025, 161
  • [48] Dynamic constrained evolutionary optimization based on deep Q-network
    Liang, Zhengping
    Yang, Ruitai
    Wang, Jigang
    Liu, Ling
    Ma, Xiaoliang
    Zhu, Zexuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [49] Deep Q-network learning-based active speed management under autonomous driving environments
    Kang, Kawon
    Park, Nuri
    Park, Juneyoung
    Abdel-Aty, Mohamed
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (21) : 3225 - 3242
  • [50] High-Speed Maneuvering and Spread Target Detection in High-Resolution Radar
    Tian, Yunlian
    Li, Wujun
    Cao, Xi
    Yi, Wei
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 709 - 714