Guidance law of interceptors against a high-speed maneuvering target based on deep Q-Network

被引:10
|
作者
Wu, Ming-yu [1 ]
He, Xian-jun [1 ]
Qiu, Zhi-ming [2 ]
Chen, Zhi-hua [1 ]
机构
[1] Nanjing Univ Sci & Technol, Natl Key Lab Transient Phys, Nanjing 210094, Peoples R China
[2] Naval Res Acad, Shanghai, Peoples R China
关键词
High-speed maneuvering target; guidance law; convergence of LOS rate; deep reinforcement learning; deep Q-Network; prioritized experience replay; PROPORTIONAL-NAVIGATION;
D O I
10.1177/01423312211052742
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a novel guidance law for intercepting a high-speed maneuvering target based on deep reinforcement learning, which mainly includes the interceptor-target relative motion model and value function approximation model based on deep Q-Network (DQN) with prioritized experience replay. First, a method called prioritized experience replay is applied to extract more efficient samples and reduce the training time. Second, to cope with the discrete action space of DQN, a normal acceleration is introduced to the state space, and the normal acceleration rate is chosen as the action. Then, the continuous normal acceleration command is obtained using numerical integral method. Third, to make the line-of-sight (LOS) rate converge rapidly, the reward function whose absolute value tends to zero has been constructed. Finally, compared with proportional navigation guidance (PNG) and the Q-Learning-based guidance law (QLG), the simulation experiments are implemented to intercept high-speed maneuvering targets at different acceleration policies. Simulation results demonstrate that the proposed DQN-based guidance law (DQNG) can obtain continuous acceleration command, make the LOS rate converge to zero rapidly, and hit the maneuvering targets using only the LOS rate. It also confirms that DQNG can realize the parallel-like approach and improve the interception performance of the interceptor to high-speed maneuvering targets. The proposed DQNG also has the advantages of avoiding the complicated formula derivation of traditional guidance law and eliminates the acceleration buffeting.
引用
收藏
页码:1373 / 1387
页数:15
相关论文
共 50 条
  • [21] Suboptimal guidance law against maneuvering target with time and angle constraints
    Xie, Jiliang
    Ma, Kemao
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 148
  • [22] Adaptive Sliding Mode Cooperative Guidance Law against Maneuvering Target
    He, Chen-di
    Shi, Zhen
    Zheng, Yan
    Wang, Saisai
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9803 - 9808
  • [23] Performance Analysis of Motion Camouflage Guidance Law Against Maneuvering Target
    Zhao, Qiancheng
    Li, Jianqing
    Li, Chaoyong
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (06) : 8466 - 8474
  • [24] A NOVEL SWITCHING GUIDANCE LAW AGAINST HYPERSONIC RANDOM MANEUVERING TARGET
    Wang, Yuzhe
    Shi, Xiaoping
    Zhu, Yin
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 2179 - 2183
  • [25] Vector Field based Guidance Law for Intercepting Maneuvering Target
    Lee, Suwon
    Lee, Seokwon
    Ann, Sungjun
    Lee, Jaeho
    Kim, Youdan
    2018 26TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2018, : 223 - 228
  • [26] A New Angular Acceleration Guidance Law with Estimation Approach based on Sliding Mode Observer against High Maneuvering Target
    Song, Yuanyun
    Chen, Wanchun
    Yin, Xingliang
    MECHANICAL AND AEROSPACE ENGINEERING, PTS 1-7, 2012, 110-116 : 5249 - 5256
  • [27] Double Deep Q network-based speed trajectory intelligent optimization for high-speed train
    Zhou, Min
    Zhou, Xueying
    Cao, Yaoguang
    Yang, Bo
    Done, Hairong
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 2436 - 2441
  • [28] Deep Reinforcement Learning-Based Differential Game Guidance Law against Maneuvering Evaders
    Xi, Axing
    Cai, Yuanli
    AEROSPACE, 2024, 11 (07)
  • [29] Manufacturing Resource Scheduling Based on Deep Q-Network
    ZHANG Yufei
    ZOU Yuanhao
    ZHAO Xiaodong
    Wuhan University Journal of Natural Sciences, 2022, 27 (06) : 531 - 538
  • [30] Research on the Omnidirectional Interception Guidance Law for High Speed Maneuvering Targets
    Bai, Guoyu
    Shen, Huairong
    Duan, Yongsheng
    Tan, Hongyi
    PROCEEDINGS OF THE 2016 7TH INTERNATIONAL CONFERENCE ON MECHATRONICS, CONTROL AND MATERIALS (ICMCM 2016), 2016, 104 : 583 - 598