Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target

被引:0
|
作者
Weifan Li
Yuanheng Zhu
Dongbin Zhao
机构
[1] Chinese Academy of Sciences,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation
[2] University of Chinese Academy of Sciences,School of Artificial Intelligence
来源
关键词
Reinforcement learning; Missile guidance; Auxiliary learning; Self-imitation learning;
D O I
暂无
中图分类号
学科分类号
摘要
In missile guidance, pursuit performance is seriously degraded due to the uncertainty and randomness in target maneuverability, detection delay, and environmental noise. In many methods, accurately estimating the acceleration of the target or the time-to-go is needed to intercept the maneuvering target, which is hard in an environment with uncertainty. In this paper, we propose an assisted deep reinforcement learning (ARL) algorithm to optimize the neural network-based missile guidance controller for head-on interception. Based on the relative velocity, distance, and angle, ARL can control the missile to intercept the maneuvering target and achieve large terminal intercept angle. To reduce the influence of environmental uncertainty, ARL predicts the target’s acceleration as an auxiliary supervised task. The supervised learning task improves the ability of the agent to extract information from observations. To exploit the agent’s good trajectories, ARL presents the Gaussian self-imitation learning to make the mean of action distribution approach the agent’s good actions. Compared with vanilla self-imitation learning, Gaussian self-imitation learning improves the exploration in continuous control. Simulation results validate that ARL outperforms traditional methods and proximal policy optimization algorithm with higher hit rate and larger terminal intercept angle in the simulation environment with noise, delay, and maneuverable target.
引用
收藏
页码:1205 / 1216
页数:11
相关论文
共 50 条
  • [31] Integrated Guidance and Control Using Adaptive Backstepping Approach for Maneuvering Target Interception
    Pei, Pei
    Ji, Yi
    He, Shaoming
    Wang, Jiang
    Lin, Defu
    IFAC PAPERSONLINE, 2020, 53 (02): : 9458 - 9464
  • [32] ADP based Guidance Strategy for Maneuvering Target Interception under Radome Errors
    Guo J.
    Hu G.
    Guo Z.
    Wang G.
    Yuhang Xuebao/Journal of Astronautics, 2022, 43 (07): : 911 - 920
  • [33] Nonlinear Guidance Laws for Maneuvering Target Interception With Virtual Look Angle Constraint
    Wang, Yaning
    Wang, Hui
    Ling, Defu
    Wang, Wei
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (04) : 2807 - 2822
  • [34] Guidance method for maneuvering target interception based on virtual look angle constraint
    Wang Y.
    Wang H.
    Lin D.
    Yuan Y.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (01):
  • [35] Integrated Game Based Guidance with Nonlinear Autopilot Design for Maneuvering Target Interception
    Hsueh, Ming-Hsiung
    Wang, Ting-Kuo
    Fu, Li-Chen
    ASIAN JOURNAL OF CONTROL, 2014, 16 (02) : 431 - 440
  • [36] Missile-cooperation Target Detection and Interception Time Adjustable Guidance Law
    Wu J.
    Zhao B.
    Han T.
    Yuhang Xuebao/Journal of Astronautics, 2023, 44 (07): : 1084 - 1093
  • [37] Nonlinear Optimal Missile Guidance for Stationary Target Interception with Pendulum Motion Perspective
    Cho, Namhoon
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3908 - 3913
  • [38] Autonomous drone interception with Deep Reinforcement Learning
    Bertoin, David
    Gauffriau, Adrien
    Grasset, Damien
    Gupta, Jayant Sen
    CEUR Workshop Proceedings, 2022, 3173
  • [39] A hierarchical reinforcement learning method for missile evasion and guidance
    Yan, Mengda
    Yang, Rennong
    Zhang, Ying
    Yue, Longfei
    Hu, Dongyuan
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [40] Switching-aware multi-agent deep reinforcement learning for target interception
    Fan, Dongyu
    Shen, Haikuo
    Dong, Lijing
    APPLIED INTELLIGENCE, 2023, 53 (07) : 7876 - 7891