Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target

被引：0

作者：

Weifan Li

Yuanheng Zhu

Dongbin Zhao

机构：

[1] Chinese Academy of Sciences,The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation

[2] University of Chinese Academy of Sciences,School of Artificial Intelligence

来源：

Complex & Intelligent Systems | 2022年 / 8卷

关键词：

Reinforcement learning; Missile guidance; Auxiliary learning; Self-imitation learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In missile guidance, pursuit performance is seriously degraded due to the uncertainty and randomness in target maneuverability, detection delay, and environmental noise. In many methods, accurately estimating the acceleration of the target or the time-to-go is needed to intercept the maneuvering target, which is hard in an environment with uncertainty. In this paper, we propose an assisted deep reinforcement learning (ARL) algorithm to optimize the neural network-based missile guidance controller for head-on interception. Based on the relative velocity, distance, and angle, ARL can control the missile to intercept the maneuvering target and achieve large terminal intercept angle. To reduce the influence of environmental uncertainty, ARL predicts the target’s acceleration as an auxiliary supervised task. The supervised learning task improves the ability of the agent to extract information from observations. To exploit the agent’s good trajectories, ARL presents the Gaussian self-imitation learning to make the mean of action distribution approach the agent’s good actions. Compared with vanilla self-imitation learning, Gaussian self-imitation learning improves the exploration in continuous control. Simulation results validate that ARL outperforms traditional methods and proximal policy optimization algorithm with higher hit rate and larger terminal intercept angle in the simulation environment with noise, delay, and maneuverable target.

引用

页码：1205 / 1216

页数：11

共 50 条

[31] Integrated Guidance and Control Using Adaptive Backstepping Approach for Maneuvering Target Interception
Pei, Pei
Ji, Yi
He, Shaoming
Wang, Jiang
Lin, Defu
IFAC PAPERSONLINE, 2020, 53 (02): : 9458 - 9464
[32] ADP based Guidance Strategy for Maneuvering Target Interception under Radome Errors
Guo J.
Hu G.
Guo Z.
Wang G.
Yuhang Xuebao/Journal of Astronautics, 2022, 43 (07): : 911 - 920
[33] Nonlinear Guidance Laws for Maneuvering Target Interception With Virtual Look Angle Constraint
Wang, Yaning
Wang, Hui
Ling, Defu
Wang, Wei
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (04) : 2807 - 2822
[34] Guidance method for maneuvering target interception based on virtual look angle constraint
Wang Y.
Wang H.
Lin D.
Yuan Y.
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (01):
[35] Integrated Game Based Guidance with Nonlinear Autopilot Design for Maneuvering Target Interception
Hsueh, Ming-Hsiung
Wang, Ting-Kuo
Fu, Li-Chen
ASIAN JOURNAL OF CONTROL, 2014, 16 (02) : 431 - 440
[36] Missile-cooperation Target Detection and Interception Time Adjustable Guidance Law
Wu J.
Zhao B.
Han T.
Yuhang Xuebao/Journal of Astronautics, 2023, 44 (07): : 1084 - 1093
[37] Nonlinear Optimal Missile Guidance for Stationary Target Interception with Pendulum Motion Perspective
Cho, Namhoon
2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3908 - 3913
[38] Autonomous drone interception with Deep Reinforcement Learning
Bertoin, David
Gauffriau, Adrien
Grasset, Damien
Gupta, Jayant Sen
CEUR Workshop Proceedings, 2022, 3173
[39] A hierarchical reinforcement learning method for missile evasion and guidance
Yan, Mengda
Yang, Rennong
Zhang, Ying
Yue, Longfei
Hu, Dongyuan
SCIENTIFIC REPORTS, 2022, 12 (01)
[40] Switching-aware multi-agent deep reinforcement learning for target interception
Fan, Dongyu
Shen, Haikuo
Dong, Lijing
APPLIED INTELLIGENCE, 2023, 53 (07) : 7876 - 7891

← 1 2 3 4 5 →