Range-Aware Impact Angle Guidance Law With Deep Reinforcement Meta-Learning

被引：7

作者：

Liang, Chen ^{[1
]}

Wang, Weihong ^{[1
]}

Liu, Zhenghua ^{[1
]}

Lai, Chao ^{[2
]}

Wang, Sen ^{[2
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[2] China North Ind Grp Corp, Nav & Control Technol Res Inst, Beijing 100089, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷 / 08期

关键词：

Missile guidance; tube model predictive control; meta-learning; deep reinforcement learning; impact angle constraint; WEIGHTED OPTIMAL GUIDANCE;

D O I：

10.1109/ACCESS.2020.3017480

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this article, a new guidance law is proposed for impact angle constrained missile with time-varying velocity against a maneuvering target. The proposed guidance law is based on model-based deep reinforcement learning (RL) technique where a deep neural network is trained to be a predictive model used in model predictive path integral (MPPI) control. Tube-MPPI, a robust approach utilizing ancillary controller for disturbance rejection, is introduced in guidance law design in this work to deal with the MPPI degradation of robustness when the deep predictive model differs with actual environment. To further improve the performance, meta-learning is utilized to enable the deep neural dynamics adapt to environment changes online. With this approach the model mismatch of the nominal controller is reduced to improve tube-MPPI performance. Furthermore, a range-aware hyperbolic function is proposed as an adaptive function in the MPPI performance index design. Thus, reduced initial acceleration command and increased terminal velocity benefit guidance performance. Numerical simulations under various conditions demonstrate the effectiveness of proposed guidance law.

引用

页码：152093 / 152104

页数：12

共 50 条

[1] Deep Reinforcement Meta-learning Guidance with Impact Angle Constraint
Liang C.
Wang W.-H.
Lai C.
Yuhang Xuebao/Journal of Astronautics, 2021, 42 (05): : 611 - 620
[2] Adaptive guidance and integrated navigation with reinforcement meta-learning
Gaudet, Brian
Linares, Richard
Furfaro, Roberto
ACTA ASTRONAUTICA, 2020, 169 : 180 - 190
[3] Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
Liang, Chen
Wang, Weihong
Liu, Zhenghua
Lai, Chao
Zhou, Benchun
IEEE ACCESS, 2019, 7 : 47353 - 47365
[4] Meta-learning in Reinforcement Learning
Schweighofer, N
Doya, K
NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
[5] Impact-Angle Constraint Guidance and Control Strategies Based on Deep Reinforcement Learning
Fan, Junfang
Dou, Denghui
Ji, Yi
AEROSPACE, 2023, 10 (11)
[6] Deep Reinforcement Learning Guidance with Impact Time Control
Li, Guofei
Li, Shituo
Li, Bohao
Wu, Yunjie
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (06) : 1594 - 1603
[7] Quadcopter Guidance Law Design using Deep Reinforcement Learning
Aydinli, Sevket Utku
Kutay, Ali Turker
2023 10TH INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN AIR AND SPACE TECHNOLOGIES, RAST, 2023,
[8] Deep reinforcement learning guidance with impact time control
LI Guofei
LI Shituo
LI Bohao
WU Yunjie
Journal of Systems Engineering and Electronics, 2024, 35 (06) : 1594 - 1603
[9] MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning
Li, Kevin
Gupta, Abhishek
Reddy, Ashwin
Pong, Vitchyr
Zhou, Aurick
Yu, Justin
Levine, Sergey
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[10] A survey of deep meta-learning
Mike Huisman
Jan N. van Rijn
Aske Plaat
Artificial Intelligence Review, 2021, 54 : 4483 - 4541

← 1 2 3 4 5 →