Range-Aware Impact Angle Guidance Law With Deep Reinforcement Meta-Learning

被引:7
|
作者
Liang, Chen [1 ]
Wang, Weihong [1 ]
Liu, Zhenghua [1 ]
Lai, Chao [2 ]
Wang, Sen [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] China North Ind Grp Corp, Nav & Control Technol Res Inst, Beijing 100089, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Missile guidance; tube model predictive control; meta-learning; deep reinforcement learning; impact angle constraint; WEIGHTED OPTIMAL GUIDANCE;
D O I
10.1109/ACCESS.2020.3017480
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a new guidance law is proposed for impact angle constrained missile with time-varying velocity against a maneuvering target. The proposed guidance law is based on model-based deep reinforcement learning (RL) technique where a deep neural network is trained to be a predictive model used in model predictive path integral (MPPI) control. Tube-MPPI, a robust approach utilizing ancillary controller for disturbance rejection, is introduced in guidance law design in this work to deal with the MPPI degradation of robustness when the deep predictive model differs with actual environment. To further improve the performance, meta-learning is utilized to enable the deep neural dynamics adapt to environment changes online. With this approach the model mismatch of the nominal controller is reduced to improve tube-MPPI performance. Furthermore, a range-aware hyperbolic function is proposed as an adaptive function in the MPPI performance index design. Thus, reduced initial acceleration command and increased terminal velocity benefit guidance performance. Numerical simulations under various conditions demonstrate the effectiveness of proposed guidance law.
引用
收藏
页码:152093 / 152104
页数:12
相关论文
共 50 条
  • [1] Deep Reinforcement Meta-learning Guidance with Impact Angle Constraint
    Liang C.
    Wang W.-H.
    Lai C.
    Yuhang Xuebao/Journal of Astronautics, 2021, 42 (05): : 611 - 620
  • [2] Adaptive guidance and integrated navigation with reinforcement meta-learning
    Gaudet, Brian
    Linares, Richard
    Furfaro, Roberto
    ACTA ASTRONAUTICA, 2020, 169 : 180 - 190
  • [3] Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
    Liang, Chen
    Wang, Weihong
    Liu, Zhenghua
    Lai, Chao
    Zhou, Benchun
    IEEE ACCESS, 2019, 7 : 47353 - 47365
  • [4] Meta-learning in Reinforcement Learning
    Schweighofer, N
    Doya, K
    NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
  • [5] Impact-Angle Constraint Guidance and Control Strategies Based on Deep Reinforcement Learning
    Fan, Junfang
    Dou, Denghui
    Ji, Yi
    AEROSPACE, 2023, 10 (11)
  • [6] Deep Reinforcement Learning Guidance with Impact Time Control
    Li, Guofei
    Li, Shituo
    Li, Bohao
    Wu, Yunjie
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (06) : 1594 - 1603
  • [7] Quadcopter Guidance Law Design using Deep Reinforcement Learning
    Aydinli, Sevket Utku
    Kutay, Ali Turker
    2023 10TH INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN AIR AND SPACE TECHNOLOGIES, RAST, 2023,
  • [8] Deep reinforcement learning guidance with impact time control
    LI Guofei
    LI Shituo
    LI Bohao
    WU Yunjie
    Journal of Systems Engineering and Electronics, 2024, 35 (06) : 1594 - 1603
  • [9] MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning
    Li, Kevin
    Gupta, Abhishek
    Reddy, Ashwin
    Pong, Vitchyr
    Zhou, Aurick
    Yu, Justin
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] A survey of deep meta-learning
    Mike Huisman
    Jan N. van Rijn
    Aske Plaat
    Artificial Intelligence Review, 2021, 54 : 4483 - 4541