Impact time control guidance law with time-varying velocity based on deep reinforcement learning

被引:5
|
作者
Yang, Zhuoqiao [1 ]
Liu, Xiangdong [1 ]
Liu, Haikuo [2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
关键词
Time-varying velocity; Deep reinforcement learning; Impact time control guidance; Missile guidance;
D O I
10.1016/j.ast.2023.108603
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper investigates the problem of impact-time-control guidance law with the time-varying velocity caused by gravity and aerodynamic drag. Using the deep reinforcement learning (DRL) algorithm, we propose a novel impact time control guidance (ITCG) law in which a DRL agent is trained from scratch without using any prior knowledge. Different from the traditional ITCG law, the proposed method doesn't rely on the time-to-go estimation, which is difficult to derive and inaccurate with the time-varying velocity. Further, a prioritized experience replay method and a novel action exploration method are introduced in the DRL algorithm to improve learning efficiency. Additionally, the agent action is shaped to provide smooth guidance command, which avoids the problem that the guidance command generated by the intelligent algorithm may not be continuous. Numerical simulations are conducted to support the validity of the proposed algorithm.(c) 2023 Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Reinforcement Learning-Assisted Composite Adaptive Control for Time-Varying Parameters
    Kim, Seong-hun
    Lee, Hanna
    Kim, Youdan
    IFAC PAPERSONLINE, 2020, 53 (02): : 9515 - 9520
  • [32] Adaptive control of LCL filter with time-varying parameters using reinforcement learning
    Dragoun, Jaroslav
    Smidl, Vaclav
    45TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2019), 2019, : 267 - 272
  • [33] Research on Time-cooperative Guidance of Multiple Flight Vehicles with Time-varying Velocity
    Li W.
    Shang T.
    Yao Y.
    Zhao Q.
    Binggong Xuebao/Acta Armamentarii, 2020, 41 (06): : 1096 - 1110
  • [34] Time-Varying Asymmetric Barrier Lyapunov Function-Based Impact Angle Control Guidance Law With Field-of-View Constraint
    Tian, Jiayi
    Bai, Xibin
    Yang, Huabo
    Zhang, Shifeng
    IEEE ACCESS, 2020, 8 : 185346 - 185359
  • [35] A Time-Varying Deep Reinforcement Model Predictive Control for DC Power Converter Systems
    Andalibi, Milad
    Hajihosseini, Mojtaba
    Teymoori, Sam
    Kargar, Maryam
    Gheisarnejad, Meysam
    2021 IEEE 12TH INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS FOR DISTRIBUTED GENERATION SYSTEMS (PEDG), 2021,
  • [36] A Time-Varying Deep Reinforcement Model Predictive Control for DC Power Converter Systems
    Andalibi, Milad
    Hajihosseini, Mojtaba
    Teymoori, Sam
    Kargar, Maryam
    Gheisarnejad, Meysam
    2021 IEEE 12TH INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS FOR DISTRIBUTED GENERATION SYSTEMS (PEDG), 2021,
  • [37] TIME-VARYING VELOCITY FILTERS
    LETTON, W
    BUSH, AM
    GEOPHYSICS, 1969, 34 (06) : 1011 - &
  • [38] Attention-Based Meta-Reinforcement Learning for Tracking Control of AUV With Time-Varying Dynamics
    Jiang, Peng
    Song, Shiji
    Huang, Gao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6388 - 6401
  • [39] Guidance algorithm for impact time, angle, and acceleration control under varying velocity condition
    Zhang, Wanqing
    Chen, Wanchun
    Li, Jinglin
    Yu, Wenbin
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 123
  • [40] Field-of-View and Impact Angle Constrained Guidance Law for Missiles With Time-Varying Velocities
    Liu, Bojun
    Hou, Mingshan
    Li, Yajun
    IEEE ACCESS, 2019, 7 : 61717 - 61727