Impact time control guidance law with time-varying velocity based on deep reinforcement learning

被引:5
|
作者
Yang, Zhuoqiao [1 ]
Liu, Xiangdong [1 ]
Liu, Haikuo [2 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
关键词
Time-varying velocity; Deep reinforcement learning; Impact time control guidance; Missile guidance;
D O I
10.1016/j.ast.2023.108603
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper investigates the problem of impact-time-control guidance law with the time-varying velocity caused by gravity and aerodynamic drag. Using the deep reinforcement learning (DRL) algorithm, we propose a novel impact time control guidance (ITCG) law in which a DRL agent is trained from scratch without using any prior knowledge. Different from the traditional ITCG law, the proposed method doesn't rely on the time-to-go estimation, which is difficult to derive and inaccurate with the time-varying velocity. Further, a prioritized experience replay method and a novel action exploration method are introduced in the DRL algorithm to improve learning efficiency. Additionally, the agent action is shaped to provide smooth guidance command, which avoids the problem that the guidance command generated by the intelligent algorithm may not be continuous. Numerical simulations are conducted to support the validity of the proposed algorithm.(c) 2023 Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Automatic Modulation Classification in Time-Varying Channels Based on Deep Learning
    Zhou, Yu
    Lin, Tian
    Zhu, Yu
    IEEE ACCESS, 2020, 8 (08): : 197508 - 197522
  • [42] Deep Reinforcement Learning for Time Optimal Velocity Control using Prior Knowledge
    Hartmann, Gabriel
    Shiller, Zvi
    Azaria, Amos
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 186 - 193
  • [43] Guidance law to control impact time and angle
    Lee, Jin-Ik
    Jeon, In-Soo
    Tahk, Min-Jea
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2007, 43 (01) : 301 - 310
  • [44] Guidance law to control impact time and angle
    Jeon, IS
    Lee, JI
    Tahk, MJ
    2005 International Conference on Control and Automation (ICCA), Vols 1 and 2, 2005, : 852 - 857
  • [45] A Nonlinear Guidance Law for Impact Time Control
    Saleem, Abdul
    Ratnoo, Ashwini
    2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 651 - 656
  • [46] Impact Time Control Guidance Law Design
    Cheng, Zhongtao
    Zhuo, Linren
    Xiong, Jizhang
    Liu, Lei
    Wang, Yongji
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 4723 - 4727
  • [47] Deep Reinforcement Learning Based Dynamic Power and Beamforming Design for Time-Varying Wireless Downlink Interference Channel
    Liu, Mengfan
    Wang, Rui
    Xing, Zhe
    Soto, Ismael
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 471 - 476
  • [48] A time-varying iterative learning control scheme
    Tharayil, M
    Alleyne, A
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 3782 - 3787
  • [49] A Deep Reinforcement Learning-Based PID Tuning Strategy for Nonlinear MIMO Systems with Time-varying Uncertainty
    Wang, Hao
    Ricardez-Sandoval, Luis A.
    IFAC PAPERSONLINE, 2024, 58 (14): : 887 - 892
  • [50] Deep Reinforcement Learning for Trustworthy and Time-Varying Connection Scheduling in a Coupled UAV-Based Femtocaching Architecture
    Hajiakhondi-Meybodi, Zohreh
    Mohammadi, Arash
    Abouei, Jamshid
    IEEE ACCESS, 2021, 9 : 32263 - 32281