Active Defence Guidance for Hypersonic Vehicle with Incomplete Information Based on Reinforcement Learning

被引:0
|
作者
Ni, Weilin [1 ]
Qiu, Peihuan [1 ]
Lu, Baogang [2 ]
Chen, Anhong [2 ]
Liang, Haizhao [1 ]
机构
[1] Sun Yat Sen Univ, Sch Aeronaut & Astronaut, Shenzhen, Peoples R China
[2] Sci & Technol Space Phys Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Partially observable Markov decision process; Reinforcement learning; Active protection; Guidance law; PROTECTION; EVASION;
D O I
10.1145/3669721.3674516
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the active defense guidance problem for the hypersonic vehicle in target-interceptor-defender scenario. The active defense guidance problem of the hypersonic vehicle always subjects to the limitations of incomplete observation information. To tackle this issue, this paper proposes a cooperative active defense guidance based on Convolutional Deep Q-Network (CDQN) algorithm. By regarding the active defense scenario as the partially observable Markov decision process, the guidance problem is solved in the framework of reinforcement learning. In view of the spatiotemporal continuity properties of hypersonic vehicle, a stacking mechanism is proposed to process the incomplete information. Based on which, the convolutional neural networks are further employed to derive the cooperative active defense guidance law. Moreover, to tackle the sparse reward problem in CDQN's training, a continuous reward function is shaped based on environmental potential functions. Finally, numerical experiments are performed to demonstrate the performance and robustness of the proposed active defense guidance.
引用
收藏
页码:274 / 281
页数:8
相关论文
共 50 条
  • [41] Online Task Offloading in UDN: A Deep Reinforcement Learning Approach with Incomplete Information
    Lin, Ziqi
    Gu, Bo
    Zhang, Xu
    Yi, Difei
    Han, Yu
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 1236 - 1241
  • [42] Variable Horizon Reentry Guidance Based on Predictive Control and Pseudospectral Method for Hypersonic Vehicle
    Wang, Na
    Feng, Hao
    Hu, Chaofang
    Tao, Ye
    Zhou, Yimin
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 5914 - 5919
  • [43] Differential transformation-based trajectory tracking guidance scheme for hypersonic reentry vehicle
    Liu L.
    Yang L.
    Cai W.
    2017, National University of Defense Technology (39): : 23 - 29
  • [44] Moderated reinforcement learning of active and semi-active vehicle suspension control laws
    Frost, GP
    Gordon, TJ
    Howell, MN
    Wu, QH
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 1996, 210 (04) : 249 - 257
  • [45] Weights-varying MPC for Autonomous Vehicle Guidance: a Deep Reinforcement Learning Approach
    Zarrouki, Baha
    Kloes, Verena
    Heppner, Nikolas
    Schwan, Simon
    Ritschel, Robert
    Vosswinkel, Rick
    2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 119 - 125
  • [46] A guidance method for coplanar orbital interception based on reinforcement learning
    ZENG Xin
    ZHU Yanwei
    YANG Leping
    ZHANG Chengming
    Journal of Systems Engineering and Electronics, 2021, 32 (04) : 927 - 938
  • [47] Reinforcement-Learning-Based Robust Guidance for Asteroid Approaching
    Yang, Hongwei
    Hu, Jincheng
    Li, Shuang
    Bai, Xiaoli
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2024, 47 (10) : 2058 - 2072
  • [48] A guidance method for coplanar orbital interception based on reinforcement learning
    Zeng Xin
    Zhu Yanwei
    Yang Leping
    Zhang Chengming
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (04) : 927 - 938
  • [49] Crowd Evacuation Guidance Based on Combined Action Reinforcement Learning
    Xue, Yiran
    Wu, Rui
    Liu, Jiafeng
    Tang, Xianglong
    ALGORITHMS, 2021, 14 (01)
  • [50] Deep reinforcement learning-based robust missile guidance
    Ahn, Jeongsu
    Shin, Jongho
    Kim, Hyeong-Geun
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 927 - 930