Active Defence Guidance for Hypersonic Vehicle with Incomplete Information Based on Reinforcement Learning

被引:0
|
作者
Ni, Weilin [1 ]
Qiu, Peihuan [1 ]
Lu, Baogang [2 ]
Chen, Anhong [2 ]
Liang, Haizhao [1 ]
机构
[1] Sun Yat Sen Univ, Sch Aeronaut & Astronaut, Shenzhen, Peoples R China
[2] Sci & Technol Space Phys Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Partially observable Markov decision process; Reinforcement learning; Active protection; Guidance law; PROTECTION; EVASION;
D O I
10.1145/3669721.3674516
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the active defense guidance problem for the hypersonic vehicle in target-interceptor-defender scenario. The active defense guidance problem of the hypersonic vehicle always subjects to the limitations of incomplete observation information. To tackle this issue, this paper proposes a cooperative active defense guidance based on Convolutional Deep Q-Network (CDQN) algorithm. By regarding the active defense scenario as the partially observable Markov decision process, the guidance problem is solved in the framework of reinforcement learning. In view of the spatiotemporal continuity properties of hypersonic vehicle, a stacking mechanism is proposed to process the incomplete information. Based on which, the convolutional neural networks are further employed to derive the cooperative active defense guidance law. Moreover, to tackle the sparse reward problem in CDQN's training, a continuous reward function is shaped based on environmental potential functions. Finally, numerical experiments are performed to demonstrate the performance and robustness of the proposed active defense guidance.
引用
收藏
页码:274 / 281
页数:8
相关论文
共 50 条
  • [31] Automatic Ultrasound Guidance Based on Deep Reinforcement Learning
    Jarosik, Piotr
    Lewandowski, Marcin
    2019 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2019, : 475 - 478
  • [32] A dynamic route guidance arithmetic based on reinforcement learning
    Zhang, Z
    Xu, JM
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 3607 - 3611
  • [33] Reinforcement Learning-Based Guidance of Autonomous Vehicles
    Clemmons, Joseph
    Jin, Yu-Fang
    2023 24TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED, 2023, : 496 - 501
  • [34] Barrier Lyapunov function based reinforcement learning control for air-breathing hypersonic vehicle with variable geometry inlet
    Liu, Chen
    Dong, Chaoyang
    Zhou, Zhijie
    Wang, Zhaolei
    AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 96
  • [35] Reinforcement learning based parameter optimization of active disturbance rejection control for autonomous underwater vehicle
    SONG Wanping
    CHEN Zengqiang
    SUN Mingwei
    SUN Qinglin
    Journal of Systems Engineering and Electronics, 2022, 33 (01) : 170 - 179
  • [36] Reinforcement learning based parameter optimization of active disturbance rejection control for autonomous underwater vehicle
    Song Wanping
    Chen Zengqiang
    Sun Mingwei
    Sun Qinglin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2022, 33 (01) : 170 - 179
  • [37] Reinforcement Learning Method for Multi-spacecraft Orbital Game with Incomplete Information
    Wang Y.
    Yuan L.
    Tang L.
    Huang H.
    Geng Y.
    Yuhang Xuebao/Journal of Astronautics, 2023, 44 (10): : 1522 - 1533
  • [38] Fault-Tolerant Integrated Guidance and Control Design for Hypersonic Vehicle Based on PPO
    Song, Jia
    Luo, Yuxie
    Zhao, Mingfei
    Hu, Yunlong
    Zhang, Yanxue
    MATHEMATICS, 2022, 10 (18)
  • [39] Optimal Feedback Reentry Guidance of Hypersonic Vehicle Based on Improved Gauss Pseudospectral Method
    Sun, Yong
    Duan, Guangren
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 2457 - 2462
  • [40] Fault-tolerant guidance for hypersonic vehicle based on predictor-corrector strategy
    Meng, Yizhen
    Jiang, Bin
    Qi, Ruiyun
    IFAC PAPERSONLINE, 2017, 50 (01): : 5244 - 5249