A novel guidance law based on proximal policy optimization

被引:0
|
作者
Jiang, Yang [1 ]
Yu, Jianglong [1 ]
Li, Qingdong [1 ]
Ren, Zhang [1 ]
Done, Xiwang [1 ,2 ]
Hua, Yongzhao [1 ,2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
reinforcement learning; proximal policy optimization; high-speed maneuvering target; SLIDING-MODE CONTROL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new guidance law based on deep reinforcement learning is proposed for the high-speed maneuvering target attack problem. Firstly, the missile-target kinematic model is established, and the action space and state space of reinforcement learning are designed. Then, according to the missile strike process, the reward function suitable for this scenario is proposed. The proximal policy optimization (PPO) based guidance law construction is completed. Finally, the strike effect in multiple sets of experiments verifies the effectiveness of the method proposed in this paper.
引用
收藏
页码:3364 / 3369
页数:6
相关论文
共 50 条
  • [1] Proximal policy optimization for UAV autonomous guidance, tracking and obstacle avoidance
    Hu D.
    Dong W.
    Xie W.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2023, 49 (01): : 195 - 205
  • [2] A Novel Proximal Policy Optimization Approach for Filter Design
    Fan, Dongdong
    Ding, Shuai
    Zhang, Haotian
    Zhang, Weihao
    Jia, Qingsong
    Han, Xu
    Tang, Hao
    Zhu, Zhaojun
    Zhou, Yuliang
    APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL, 2024, 39 (05): : 390 - 395
  • [3] Thrust Vectored Rocket Landing Integrated Guidance and Control with Proximal Policy Optimization
    Souza, Gabriel de Almeida
    Silva, Octavio Mathias
    Maximo, Marcos R. O. A.
    2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE), 2022, : 55 - 60
  • [4] An AGC Dynamic Optimization Method Based on Proximal Policy Optimization
    Liu, Zhao
    Li, Jiateng
    Zhang, Pei
    Ding, Zhenhuan
    Zhao, Yanshun
    FRONTIERS IN ENERGY RESEARCH, 2022, 10
  • [5] A novel intelligent anti-jamming communication algorithm based on proximal policy optimization
    Ding, Huihui
    Niu, Yingtao
    Zhou, Quan
    Peng, Xiang
    PHYSICAL COMMUNICATION, 2024, 65
  • [6] Proximal policy optimization guidance algorithm for intercepting near-space maneuvering targets
    Chen, Wenxue
    Gao, Changsheng
    Jing, Wuxing
    AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 132
  • [7] Proximal Policy Optimization With Policy Feedback
    Gu, Yang
    Cheng, Yuhu
    Chen, C. L. Philip
    Wang, Xuesong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4600 - 4610
  • [8] Proximal policy optimization with model-based methods
    Li, Shuailong
    Zhang, Wei
    Zhang, Huiwen
    Zhang, Xin
    Leng, Yuquan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (06) : 5399 - 5410
  • [9] Proximal Policy Optimization-Based Optimization of Microwave Planar Resonators
    Pan, Jia-Hao
    Liu, Qi Qiang
    Zhao, Wen-Sheng
    Hu, Xiaoping
    You, Bin
    Hu, Yue
    Wang, Jing
    Yu, Chenghao
    Wang, Da-Wei
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2024, 14 (12): : 2339 - 2347
  • [10] Reinforcement Learning-Based 3-D Sliding Mode Interception Guidance via Proximal Policy Optimization
    Guo J.
    Li M.
    Guo Z.
    She Z.
    IEEE Journal on Miniaturization for Air and Space Systems, 2023, 4 (04): : 423 - 430