Trajectory optimization of spacecraft autonomous far-distance rapid rendezvous based on deep reinforcement learning

Cited by: 0
Authors
Di, Peng [1, 2]
Yao, Ye [1, 2]
Lin, Zheng [1, 2]
Yin, Zengshan [1, 2]
Affiliations
[1] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201306, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Keywords
J2 perturbation; Safe area reward; Uncertainty analysis; Far-distance rapid rendezvous; Deep reinforcement learning; TIME OPTIMAL-CONTROL
DOI
10.1016/j.asr.2024.09.066
Chinese Library Classification (CLC) number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
This paper investigates the application of deep reinforcement learning (DRL) to the trajectory optimization of spacecraft far-distance rapid rendezvous, and uses the most advanced DRL method, Proximal Policy Optimization (PPO), to solve the continuous high-thrust minimum-fuel trajectory optimization problem. The J2 perturbation is considered, its impact on the spacecraft's on-orbit operation and trajectory design is analyzed, and the effectiveness and accuracy of the proposed method are verified in two far-distance rapid rendezvous missions. To ensure the safety of the subsequent close-range operation phase, a safe area reward framework is proposed, and sparse and dense safe area reward functions are designed. The dense safe area reward function significantly improves the training efficiency of the algorithm while preserving terminal performance. In addition, possible uncertainties in the spacecraft's orbital operation, including observation uncertainty, state uncertainty, and control uncertainty, are modeled and analyzed, and the performance of the proposed method is verified through simulation. The closed-loop performance of the policy under these uncertainties is also evaluated with Monte Carlo simulations. The results show that the PPO algorithm can effectively handle the rendezvous problem in uncertain environments. These preliminary results demonstrate the great potential of the DRL © 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
Pages: 790-806
Page count: 17
Related papers
50 in total
  • [21] Generalized autonomous optimization for quantum transmitters with deep reinforcement learning
    Lo, Yuen San
    Woodward, Robert I.
    Paraiso, Taofiq K.
    Poudel, Rudra P. K.
    Shields, Andrew J.
    QUANTUM COMPUTING, COMMUNICATION, AND SIMULATION IV, 2024, 12911
  • [22] Autonomous Vehicle Fuel Economy Optimization with Deep Reinforcement Learning
    Kim, Hyunkun
    Pyeon, Hyeongoo
    Park, Jong Sool
    Hwang, Jin Young
    Lim, Sejoon
    ELECTRONICS, 2020, 9 (11) : 1 - 19
  • [23] Deep reinforcement learning based trajectory optimization for UAV-enabled IoT with SWIPT
    Yang, Yuwen
    Liu, Xin
    AD HOC NETWORKS, 2024, 159
  • [24] Airborne Radar Trajectory Optimization Based on Deep Reinforcement Learning in Extended Target Tracking
    Zhang, Hongyun
    Xi, Lei
    Chen, Hui
    Zhang, Wenxu
    Liu, Jiabin
    Li, Tao
    Liu, Jianrong
    2024 43RD CHINESE CONTROL CONFERENCE, CCC 2024, 2024, : 2106 - 2111
  • [25] Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Deep Learning and Quadratic Optimization
    Li, Han
    Chen, Peng
    Yu, Guizhen
    Zhou, Bin
    Li, Yiming
    Liao, Yaping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (04) : 4886 - 4903
  • [26] Deep Reinforcement Learning Based Autonomous Control Approach for Power System Topology Optimization
    Han, Xiaoyun
    Hao, Yi
    Chong, Zhiqiang
    Ma, Shiqiang
    Mu, Chaoxu
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6041 - 6046
  • [27] Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
    Ashraf, Nesma M.
    Mostafa, Reham R.
    Sakr, Rasha H.
    Rashad, M. Z.
    PLOS ONE, 2021, 16 (06)
  • [28] Deep Reinforcement Learning for Autonomous Dynamic Skid Steer Vehicle Trajectory Tracking
    Srikonda, Sandeep
    Norris, William Robert
    Nottage, Dustin
    Soylemezoglu, Ahmet
    ROBOTICS, 2022, 11 (05)
  • [29] A Deep Reinforcement Learning Approach for Federated Learning Optimization with UAV Trajectory Planning
    Zhang, Chunyu
    Liu, Yiming
    Zhang, Zhi
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023
  • [30] Learning Basketball Dribbling Skills Using Trajectory Optimization and Deep Reinforcement Learning
    Liu, Libin
    Hodgins, Jessica
    ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04)