Trajectory optimization of spacecraft autonomous far-distance rapid rendezvous based on deep reinforcement learning

Cited by: 0

Authors:

Di, Peng [1,2]

Yao, Ye [1,2]

Lin, Zheng [1,2]

Yin, Zengshan [1,2]

Affiliations:

[1] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201306, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

Source:

ADVANCES IN SPACE RESEARCH | 2025, Vol. 75, Issue 01

Keywords:

J2 perturbation; Safe area reward; Uncertainty analysis; Far-distance rapid rendezvous; Deep reinforcement learning; TIME OPTIMAL-CONTROL;

DOI:

10.1016/j.asr.2024.09.066

Chinese Library Classification (CLC):

V [Aviation, Aerospace];

Discipline classification code:

08; 0825;

Abstract:

This paper investigates the application of Deep Reinforcement Learning (DRL) to trajectory optimization for spacecraft far-distance rapid rendezvous, and uses Proximal Policy Optimization (PPO), a state-of-the-art DRL method, to solve the continuous high-thrust minimum-fuel trajectory optimization problem. The J2 perturbation is considered, its impact on the spacecraft's on-orbit operation and trajectory design is analyzed, and the effectiveness and accuracy of the proposed method are verified in two far-distance rapid rendezvous missions. To ensure the safety of the subsequent close-range operation phase, a safe area reward framework is proposed, and sparse and dense safe area reward functions are designed. The dense safe area reward function significantly improves the training efficiency of the algorithm while preserving terminal performance. In addition, possible uncertainties in the spacecraft's orbital operation, including observation uncertainty, state uncertainty and control uncertainty, are modeled and analyzed, and the performance of the proposed method under them is verified through simulation. The closed-loop performance of the policy under these uncertainties is also evaluated with Monte Carlo simulations. The results show that the PPO algorithm can effectively deal with the rendezvous problem in uncertain environments. These preliminary results demonstrate the great potential of the DRL

© 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
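The abstract names the J2 perturbation but does not give the dynamics model used in the paper. As a rough illustration only, the sketch below evaluates the standard first-order J2 perturbing acceleration in an Earth-centered inertial frame. The function name `j2_acceleration` and the constant values (WGS-84-style figures in km and s) are assumptions made for this example and are not taken from the paper.

```python
import math

# Assumed Earth constants (km, s); illustrative values, not taken from the paper.
MU_EARTH = 398600.4418      # gravitational parameter, km^3/s^2
R_EARTH = 6378.137          # equatorial radius, km
J2_EARTH = 1.08262668e-3    # second zonal harmonic coefficient


def j2_acceleration(x, y, z):
    """Return the first-order J2 perturbing acceleration (km/s^2).

    Position (x, y, z) is in km in an Earth-centered inertial frame
    whose z axis is aligned with Earth's rotation axis.
    """
    r2 = x * x + y * y + z * z
    r = math.sqrt(r2)
    # Common factor -(3/2) * J2 * mu * Re^2 / r^5
    k = -1.5 * J2_EARTH * MU_EARTH * R_EARTH ** 2 / r ** 5
    zr2 = 5.0 * z * z / r2
    ax = k * x * (1.0 - zr2)
    ay = k * y * (1.0 - zr2)
    az = k * z * (3.0 - zr2)
    return ax, ay, az


if __name__ == "__main__":
    # Hypothetical sample position: 7000 km from Earth's center, on the equator.
    print(j2_acceleration(7000.0, 0.0, 0.0))
```

In a simulation environment this term would typically be added to the two-body acceleration before integrating the equations of motion; how the paper combines it with thrust is not stated in the abstract.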

Pages: 790-806

Page count: 17
