Autonomous Trajectory Planning Method for Stratospheric Airship Regional Station-Keeping Based on Deep Reinforcement Learning

被引：2

作者：

Liu, Sitong ^{[1
,2
]}

Zhou, Shuyu ^{[1
]}

Miao, Jinggang ^{[1
,2
]}

Shang, Hai ^{[1
]}

Cui, Yuxuan ^{[1
]}

Lu, Ying ^{[1
]}

机构：

[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China

来源：

AEROSPACE | 2024年 / 11卷 / 09期

基金：

国家重点研发计划;

关键词：

trajectory planning; stratospheric airship; deep reinforcement learning; proximal policy optimization (PPO); regional station-keeping; VEHICLE;

D O I：

10.3390/aerospace11090753

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

The stratospheric airship, as a near-space vehicle, is increasingly utilized in scientific exploration and Earth observation due to its long endurance and regional observation capabilities. However, due to the complex characteristics of the stratospheric wind field environment, trajectory planning for stratospheric airships is a significant challenge. Unlike lower atmospheric levels, the stratosphere presents a wind field characterized by significant variability in wind speed and direction, which can drastically affect the stability of the airship's trajectory. Recent advances in deep reinforcement learning (DRL) have presented promising avenues for trajectory planning. DRL algorithms have demonstrated the ability to learn complex control strategies autonomously by interacting with the environment. In particular, the proximal policy optimization (PPO) algorithm has shown effectiveness in continuous control tasks and is well suited to the non-linear, high-dimensional problem of trajectory planning in dynamic environments. This paper proposes a trajectory planning method for stratospheric airships based on the PPO algorithm. The primary contributions of this paper include establishing a continuous action space model for stratospheric airship motion; enabling more precise control and adjustments across a broader range of actions; integrating time-varying wind field data into the reinforcement learning environment; enhancing the policy network's adaptability and generalization to various environmental conditions; and enabling the algorithm to automatically adjust and optimize flight paths in real time using wind speed information, reducing the need for human intervention. Experimental results show that, within its wind resistance capability, the airship can achieve long-duration regional station-keeping, with a maximum station-keeping time ratio (STR) of up to 0.997.

引用

页数：18

共 50 条

[1] Trajectory planning of stratospheric airship for station-keeping mission based on improved rapidly exploring random tree
Luo, Qin-chuan
Sun, Kang-wen
Chen, Tian
Zhang, Yi-fei
Zheng, Ze-wei
ADVANCES IN SPACE RESEARCH, 2024, 73 (01) : 992 - 1005
[2] Station-keeping Control of an Underactuated Stratospheric Airship
Zhou, Weixiang
Zhou, Pingfang
Wang, Yueying
Wang, Ning
Duan, Dengping
INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (03) : 715 - 732
[3] Station-keeping Control of an Underactuated Stratospheric Airship
Weixiang Zhou
Pingfang Zhou
Yueying Wang
Ning Wang
Dengping Duan
International Journal of Fuzzy Systems, 2019, 21 : 715 - 732
[4] Robustly Station-Keeping For The Airship Subject To Stratospheric Wind
Jin, Huiyu
Qiao, Jihui
Liu, Lili
TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
[5] STATION-KEEPING CONTROL STRATEGIES ANALYSIS FOR STRATOSPHERIC AIRSHIP
Zhang Lixue
Wang Zhongwei
Zhao Kun
Yang Xixiang
PROCEEDINGS OF THE 21ST ESA SYMPOSIUM ON EUROPEAN ROCKET & BALLOON PROGRAMMES AND RESEARCH, 2013, 721 : 241 - 245
[6] Recovery trajectory optimization of the solar-powered stratospheric airship for the station-keeping mission
Wang, Jie
Meng, Xiuyun
Li, Cuichun
ACTA ASTRONAUTICA, 2021, 178 : 159 - 177
[7] Backstepping Control based on Sliding Mode for Station-Keeping of Stratospheric Airship
Parsa, Ashkan
Monfared, Sadra Borji
Kalhor, Ahmad
2018 6TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM 2018), 2018, : 554 - 559
[8] Stratospheric airship trajectory planning in wind field using deep reinforcement learning
Qi, Lele
Yang, Xixiang
Bai, Fangchao
Deng, Xiaolong
Pan, Yuelong
ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 620 - 634
[9] Station-keeping of the Autonomous Airship Using Adaptive Pursuit Guidance Law
Wang, Jie
Meng, Xiuyun
Li, Cuichun
Qiu, Wenjie
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 5047 - 5052
[10] Path planning of stratospheric airship in dynamic wind field based on deep reinforcement learning
Zheng, Baojin
Zhu, Ming
Guo, Xiao
Ou, Jiajun
Yuan, Jiace
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150

← 1 2 3 4 5 →