Autonomous Trajectory Planning Method for Stratospheric Airship Regional Station-Keeping Based on Deep Reinforcement Learning

被引:2
|
作者
Liu, Sitong [1 ,2 ]
Zhou, Shuyu [1 ]
Miao, Jinggang [1 ,2 ]
Shang, Hai [1 ]
Cui, Yuxuan [1 ]
Lu, Ying [1 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
基金
国家重点研发计划;
关键词
trajectory planning; stratospheric airship; deep reinforcement learning; proximal policy optimization (PPO); regional station-keeping; VEHICLE;
D O I
10.3390/aerospace11090753
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The stratospheric airship, as a near-space vehicle, is increasingly utilized in scientific exploration and Earth observation due to its long endurance and regional observation capabilities. However, due to the complex characteristics of the stratospheric wind field environment, trajectory planning for stratospheric airships is a significant challenge. Unlike lower atmospheric levels, the stratosphere presents a wind field characterized by significant variability in wind speed and direction, which can drastically affect the stability of the airship's trajectory. Recent advances in deep reinforcement learning (DRL) have presented promising avenues for trajectory planning. DRL algorithms have demonstrated the ability to learn complex control strategies autonomously by interacting with the environment. In particular, the proximal policy optimization (PPO) algorithm has shown effectiveness in continuous control tasks and is well suited to the non-linear, high-dimensional problem of trajectory planning in dynamic environments. This paper proposes a trajectory planning method for stratospheric airships based on the PPO algorithm. The primary contributions of this paper include establishing a continuous action space model for stratospheric airship motion; enabling more precise control and adjustments across a broader range of actions; integrating time-varying wind field data into the reinforcement learning environment; enhancing the policy network's adaptability and generalization to various environmental conditions; and enabling the algorithm to automatically adjust and optimize flight paths in real time using wind speed information, reducing the need for human intervention. Experimental results show that, within its wind resistance capability, the airship can achieve long-duration regional station-keeping, with a maximum station-keeping time ratio (STR) of up to 0.997.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Trajectory planning of stratospheric airship for station-keeping mission based on improved rapidly exploring random tree
    Luo, Qin-chuan
    Sun, Kang-wen
    Chen, Tian
    Zhang, Yi-fei
    Zheng, Ze-wei
    ADVANCES IN SPACE RESEARCH, 2024, 73 (01) : 992 - 1005
  • [2] Station-keeping Control of an Underactuated Stratospheric Airship
    Zhou, Weixiang
    Zhou, Pingfang
    Wang, Yueying
    Wang, Ning
    Duan, Dengping
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (03) : 715 - 732
  • [3] Station-keeping Control of an Underactuated Stratospheric Airship
    Weixiang Zhou
    Pingfang Zhou
    Yueying Wang
    Ning Wang
    Dengping Duan
    International Journal of Fuzzy Systems, 2019, 21 : 715 - 732
  • [4] Robustly Station-Keeping For The Airship Subject To Stratospheric Wind
    Jin, Huiyu
    Qiao, Jihui
    Liu, Lili
    TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [5] STATION-KEEPING CONTROL STRATEGIES ANALYSIS FOR STRATOSPHERIC AIRSHIP
    Zhang Lixue
    Wang Zhongwei
    Zhao Kun
    Yang Xixiang
    PROCEEDINGS OF THE 21ST ESA SYMPOSIUM ON EUROPEAN ROCKET & BALLOON PROGRAMMES AND RESEARCH, 2013, 721 : 241 - 245
  • [6] Recovery trajectory optimization of the solar-powered stratospheric airship for the station-keeping mission
    Wang, Jie
    Meng, Xiuyun
    Li, Cuichun
    ACTA ASTRONAUTICA, 2021, 178 : 159 - 177
  • [7] Backstepping Control based on Sliding Mode for Station-Keeping of Stratospheric Airship
    Parsa, Ashkan
    Monfared, Sadra Borji
    Kalhor, Ahmad
    2018 6TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM 2018), 2018, : 554 - 559
  • [8] Stratospheric airship trajectory planning in wind field using deep reinforcement learning
    Qi, Lele
    Yang, Xixiang
    Bai, Fangchao
    Deng, Xiaolong
    Pan, Yuelong
    ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 620 - 634
  • [9] Station-keeping of the Autonomous Airship Using Adaptive Pursuit Guidance Law
    Wang, Jie
    Meng, Xiuyun
    Li, Cuichun
    Qiu, Wenjie
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 5047 - 5052
  • [10] Path planning of stratospheric airship in dynamic wind field based on deep reinforcement learning
    Zheng, Baojin
    Zhu, Ming
    Guo, Xiao
    Ou, Jiajun
    Yuan, Jiace
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 150