Adaptive generalized ZEM-ZEV feedback guidance for planetary landing via a deep reinforcement learning approach

Cited by: 56
Authors
Furfaro, Roberto [1]
Scorsoglio, Andrea [2]
Linares, Richard [3]
Massari, Mauro [4]
Affiliations
[1] Univ Arizona, Dept Syst & Ind Engn, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Syst & Ind Engn, Tucson, AZ 85721 USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
[4] Politecn Milan, Dept Aerosp Sci & Technol, I-20156 Milan, Italy
Keywords
Optimal landing guidance; Deep reinforcement learning; Closed-loop guidance
DOI
10.1016/j.actaastro.2020.02.051
Chinese Library Classification (CLC) number
V [Aeronautics, Astronautics]
Discipline classification codes
08; 0825
Abstract
Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and remains an active field of research. The algorithm, although powerful in terms of accuracy and ease of implementation, has some limitations. Therefore, in this paper we present an adaptive guidance algorithm based on classical ZEM/ZEV in which machine learning is used to overcome these limitations and create a closed-loop guidance algorithm that is lightweight enough to be implemented on board a spacecraft and flexible enough to adapt to the given constraint scenario. The adopted methodology is an actor-critic reinforcement learning algorithm that learns the parameters of the above-mentioned guidance architecture according to the given problem constraints.
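For orientation, the classical ZEM/ZEV law that the abstract builds on can be sketched in a few lines. This is a minimal illustration under a constant-gravity assumption, not the authors' implementation: the function name `zem_zev_accel` and the gain arguments `k_r` and `k_v` are hypothetical, and reading the "parameters" learned by the actor-critic agent as these two gains is an assumption that the abstract alone does not confirm. With `k_r = 6` and `k_v = -2` the expression reduces to the well-known energy-optimal ZEM/ZEV command.

```python
import numpy as np


def zem_zev_accel(r, v, r_f, v_f, t_go, g, k_r=6.0, k_v=-2.0):
    """Commanded acceleration from a generalized ZEM/ZEV feedback law.

    Assumes constant gravity g over the remaining flight time t_go > 0.
    k_r and k_v are hypothetical names for the two feedback gains; the
    defaults (6, -2) give the classical energy-optimal law.
    """
    r, v, r_f, v_f, g = (np.asarray(x, dtype=float) for x in (r, v, r_f, v_f, g))
    # Zero-effort miss: target position minus the position reached at the
    # final time if no control acceleration is applied from now on.
    zem = r_f - (r + v * t_go + 0.5 * g * t_go**2)
    # Zero-effort velocity: target velocity minus the velocity reached at
    # the final time if no control acceleration is applied from now on.
    zev = v_f - (v + g * t_go)
    return (k_r / t_go**2) * zem + (k_v / t_go) * zev


# Usage: a lander that is already at rest on the target point.
g_mars = np.array([0.0, 0.0, -3.71])  # illustrative gravity vector, m/s^2
a_cmd = zem_zev_accel(
    r=[0.0, 0.0, 0.0], v=[0.0, 0.0, 0.0],
    r_f=[0.0, 0.0, 0.0], v_f=[0.0, 0.0, 0.0],
    t_go=10.0, g=g_mars,
)
```

In the usage case the vehicle already sits at the target state, so the classical gains command an acceleration that exactly cancels gravity. An adaptive scheme of the kind the abstract describes would replace the fixed gains with values supplied by a learned policy at each guidance step.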
Pages: 156 - 171
Page count: 16
Related papers
50 entries in total
  • [1] WAYPOINT-BASED GENERALIZED ZEM/ZEV FEEDBACK GUIDANCE FOR PLANETARY LANDING VIA A REINFORCEMENT LEARNING APPROACH
    Furfaro, Roberto
    Linares, Richard
    THIRD IAA CONFERENCE ON DYNAMICS AND CONTROL OF SPACE SYSTEMS 2017, 2017, 161 : 401 - 416
  • [2] Collision avoidance ZEM/ZEV optimal feedback guidance for powered descent phase of landing on Mars
    Zhang, Yao
    Guo, Yanning
    Ma, Guangfu
    Zeng, Tianyi
    ADVANCES IN SPACE RESEARCH, 2017, 59 (06) : 1514 - 1525
  • [3] Optimal terminal-time determination for the ZEM/ZEV feedback guidance law with generalized performance index
    Ahn, Jaemyung
    Wang, Pengyu
    Guo, Yanning
    Wie, Bong
    ASTRODYNAMICS, 2019, 3 (02) : 127 - 136
  • [4] Autonomous Planetary Landing via Deep Reinforcement Learning and Transfer Learning
    Ciabatti, Giulia
    Daftry, Shreyansh
    Capobianco, Roberto
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021 : 2031 - 2038
  • [5] Optimal terminal-time determination for the ZEM/ZEV feedback guidance law with generalized performance index
    Jaemyung Ahn
    Pengyu Wang
    Yanning Guo
    Bong Wie
    Astrodynamics, 2019, 3 : 127 - 136
  • [6] MULTIPLE SLIDING SURFACE GUIDANCE FOR PLANETARY LANDING: TUNING AND OPTIMIZATION VIA REINFORCEMENT LEARNING
    Wibben, Daniel R.
    Gaudet, Brian
    Furfaro, Roberto
    Simo, Jules
    SPACEFLIGHT MECHANICS 2013, PTS I-IV, 2013, 148 : 1881 - 1900
  • [7] A RECURRENT DEEP ARCHITECTURE FOR QUASI-OPTIMAL FEEDBACK GUIDANCE IN PLANETARY LANDING
    Furfaro, Roberto
    Bloise, Ilaria
    Orlandelli, Marcello
    Di Lizia, Pierluigi
    Topputo, Francesco
    Linares, Richard
    FIRST IAA/AAS SCITECH FORUM ON SPACE FLIGHT MECHANICS AND SPACE STRUCTURES AND MATERIALS, 2020, 170 : 151 - 174
  • [8] Terminal Multiple Surface Sliding Guidance for Planetary Landing: Development, Tuning and Optimization via Reinforcement Learning
    Roberto Furfaro
    Daniel R. Wibben
    Brian Gaudet
    Jules Simo
    The Journal of the Astronautical Sciences, 2015, 62 : 73 - 99
  • [9] Terminal Multiple Surface Sliding Guidance for Planetary Landing: Development, Tuning and Optimization via Reinforcement Learning
    Furfaro, Roberto
    Wibben, Daniel R.
    Gaudet, Brian
    Simo, Jules
    JOURNAL OF THE ASTRONAUTICAL SCIENCES, 2015, 62 (01) : 73 - 99