Autonomous Decision-Making for Aerobraking via Parallel Randomized Deep Reinforcement Learning

被引:6
|
作者
Falcone, Giusy [1 ,3 ]
Putnam, Zachary R. R. [2 ]
机构
[1] Univ Illinois, Champaign, IL 61801 USA
[2] Univ Illinois, Dept Aerosp Engn, Champaign, IL 61801 USA
[3] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
关键词
Space vehicles; Planetary orbits; Mars; Decision making; Computer architecture; Atmospheric modeling; Reinforcement learning; Aerobraking; deep reinforcement learning (DRL); domain randomization; ACCELEROMETER DATA; MARS; MISSION; COST;
D O I
10.1109/TAES.2022.3221697
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Aerobraking is used to insert a spacecraft into a low orbit around a planet through many orbital passages into its complex atmosphere. The aerobraking atmospheric passages are challenging because of the high variability of the atmospheric environment. This paper develops a parallel domain randomized deep reinforcement learning architecture for autonomous decision-making in a stochastic environment, such as aerobraking atmospheric passages. In this context, the architecture is used for planning aerobraking maneuvers to avoid the occurrence of thermal violations during the atmospheric aerobraking passages and target a final low-altitude orbit. The parallel domain randomized deep reinforcement learning architecture is designed to account for large variability of the physical model, as well as uncertain conditions. Also, the parallel approach speeds up the training process for simulation-based applications, and domain randomization improves resultant policy generalization. This framework is applied to the 2001 Mars Odyssey aerobraking campaign; with respect to the 2001 Mars Odyssey mission flight data and a Numerical Predictor Corrector (NPC)-based state-of-the-art heuristic for autonomous aerobraking, the proposed architecture outperforms the state-of-the-art heuristic algorithm with a decrease of 97.5% in the number of thermal violations. Furthermore, it yields a reduction of 98.7% in the number of thermal violations with respect to the Mars Odyssey mission flight data and requires 13.9% fewer orbits. Results also show that the proposed architecture can also learn a generalized policy in the presence of strong uncertainties, such as aggressive atmospheric density perturbations, different atmospheric density models, and a different simulator maximum step size and error accuracy.
引用
收藏
页码:3055 / 3070
页数:16
相关论文
共 50 条
  • [21] An Integrated Lateral and Longitudinal Decision-Making Model for Autonomous Driving Based on Deep Reinforcement Learning
    Cui, Jianxun
    Zhao, Boyuan
    Qu, Mingcheng
    JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023
  • [22] Autonomous Vehicles' Decision-Making Behavior in Complex Driving Environments Using Deep Reinforcement Learning
    Qi, Xiao
    Ye, Yingjun
    Sun, Jian
    CICTP 2019: TRANSPORTATION IN CHINA-CONNECTING THE WORLD, 2019, : 5853 - 5864
  • [23] Decision-Making of an Autonomous Vehicle when Approached by an Emergency Vehicle using Deep Reinforcement Learning
    Shoaraee, Hamid
    Chen, Liang
    Jiang, Fan
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 185 - 191
  • [24] A novel deep reinforcement learning for POMDP-based autonomous ship collision decision-making
    Zhang, Xinyu
    Zheng, Kangjie
    Wang, Chengbo
    Chen, Jihong
    Qi, Huaiyuan
    NEURAL COMPUTING & APPLICATIONS, 2023,
  • [25] Deep imitative reinforcement learning with gradient conflict-free for decision-making in autonomous vehicles
    Shan, Zitong
    Zhao, Jian
    Huang, Wenhui
    Zhao, Yang
    Ge, Linhe
    Zhong, Shouren
    Hu, Hongyu
    Lv, Chen
    Zhu, Bing
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2025, 173
  • [26] Leveraging on Deep Reinforcement Learning for Autonomous Safe Decision-Making in Highway On-ramp Merging
    Kherroubi, Zine el Abidine
    Aknine, Samir
    Bacha, Rebiha
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15815 - 15816
  • [27] Autonomous Industrial Management via Reinforcement Learning Towards Self-Learning Agents for Decision-Making
    Espinosa-Leal, Leonardo
    Chapman, Anthony
    Westerlund, Magnus
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (06) : 8427 - 8439
  • [28] Intelligent Decision-Making of Scheduling for Dynamic Permutation Flowshop via Deep Reinforcement Learning
    Yang, Shengluo
    Xu, Zhigang
    Wang, Junyi
    SENSORS, 2021, 21 (03) : 1 - 21
  • [29] Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning
    Li, Bo
    Huang, Jingyi
    Bai, Shuangxia
    Gan, Zhigang
    Liang, Shiyang
    Evgeny, Neretin
    Yao, Shouwen
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 64 - 81
  • [30] A DECISION-MAKING METHOD FOR AUTONOMOUS VEHICLES BASED ON SIMULATION AND REINFORCEMENT LEARNING
    Zheng, Rui
    Liu, Chunming
    Guo, Qi
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 362 - 369