Autonomous Decision-Making for Aerobraking via Parallel Randomized Deep Reinforcement Learning

Cited by: 6

Authors:

Falcone, Giusy [1,3]

Putnam, Zachary R. R. [2]

Affiliations:

[1] Univ Illinois, Champaign, IL 61801 USA

[2] Univ Illinois, Dept Aerosp Engn, Champaign, IL 61801 USA

[3] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA

Source:

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS | 2023 / Vol. 59 / No. 03

Keywords:

Space vehicles; Planetary orbits; Mars; Decision making; Computer architecture; Atmospheric modeling; Reinforcement learning; Aerobraking; deep reinforcement learning (DRL); domain randomization; ACCELEROMETER DATA; MARS; MISSION; COST;

DOI:

10.1109/TAES.2022.3221697

Chinese Library Classification (CLC) number:

V [Aeronautics, Astronautics];

Subject classification code:

08 ; 0825 ;

Abstract:

Aerobraking inserts a spacecraft into a low orbit around a planet through many passes through the planet's atmosphere. These atmospheric passes are challenging because the atmospheric environment is highly variable. This paper develops a parallel, domain-randomized deep reinforcement learning (DRL) architecture for autonomous decision-making in stochastic environments such as aerobraking atmospheric passes. In this context, the architecture plans aerobraking maneuvers that avoid thermal violations during the atmospheric passes and target a final low-altitude orbit. The architecture is designed to account for large variability in the physical model as well as uncertain conditions. The parallel approach speeds up training for simulation-based applications, and domain randomization improves the generalization of the resulting policy. The framework is applied to the 2001 Mars Odyssey aerobraking campaign and compared against both the mission flight data and a state-of-the-art heuristic for autonomous aerobraking based on a Numerical Predictor Corrector (NPC). Relative to the heuristic, the proposed architecture reduces the number of thermal violations by 97.5%. Relative to the Mars Odyssey flight data, it reduces the number of thermal violations by 98.7% and requires 13.9% fewer orbits. Results also show that the proposed architecture can learn a generalized policy in the presence of strong uncertainties, such as aggressive atmospheric density perturbations, different atmospheric density models, and a different simulator maximum step size and error accuracy.
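
The idea of pairing parallel rollouts with domain randomization can be illustrated with a minimal sketch. This is not the authors' implementation: the exponential atmosphere, the constants, the perturbation ranges, the heat-rate proxy, and every function name below are assumptions chosen only for illustration. Each worker samples its own perturbed atmosphere, so a policy trained on the pooled experience would see many density conditions rather than one.

```python
import math
import random
from concurrent.futures import ThreadPoolExecutor

# Illustrative constants only (assumptions, not values from the paper).
RHO_REF = 1.0e-8      # hypothetical reference density, kg/m^3
H_REF = 110e3         # hypothetical reference altitude, m
SCALE_HEIGHT = 8e3    # hypothetical nominal scale height, m


def sample_domain(rng):
    """Draw one randomized environment: density multiplier and scale height."""
    return {
        "density_factor": rng.uniform(0.5, 2.0),
        "scale_height": SCALE_HEIGHT * rng.uniform(0.8, 1.2),
    }


def density(altitude_m, domain):
    """Exponential atmosphere perturbed by the sampled domain parameters."""
    exponent = -(altitude_m - H_REF) / domain["scale_height"]
    return domain["density_factor"] * RHO_REF * math.exp(exponent)


def rollout(seed, periapsis_altitude_m=105e3, speed_m_s=4500.0):
    """One worker: sample a domain and evaluate a heat-rate proxy at periapsis."""
    rng = random.Random(seed)  # per-worker seed keeps each rollout reproducible
    domain = sample_domain(rng)
    rho = density(periapsis_altitude_m, domain)
    heat_rate = 0.5 * rho * speed_m_s ** 3  # simple proxy, W/m^2 (assumption)
    return {"seed": seed, "domain": domain, "heat_rate": heat_rate}


def collect_parallel(num_workers=4):
    """Run independent rollouts in parallel, one randomized domain per worker."""
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        return list(pool.map(rollout, range(num_workers)))


if __name__ == "__main__":
    for result in collect_parallel():
        print(result["seed"], result["domain"], result["heat_rate"])
```

In a full training loop, the per-worker results would be replaced by state, action, and reward trajectories fed to a DRL learner, and a thermal violation would be flagged when the heat rate exceeds a mission-specific limit.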

Pages: 3055-3070

Page count: 16
