Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

被引：1

作者：

Sharma, Kanta Prasad ^{[1
]}

Kumar, Indradeep ^{[2
]}

Singh, Pavitar Parkash ^{[3
]}

Anbazhagan, K. ^{[4
]}

Albarakati, Hussain Mobarak ^{[5
]}

Bhatt, Mohammed Wasim ^{[6
]}

Ziyadullayevich, Avlokulov Anvar ^{[7
]}

Rana, Arti ^{[8
]}

Sivasankari, S. A. ^{[9
]}

机构：

[1] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India

[2] Inst Aeronaut Engn, Dept Aeronaut Engn, Hyderabad 500043, Telangana, India

[3] Lovely Profess Univ, Dept Management, Phagwara, India

[4] SIMATS, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, India

[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Comp Engn & Networks Dept, Mecca 24382, Saudi Arabia

[6] Model Inst Engn & Technol, Jammu, J&K, India

[7] Tashkent Inst Finance, Dept Audit, Tashkent, Uzbekistan

[8] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun 248007, India

[9] Vignans Fdn Sci Technol & Res, Dept ECE, Guntur 522213, India

来源：

COMPUTERS IN HUMAN BEHAVIOR | 2024年 / 153卷

关键词：

Proximal Policy Optimization; Deep Deterministic Policy Gradient; Reinforcement Learning; Markov Model; Rendezvous and Docking Mission; ARTIFICIAL POTENTIAL-FIELD; SLIDING MODE CONTROL; COLLISION-AVOIDANCE; MANEUVERS;

D O I：

10.1016/j.chb.2023.108110

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

As spacecraft rendezvous and docking missions become increasingly complex, the need for advanced solutions has surged. In recent years, the application of reinforcement learning techniques to tackle spacecraft rendezvous guidance challenges has emerged as a prominent international trend. Vital to ensuring the secure rendezvous and docking of spacecraft is the task of obstacle avoidance. However, traditional reinforcement learning algorithms lack the ability to enforce safety constraints within the exploration space, which presents a formidable obstacle in the design of spacecraft rendezvous guidance strategies. In response to this challenge, a spacecraft rendezvous guidance methodology founded on safety reinforcement learning is proposed. Firstly, a Markov model is crafted for autonomous spacecraft rendezvous in scenarios involving collision avoidance. A reward system, contingent on obstacle warnings and collision avoidance constraints, is introduced to establish a safety reinforcement learning framework for devising spacecraft rendezvous guidance strategies. Secondly, within the framework of safety reinforcement learning, two deep reinforcement learning (DRL) algorithms, Proximal Policy Optimisation (PPO) and Deep Deterministic Policy Gradient (DDPG), are leveraged to generate these guidance strategies. Experimental findings validate the effectiveness of this approach in successfully executing obstacle avoidance and achieving rendezvous with remarkable precision. Furthermore, through an analysis of the performance and generalization capabilities of these two algorithms, the efficacy of the proposed methodology is further underscored. This fusion of advanced space guidance technology with the principles of Ubiquitous Learning marks a significant step forward in the quest for safer and more efficient spacecraft rendezvous and docking operations.

引用

页数：13

共 50 条

[31] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
Callum Wilson
Annalisa Riccardi
Optimization and Engineering, 2023, 24 : 223 - 255
[32] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
Wilson, Callum
Riccardi, Annalisa
OPTIMIZATION AND ENGINEERING, 2023, 24 (01) : 223 - 255
[33] REINFORCEMENT LEARNING FOR SPACECRAFT MANEUVERING NEAR SMALL BODIES
Willis, Stefan
Izzo, Dario
Hennes, Daniel
SPACEFLIGHT MECHANICS 2016, PTS I-IV, 2016, 158 : 1351 - 1368
[34] Vision-based attitude estimation for spacecraft docking operation through deep learning algorithm
Phisannupawong, Thaweerath
Kamsing, Patcharin
Torteeka, Peerapong
Yooyen, Soemsak
2020 22ND INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): DIGITAL SECURITY GLOBAL AGENDA FOR SAFE SOCIETY!, 2020, : 280 - 284
[35] ADAPTIVE CONTROL BY REINFORCEMENT LEARNING FOR SPACECRAFT ATTITUDE CONTROL
Ramadan, Mohammad
Younes, Ahmad Bani
SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1805 - 1815
[36] Deep Reinforcement Learning for Spacecraft Proximity Operations Guidance
Hovell, Kirk
Ulrich, Steve
JOURNAL OF SPACECRAFT AND ROCKETS, 2021, 58 (02) : 254 - 264
[37] Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning
Hsu, Kai-Chieh
Rubies-Royo, Vicenc
Tomlin, Claire J.
Fisac, Jaime F.
ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
[38] Advancing Aviation Safety Through Machine Learning and Psychophysiological Data: A Systematic Review
Alreshidi, Ibrahim
Moulitsas, Irene
Jenkins, Karl W.
IEEE ACCESS, 2024, 12 : 5132 - 5150
[39] Research on autonomous decision-making method for spacecraft in the mission of rendezvous and approaching to maneuvering target based on deep reinforcement learning
Huang, Cheng
Xing, Aijia
Zeng, Quanli
Xiong, Fangyu
ASIAN JOURNAL OF CONTROL, 2025,
[40] Learning to flock through reinforcement
Durve, Mihir
Perumal, Fernando
Celani, Antonio
PHYSICAL REVIEW E, 2020, 102 (01)

← 1 2 3 4 5 →