Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

被引：1

作者：

Sharma, Kanta Prasad ^{[1
]}

Kumar, Indradeep ^{[2
]}

Singh, Pavitar Parkash ^{[3
]}

Anbazhagan, K. ^{[4
]}

Albarakati, Hussain Mobarak ^{[5
]}

Bhatt, Mohammed Wasim ^{[6
]}

Ziyadullayevich, Avlokulov Anvar ^{[7
]}

Rana, Arti ^{[8
]}

Sivasankari, S. A. ^{[9
]}

机构：

[1] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India

[2] Inst Aeronaut Engn, Dept Aeronaut Engn, Hyderabad 500043, Telangana, India

[3] Lovely Profess Univ, Dept Management, Phagwara, India

[4] SIMATS, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, India

[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Comp Engn & Networks Dept, Mecca 24382, Saudi Arabia

[6] Model Inst Engn & Technol, Jammu, J&K, India

[7] Tashkent Inst Finance, Dept Audit, Tashkent, Uzbekistan

[8] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun 248007, India

[9] Vignans Fdn Sci Technol & Res, Dept ECE, Guntur 522213, India

来源：

COMPUTERS IN HUMAN BEHAVIOR | 2024年 / 153卷

关键词：

Proximal Policy Optimization; Deep Deterministic Policy Gradient; Reinforcement Learning; Markov Model; Rendezvous and Docking Mission; ARTIFICIAL POTENTIAL-FIELD; SLIDING MODE CONTROL; COLLISION-AVOIDANCE; MANEUVERS;

D O I：

10.1016/j.chb.2023.108110

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

As spacecraft rendezvous and docking missions become increasingly complex, the need for advanced solutions has surged. In recent years, the application of reinforcement learning techniques to tackle spacecraft rendezvous guidance challenges has emerged as a prominent international trend. Vital to ensuring the secure rendezvous and docking of spacecraft is the task of obstacle avoidance. However, traditional reinforcement learning algorithms lack the ability to enforce safety constraints within the exploration space, which presents a formidable obstacle in the design of spacecraft rendezvous guidance strategies. In response to this challenge, a spacecraft rendezvous guidance methodology founded on safety reinforcement learning is proposed. Firstly, a Markov model is crafted for autonomous spacecraft rendezvous in scenarios involving collision avoidance. A reward system, contingent on obstacle warnings and collision avoidance constraints, is introduced to establish a safety reinforcement learning framework for devising spacecraft rendezvous guidance strategies. Secondly, within the framework of safety reinforcement learning, two deep reinforcement learning (DRL) algorithms, Proximal Policy Optimisation (PPO) and Deep Deterministic Policy Gradient (DDPG), are leveraged to generate these guidance strategies. Experimental findings validate the effectiveness of this approach in successfully executing obstacle avoidance and achieving rendezvous with remarkable precision. Furthermore, through an analysis of the performance and generalization capabilities of these two algorithms, the efficacy of the proposed methodology is further underscored. This fusion of advanced space guidance technology with the principles of Ubiquitous Learning marks a significant step forward in the quest for safer and more efficient spacecraft rendezvous and docking operations.

引用

页数：13

共 50 条

[41] Robust trajectory design and guidance for far-range rendezvous using reinforcement learning with safety and observability considerations
Wijayatunga, Minduli Charithma
Armellin, Roberto
Holt, Harry
AEROSPACE SCIENCE AND TECHNOLOGY, 2025, 159
[42] Advancing open, flexible and distance learning through learning analytics
Zhang, Jingjing
Burgos, Daniel
Dawson, Shane
DISTANCE EDUCATION, 2019, 40 (03) : 303 - 308
[43] Accelerating Multiagent Reinforcement Learning through Transfer Learning
da Silva, Felipe Leno
Reali Costa, Anna Helena
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5034 - 5035
[44] Reconnaissance for Reinforcement Learning with Safety Constraints
Maeda, Shin-ichi
Watahiki, Hayato
Ouyang, Yi
Okada, Shintarou
Koyama, Masanori
Nagarajan, Prabhat
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 567 - 582
[45] Learning Mobile Manipulation through Deep Reinforcement Learning
Wang, Cong
Zhang, Qifeng
Tian, Qiyan
Li, Shuo
Wang, Xiaohui
Lane, David
Petillot, Yvan
Wang, Sen
SENSORS, 2020, 20 (03)
[46] Robot docking by reinforcement learning in a visual servoing framework
Martínez-Marín, T
Duckett, T
2004 IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, VOLS 1 AND 2, 2004, : 159 - 164
[47] Robot docking based on omnidirectional vision and reinforcement learning
Muse, D
Weber, C
Wermter, S
RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXII, 2006, : 23 - +
[48] Robot docking based on omnidirectional vision and reinforcement learning
Muse, David
Weber, Cornelius
Wermter, Stefan
KNOWLEDGE-BASED SYSTEMS, 2006, 19 (05) : 324 - 332
[49] Learning decision theoretic utilities through reinforcement learning
Stensmo, M
Sejnowski, TJ
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 1061 - 1067
[50] Optimal Multi-impulse Linear Rendezvous via Reinforcement Learning
Xu, Longwei
Zhang, Gang
Qiu, Shi
Cao, Xibin
SPACE: SCIENCE & TECHNOLOGY, 2023, 3

← 1 2 3 4 5 →