Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

被引：1

作者：

Sharma, Kanta Prasad ^{[1
]}

Kumar, Indradeep ^{[2
]}

Singh, Pavitar Parkash ^{[3
]}

Anbazhagan, K. ^{[4
]}

Albarakati, Hussain Mobarak ^{[5
]}

Bhatt, Mohammed Wasim ^{[6
]}

Ziyadullayevich, Avlokulov Anvar ^{[7
]}

Rana, Arti ^{[8
]}

Sivasankari, S. A. ^{[9
]}

机构：

[1] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India

[2] Inst Aeronaut Engn, Dept Aeronaut Engn, Hyderabad 500043, Telangana, India

[3] Lovely Profess Univ, Dept Management, Phagwara, India

[4] SIMATS, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, India

[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Comp Engn & Networks Dept, Mecca 24382, Saudi Arabia

[6] Model Inst Engn & Technol, Jammu, J&K, India

[7] Tashkent Inst Finance, Dept Audit, Tashkent, Uzbekistan

[8] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun 248007, India

[9] Vignans Fdn Sci Technol & Res, Dept ECE, Guntur 522213, India

来源：

COMPUTERS IN HUMAN BEHAVIOR | 2024年 / 153卷

关键词：

Proximal Policy Optimization; Deep Deterministic Policy Gradient; Reinforcement Learning; Markov Model; Rendezvous and Docking Mission; ARTIFICIAL POTENTIAL-FIELD; SLIDING MODE CONTROL; COLLISION-AVOIDANCE; MANEUVERS;

D O I：

10.1016/j.chb.2023.108110

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

As spacecraft rendezvous and docking missions become increasingly complex, the need for advanced solutions has surged. In recent years, the application of reinforcement learning techniques to tackle spacecraft rendezvous guidance challenges has emerged as a prominent international trend. Vital to ensuring the secure rendezvous and docking of spacecraft is the task of obstacle avoidance. However, traditional reinforcement learning algorithms lack the ability to enforce safety constraints within the exploration space, which presents a formidable obstacle in the design of spacecraft rendezvous guidance strategies. In response to this challenge, a spacecraft rendezvous guidance methodology founded on safety reinforcement learning is proposed. Firstly, a Markov model is crafted for autonomous spacecraft rendezvous in scenarios involving collision avoidance. A reward system, contingent on obstacle warnings and collision avoidance constraints, is introduced to establish a safety reinforcement learning framework for devising spacecraft rendezvous guidance strategies. Secondly, within the framework of safety reinforcement learning, two deep reinforcement learning (DRL) algorithms, Proximal Policy Optimisation (PPO) and Deep Deterministic Policy Gradient (DDPG), are leveraged to generate these guidance strategies. Experimental findings validate the effectiveness of this approach in successfully executing obstacle avoidance and achieving rendezvous with remarkable precision. Furthermore, through an analysis of the performance and generalization capabilities of these two algorithms, the efficacy of the proposed methodology is further underscored. This fusion of advanced space guidance technology with the principles of Ubiquitous Learning marks a significant step forward in the quest for safer and more efficient spacecraft rendezvous and docking operations.

引用

页数：13

共 50 条

[21] Educational Principles in Constructivism for Ubiquitous Based Learning
Cha, Sung-Hyun
Seo, Kum-Taek
Shin, Gi-Wang
UBIQUITOUS COMPUTING AND MULTIMEDIA APPLICATIONS, PT I, 2011, 150 : 283 - 289
[22] Advancing RAN Slicing with Offline Reinforcement Learning
Yang, Kun
Yeh, Shu-ping
Zhang, Menglei
Sydir, Jerry
Yang, Jing
Shen, Cong
2024 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS, DYSPAN 2024, 2024, : 331 - 338
[23] Safety Margins for Reinforcement Learning
Grushin, Alexander
Woods, Walt
Velasquez, Alvaro
Khan, Simon
2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 42 - 43
[24] Design principles for advancing higher education sustainability learning through transformative research
Bernert, Philip
Wanner, Matthias
Fischer, Nele
Barth, Matthias
ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2022,
[25] Learning User Preferences in Ubiquitous Systems: A User Study and a Reinforcement Learning Approach
Zaidenberg, Sofia
Reignier, Patrick
Mandran, Nadine
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 336 - +
[26] Reinforcement Learning of Context Models for a Ubiquitous Personal Assistant
Zaidenberg, Sofia
Reignier, Patrick
Crowley, James L.
3rd Symposium of Ubiquitous Computing and Ambient Intelligence 2008, 2009, 51 : 254 - 264
[27] Learning Aerial Docking via Offline-to-Online Reinforcement Learning
Tao, Yang
Feng Yuting
Yu, Yushu
2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS, ICCCR 2024, 2024, : 305 - 309
[28] Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous
Wang, Rose E.
Kew, J. Chase
Lee, Dennis
Lee, Tsang-Wei Edward
Zhang, Tingnan
Ichter, Brian
Tan, Jie
Faust, Aleksandra
CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 711 - 725
[29] LEARNING NETWORK REPRESENTATION THROUGH REINFORCEMENT LEARNING
Shen, Siqi
Fu, Yongquan
Jia, Adele Lu
Su, Huayou
Wang, Qinglin
Wang, Chengsong
Dou, Yong
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3537 - 3541
[30] Safety reinforcement learning control via transfer learning
Zhang, Quanqi
Wu, Chengwei
Tian, Haoyu
Gao, Yabin
Yao, Weiran
Wu, Ligang
AUTOMATICA, 2024, 166

← 1 2 3 4 5 →