Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

被引:1
|
作者
Sharma, Kanta Prasad [1 ]
Kumar, Indradeep [2 ]
Singh, Pavitar Parkash [3 ]
Anbazhagan, K. [4 ]
Albarakati, Hussain Mobarak [5 ]
Bhatt, Mohammed Wasim [6 ]
Ziyadullayevich, Avlokulov Anvar [7 ]
Rana, Arti [8 ]
Sivasankari, S. A. [9 ]
机构
[1] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India
[2] Inst Aeronaut Engn, Dept Aeronaut Engn, Hyderabad 500043, Telangana, India
[3] Lovely Profess Univ, Dept Management, Phagwara, India
[4] SIMATS, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, India
[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Comp Engn & Networks Dept, Mecca 24382, Saudi Arabia
[6] Model Inst Engn & Technol, Jammu, J&K, India
[7] Tashkent Inst Finance, Dept Audit, Tashkent, Uzbekistan
[8] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun 248007, India
[9] Vignans Fdn Sci Technol & Res, Dept ECE, Guntur 522213, India
关键词
Proximal Policy Optimization; Deep Deterministic Policy Gradient; Reinforcement Learning; Markov Model; Rendezvous and Docking Mission; ARTIFICIAL POTENTIAL-FIELD; SLIDING MODE CONTROL; COLLISION-AVOIDANCE; MANEUVERS;
D O I
10.1016/j.chb.2023.108110
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
As spacecraft rendezvous and docking missions become increasingly complex, the need for advanced solutions has surged. In recent years, the application of reinforcement learning techniques to tackle spacecraft rendezvous guidance challenges has emerged as a prominent international trend. Vital to ensuring the secure rendezvous and docking of spacecraft is the task of obstacle avoidance. However, traditional reinforcement learning algorithms lack the ability to enforce safety constraints within the exploration space, which presents a formidable obstacle in the design of spacecraft rendezvous guidance strategies. In response to this challenge, a spacecraft rendezvous guidance methodology founded on safety reinforcement learning is proposed. Firstly, a Markov model is crafted for autonomous spacecraft rendezvous in scenarios involving collision avoidance. A reward system, contingent on obstacle warnings and collision avoidance constraints, is introduced to establish a safety reinforcement learning framework for devising spacecraft rendezvous guidance strategies. Secondly, within the framework of safety reinforcement learning, two deep reinforcement learning (DRL) algorithms, Proximal Policy Optimisation (PPO) and Deep Deterministic Policy Gradient (DDPG), are leveraged to generate these guidance strategies. Experimental findings validate the effectiveness of this approach in successfully executing obstacle avoidance and achieving rendezvous with remarkable precision. Furthermore, through an analysis of the performance and generalization capabilities of these two algorithms, the efficacy of the proposed methodology is further underscored. This fusion of advanced space guidance technology with the principles of Ubiquitous Learning marks a significant step forward in the quest for safer and more efficient spacecraft rendezvous and docking operations.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Robust trajectory design and guidance for far-range rendezvous using reinforcement learning with safety and observability considerations
    Wijayatunga, Minduli Charithma
    Armellin, Roberto
    Holt, Harry
    AEROSPACE SCIENCE AND TECHNOLOGY, 2025, 159
  • [42] Advancing open, flexible and distance learning through learning analytics
    Zhang, Jingjing
    Burgos, Daniel
    Dawson, Shane
    DISTANCE EDUCATION, 2019, 40 (03) : 303 - 308
  • [43] Accelerating Multiagent Reinforcement Learning through Transfer Learning
    da Silva, Felipe Leno
    Reali Costa, Anna Helena
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5034 - 5035
  • [44] Reconnaissance for Reinforcement Learning with Safety Constraints
    Maeda, Shin-ichi
    Watahiki, Hayato
    Ouyang, Yi
    Okada, Shintarou
    Koyama, Masanori
    Nagarajan, Prabhat
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 567 - 582
  • [45] Learning Mobile Manipulation through Deep Reinforcement Learning
    Wang, Cong
    Zhang, Qifeng
    Tian, Qiyan
    Li, Shuo
    Wang, Xiaohui
    Lane, David
    Petillot, Yvan
    Wang, Sen
    SENSORS, 2020, 20 (03)
  • [46] Robot docking by reinforcement learning in a visual servoing framework
    Martínez-Marín, T
    Duckett, T
    2004 IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, VOLS 1 AND 2, 2004, : 159 - 164
  • [47] Robot docking based on omnidirectional vision and reinforcement learning
    Muse, D
    Weber, C
    Wermter, S
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXII, 2006, : 23 - +
  • [48] Robot docking based on omnidirectional vision and reinforcement learning
    Muse, David
    Weber, Cornelius
    Wermter, Stefan
    KNOWLEDGE-BASED SYSTEMS, 2006, 19 (05) : 324 - 332
  • [49] Learning decision theoretic utilities through reinforcement learning
    Stensmo, M
    Sejnowski, TJ
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 1061 - 1067
  • [50] Optimal Multi-impulse Linear Rendezvous via Reinforcement Learning
    Xu, Longwei
    Zhang, Gang
    Qiu, Shi
    Cao, Xibin
    SPACE: SCIENCE & TECHNOLOGY, 2023, 3