Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

被引:1
|
作者
Sharma, Kanta Prasad [1 ]
Kumar, Indradeep [2 ]
Singh, Pavitar Parkash [3 ]
Anbazhagan, K. [4 ]
Albarakati, Hussain Mobarak [5 ]
Bhatt, Mohammed Wasim [6 ]
Ziyadullayevich, Avlokulov Anvar [7 ]
Rana, Arti [8 ]
Sivasankari, S. A. [9 ]
机构
[1] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India
[2] Inst Aeronaut Engn, Dept Aeronaut Engn, Hyderabad 500043, Telangana, India
[3] Lovely Profess Univ, Dept Management, Phagwara, India
[4] SIMATS, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, India
[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Comp Engn & Networks Dept, Mecca 24382, Saudi Arabia
[6] Model Inst Engn & Technol, Jammu, J&K, India
[7] Tashkent Inst Finance, Dept Audit, Tashkent, Uzbekistan
[8] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun 248007, India
[9] Vignans Fdn Sci Technol & Res, Dept ECE, Guntur 522213, India
关键词
Proximal Policy Optimization; Deep Deterministic Policy Gradient; Reinforcement Learning; Markov Model; Rendezvous and Docking Mission; ARTIFICIAL POTENTIAL-FIELD; SLIDING MODE CONTROL; COLLISION-AVOIDANCE; MANEUVERS;
D O I
10.1016/j.chb.2023.108110
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
As spacecraft rendezvous and docking missions become increasingly complex, the need for advanced solutions has surged. In recent years, the application of reinforcement learning techniques to tackle spacecraft rendezvous guidance challenges has emerged as a prominent international trend. Vital to ensuring the secure rendezvous and docking of spacecraft is the task of obstacle avoidance. However, traditional reinforcement learning algorithms lack the ability to enforce safety constraints within the exploration space, which presents a formidable obstacle in the design of spacecraft rendezvous guidance strategies. In response to this challenge, a spacecraft rendezvous guidance methodology founded on safety reinforcement learning is proposed. Firstly, a Markov model is crafted for autonomous spacecraft rendezvous in scenarios involving collision avoidance. A reward system, contingent on obstacle warnings and collision avoidance constraints, is introduced to establish a safety reinforcement learning framework for devising spacecraft rendezvous guidance strategies. Secondly, within the framework of safety reinforcement learning, two deep reinforcement learning (DRL) algorithms, Proximal Policy Optimisation (PPO) and Deep Deterministic Policy Gradient (DDPG), are leveraged to generate these guidance strategies. Experimental findings validate the effectiveness of this approach in successfully executing obstacle avoidance and achieving rendezvous with remarkable precision. Furthermore, through an analysis of the performance and generalization capabilities of these two algorithms, the efficacy of the proposed methodology is further underscored. This fusion of advanced space guidance technology with the principles of Ubiquitous Learning marks a significant step forward in the quest for safer and more efficient spacecraft rendezvous and docking operations.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
    Callum Wilson
    Annalisa Riccardi
    Optimization and Engineering, 2023, 24 : 223 - 255
  • [32] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
    Wilson, Callum
    Riccardi, Annalisa
    OPTIMIZATION AND ENGINEERING, 2023, 24 (01) : 223 - 255
  • [33] REINFORCEMENT LEARNING FOR SPACECRAFT MANEUVERING NEAR SMALL BODIES
    Willis, Stefan
    Izzo, Dario
    Hennes, Daniel
    SPACEFLIGHT MECHANICS 2016, PTS I-IV, 2016, 158 : 1351 - 1368
  • [34] Vision-based attitude estimation for spacecraft docking operation through deep learning algorithm
    Phisannupawong, Thaweerath
    Kamsing, Patcharin
    Torteeka, Peerapong
    Yooyen, Soemsak
    2020 22ND INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): DIGITAL SECURITY GLOBAL AGENDA FOR SAFE SOCIETY!, 2020, : 280 - 284
  • [35] ADAPTIVE CONTROL BY REINFORCEMENT LEARNING FOR SPACECRAFT ATTITUDE CONTROL
    Ramadan, Mohammad
    Younes, Ahmad Bani
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1805 - 1815
  • [36] Deep Reinforcement Learning for Spacecraft Proximity Operations Guidance
    Hovell, Kirk
    Ulrich, Steve
    JOURNAL OF SPACECRAFT AND ROCKETS, 2021, 58 (02) : 254 - 264
  • [37] Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning
    Hsu, Kai-Chieh
    Rubies-Royo, Vicenc
    Tomlin, Claire J.
    Fisac, Jaime F.
    ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
  • [38] Advancing Aviation Safety Through Machine Learning and Psychophysiological Data: A Systematic Review
    Alreshidi, Ibrahim
    Moulitsas, Irene
    Jenkins, Karl W.
    IEEE ACCESS, 2024, 12 : 5132 - 5150
  • [39] Research on autonomous decision-making method for spacecraft in the mission of rendezvous and approaching to maneuvering target based on deep reinforcement learning
    Huang, Cheng
    Xing, Aijia
    Zeng, Quanli
    Xiong, Fangyu
    ASIAN JOURNAL OF CONTROL, 2025,
  • [40] Learning to flock through reinforcement
    Durve, Mihir
    Perumal, Fernando
    Celani, Antonio
    PHYSICAL REVIEW E, 2020, 102 (01)