Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

Cited by: 1
Authors
Sharma, Kanta Prasad [1]
Kumar, Indradeep [2]
Singh, Pavitar Parkash [3]
Anbazhagan, K. [4]
Albarakati, Hussain Mobarak [5]
Bhatt, Mohammed Wasim [6]
Ziyadullayevich, Avlokulov Anvar [7]
Rana, Arti [8]
Sivasankari, S. A. [9]
Affiliations
[1] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India
[2] Inst Aeronaut Engn, Dept Aeronaut Engn, Hyderabad 500043, Telangana, India
[3] Lovely Profess Univ, Dept Management, Phagwara, India
[4] SIMATS, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, India
[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Comp Engn & Networks Dept, Mecca 24382, Saudi Arabia
[6] Model Inst Engn & Technol, Jammu, J&K, India
[7] Tashkent Inst Finance, Dept Audit, Tashkent, Uzbekistan
[8] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun 248007, India
[9] Vignans Fdn Sci Technol & Res, Dept ECE, Guntur 522213, India
Keywords
Proximal Policy Optimization; Deep Deterministic Policy Gradient; Reinforcement Learning; Markov Model; Rendezvous and Docking Mission; Artificial Potential Field; Sliding Mode Control; Collision Avoidance; Maneuvers
DOI
10.1016/j.chb.2023.108110
Chinese Library Classification (CLC) number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
As spacecraft rendezvous and docking missions become increasingly complex, the need for advanced solutions has surged. In recent years, the application of reinforcement learning techniques to tackle spacecraft rendezvous guidance challenges has emerged as a prominent international trend. Vital to ensuring the secure rendezvous and docking of spacecraft is the task of obstacle avoidance. However, traditional reinforcement learning algorithms lack the ability to enforce safety constraints within the exploration space, which presents a formidable obstacle in the design of spacecraft rendezvous guidance strategies. In response to this challenge, a spacecraft rendezvous guidance methodology founded on safety reinforcement learning is proposed. First, a Markov model is crafted for autonomous spacecraft rendezvous in scenarios involving collision avoidance. A reward system, contingent on obstacle warnings and collision avoidance constraints, is introduced to establish a safety reinforcement learning framework for devising spacecraft rendezvous guidance strategies. Second, within the framework of safety reinforcement learning, two deep reinforcement learning (DRL) algorithms, Proximal Policy Optimization (PPO) and Deep Deterministic Policy Gradient (DDPG), are leveraged to generate these guidance strategies. Experimental findings validate the effectiveness of this approach in successfully executing obstacle avoidance and achieving rendezvous with remarkable precision. Furthermore, through an analysis of the performance and generalization capabilities of these two algorithms, the efficacy of the proposed methodology is further underscored. This fusion of advanced space guidance technology with the principles of Ubiquitous Learning marks a significant step forward in the quest for safer and more efficient spacecraft rendezvous and docking operations.
Pages: 13