Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits

被引:2
|
作者
Yu, Weizhuo [1 ,2 ]
Liu, Chuang [1 ,2 ]
Yue, Xiaokui [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金;
关键词
Pursuit-evasion game; Decision making; Deep deterministic policy gradient; Impulsive maneuver; Elliptical orbit; DYNAMICS; DOCKING;
D O I
10.1016/j.conengprac.2024.106072
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The orbital game theory is a fundamental technology for the cleanup of space debris to improve the safety of useful spacecraft in future, thus, this work develops a decision-making method by reinforcement learning technology to implement the pursuit-evasion game in elliptical orbits. The linearized Tschauner-Hempel equation describes the spacecraft's motion and the problem is formulated by game theory. Subsequently, an impulsive maneuvering model in a complete three-dimensional elliptical orbit is established. Then an algorithm based on deep deterministic policy gradient is designed to solve the optimal strategy for the pursuit-evasion game. For the successful decision of the pursuer, an extensive reward function is designed and improved considering the shortest time, optimal fuel, and collision avoidance. Finally, numerical simulations of a pursuit-evasion mission are performed to demonstrate the effectiveness and superiority of the proposed decision-making algorithm. The game success rate of the algorithm against targets with different maneuvering abilities is verified, which implies that the algorithm can be applied in extended scenarios.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Nonzero-Sum Pursuit-Evasion Game Control for Spacecraft Systems: A Q-Learning Method
    Zheng, Zixuan
    Zhang, Peng
    Yuan, Jianping
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (04) : 3971 - 3981
  • [42] Multi-UAV cooperative maneuver decision-making for pursuit-evasion using improved MADRL
    Luo, Delin
    Fan, Zihao
    Yang, Ziyi
    Xu, Yang
    DEFENCE TECHNOLOGY, 2024, 35 : 187 - 197
  • [43] Game of Drones: UAV Pursuit-Evasion Game With Type-2 Fuzzy Logic Controllers Tuned by Reinforcement Learning
    Camci, Efe
    Kayacan, Erdal
    2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 618 - 625
  • [44] Optimal game theoretic solution of the pursuit-evasion intercept problem using on-policy reinforcement learning
    Kartal, Yusuf
    Subbarao, Kamesh
    Dogan, Atilla
    Lewis, Frank
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (16) : 7886 - 7903
  • [45] Pursuit and Evasion Strategy of a Differential Game Based on Deep Reinforcement Learning
    Xu, Can
    Zhang, Yin
    Wang, Weigang
    Dong, Ligang
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10
  • [46] Transfer reinforcement learning for multi-agent pursuit-evasion differential game with obstacles in a continuous environment
    Hu, Penglin
    Pan, Quan
    Zhao, Chunhui
    Guo, Yaning
    ASIAN JOURNAL OF CONTROL, 2024, 26 (04) : 2125 - 2140
  • [47] A DIMENSION-REDUCTION METHOD FOR THE FINITE-HORIZON SPACECRAFT PURSUIT-EVASION GAME
    Qi-Shuai Wang
    Li, Pei
    Lei, Ting
    Xiao-Feng Liu
    Guo-Ping Cai
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2023, 19 (03) : 1983 - 1998
  • [48] SPACECRAFT DECISION-MAKING AUTONOMY USING DEEP REINFORCEMENT LEARNING
    Harris, Andrew
    Teil, Thibaud
    Schaub, Hanspeter
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1757 - 1775
  • [49] Escape Strategy Based on Apollonius Circles in the Pursuit-Evasion Game
    Huang, Yuting
    Luo, Yifan
    Nie, Yuhan
    Hou, Tianle
    Fu, Xiaowei
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2143 - 2153
  • [50] A Visibility-Based Pursuit-Evasion Game with a Circular Obstacle
    Sourabh Bhattacharya
    Tamer Başar
    Naira Hovakimyan
    Journal of Optimization Theory and Applications, 2016, 171 : 1071 - 1082