Reinforcement learning in dual-arm trajectory planning for a free-floating space robot

被引:95
|
作者
Wu, Yun-Hua [1 ]
Yu, Zhi-Cheng [1 ]
Li, Chao-Yong [2 ]
He, Meng-Jie [1 ]
Hua, Bing [1 ]
Chen, Zhi-Ming [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Sch Astronaut, 29 Yudao St, Nanjing 210016, Peoples R China
[2] Zhejiang Univ, Coll Elect Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
On-orbit servicing; Free-floating space robot; Dual-arm trajectory planning; Reinforcement learning; Fixed and moving targets; MANEUVER;
D O I
10.1016/j.ast.2019.105657
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
A free-floating space robot exhibits strong dynamic coupling between the arm and the base, and the resulting position of the end of the arm depends not only on the joint angles but also on the state of the base. Dynamic modeling is complicated for multiple degree of freedom (DOF) manipulators, especially for a space robot with two arms. Therefore, the trajectories are typically planned offline and tracked online. However, this approach is not suitable if the target has relative motion with respect to the servicing space robot. To handle this issue, a model-free reinforcement learning strategy is proposed for training a policy for online trajectory planning without establishing the dynamic and kinematic models of the space robot. The model-free learning algorithm learns a policy that maps states to actions via trial and error in a simulation environment. With the learned policy, which is represented by a feedforward neural network with 2 hidden layers, the space robot can schedule and perform actions quickly and can be implemented for real-time applications. The feasibility of the trained policy is demonstrated for both fixed and moving targets. (C) 2020 Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Modelling and Control of Dual-Arm Free-Floating Space Robot Using Virtual Decomposition Control for Capturing Target
    Wang, Xuectian
    Xia, Bo
    Li, Gang
    Liu, Houde
    Liang, Bin
    2017 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE ROBIO 2017), 2017, : 2021 - 2026
  • [32] PLANNING MOTIONS OF A DUAL-ARM FREE-FLOATING MANIPULATOR KEEPING AND BASE INERTIALLY FIXED
    AGRAWAL, SK
    SHIRUMALLA, S
    MECHANISM AND MACHINE THEORY, 1995, 30 (01) : 59 - 70
  • [33] Dynamic Modeling and Control Optimization of Free-Floating Dual-Arm Space Robots in Task Space
    Rodrigues, Gabriel S.
    Pazelli, Tatiana F. P. A. T.
    2021 LATIN AMERICAN ROBOTICS SYMPOSIUM / 2021 BRAZILIAN SYMPOSIUM ON ROBOTICS / 2021 WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2021), 2021, : 168 - 173
  • [34] Base attitude disturbance minimizing trajectory planning for a dual-arm space robot
    Zhou, Qing
    Liu, Xiaofeng
    Cai, Guoping
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2022, 236 (04) : 704 - 721
  • [35] Coordinated trajectory planning of a dual-arm space robot with multiple avoidance constraints
    Ni, Shihao
    Chen, Weidong
    Ju, Hehua
    Chen, Ti
    ACTA ASTRONAUTICA, 2022, 195 : 379 - 391
  • [36] Optimal trajectory planning of a flexible dual-arm space robot with vibration reduction
    Wu, H
    Sun, FC
    Sun, ZQ
    Wu, LC
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2004, 40 (02) : 147 - 163
  • [37] Optimal Trajectory Planning of a Flexible Dual-Arm Space Robot with Vibration Reduction
    Hao Wu
    Fuchun Sun
    Zengqi Sun
    Licheng Wu
    Journal of Intelligent and Robotic Systems, 2004, 40 : 147 - 163
  • [38] Study on trajectory planning of dual-arm space robot keeping the base stabilized
    Xu, W.-F. (wfxu@hit.edu.cn), 1600, Science Press (39):
  • [39] Trajectory Planning of Free-floating Space Robot Using an Improved PSO Algorithm
    Zhu, Zhanxia
    Zhong, Jianfei
    Jing, Sa
    Tang, Biwei
    PROCEEDINGS OF 2018 IEEE 4TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2018), 2018, : 580 - 585
  • [40] Cartesian Trajectory Planning of Free-Floating Space Robot with Dynamic Singularities Avoidance
    Jin R.-Y.
    Geng Y.-H.
    Yuhang Xuebao/Journal of Astronautics, 2020, 41 (08): : 989 - 999