Reinforcement learning in dual-arm trajectory planning for a free-floating space robot

被引:95
|
作者
Wu, Yun-Hua [1 ]
Yu, Zhi-Cheng [1 ]
Li, Chao-Yong [2 ]
He, Meng-Jie [1 ]
Hua, Bing [1 ]
Chen, Zhi-Ming [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Sch Astronaut, 29 Yudao St, Nanjing 210016, Peoples R China
[2] Zhejiang Univ, Coll Elect Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
On-orbit servicing; Free-floating space robot; Dual-arm trajectory planning; Reinforcement learning; Fixed and moving targets; MANEUVER;
D O I
10.1016/j.ast.2019.105657
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
A free-floating space robot exhibits strong dynamic coupling between the arm and the base, and the resulting position of the end of the arm depends not only on the joint angles but also on the state of the base. Dynamic modeling is complicated for multiple degree of freedom (DOF) manipulators, especially for a space robot with two arms. Therefore, the trajectories are typically planned offline and tracked online. However, this approach is not suitable if the target has relative motion with respect to the servicing space robot. To handle this issue, a model-free reinforcement learning strategy is proposed for training a policy for online trajectory planning without establishing the dynamic and kinematic models of the space robot. The model-free learning algorithm learns a policy that maps states to actions via trial and error in a simulation environment. With the learned policy, which is represented by a feedforward neural network with 2 hidden layers, the space robot can schedule and perform actions quickly and can be implemented for real-time applications. The feasibility of the trained policy is demonstrated for both fixed and moving targets. (C) 2020 Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Dynamic Modeling and Improved Nonlinear Model Predictive Control of a Free-Floating Dual-Arm Space Robot
    Guo, Zhenhao
    Ju, Hehua
    Lu, Chenxin
    Wang, Kaimeng
    APPLIED SCIENCES-BASEL, 2024, 14 (08):
  • [22] Dual-Arm Robot Trajectory Planning Based on Deep Reinforcement Learning under Complex Environment
    Tang, Wanxing
    Cheng, Chuang
    Ai, Haiping
    Chen, Li
    MICROMACHINES, 2022, 13 (04)
  • [23] Trajectory Tracking for a Dual-Arm Free-Floating Space Robot With a Class of General Nonsingular Predefined-Time Terminal Sliding Mode
    Liu, Yicheng
    Yan, Wen
    Zhang, Tao
    Yu, Chunxiao
    Tu, Haiyan
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (05): : 3273 - 3286
  • [24] KINEMATICS, MOTION PLANNING AND DESIGN OF A FREE-FLOATING DUAL-ARM PLANAR MANIPULATOR
    AGRAWAL, SK
    SHIRUMALLA, S
    MECHANISM AND MACHINE THEORY, 1994, 29 (05) : 691 - 700
  • [25] Acceleration-level trajectory planning for a dual-arm space robot
    Xie, Kedi
    Lan, Weiyao
    IFAC PAPERSONLINE, 2019, 52 (24): : 243 - 248
  • [26] A Multi-Target Trajectory Planning of a 6-DoF Free-Floating Space Robot via Reinforcement Learning
    Wang, Shengjie
    Zheng, Xiang
    Cao, Yuxue
    Zhang, Tao
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3724 - 3730
  • [27] Dynamic control of a free-floating flexible dual-arm space robotic system
    College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
    Jixie Gongcheng Xuebao, 2007, 10 (196-200):
  • [28] Minimum torque trajectory planning algorithm for Free-Floating Space Robot
    Hu, Qing-Lei
    Wang, Yong-Zhi
    Shi, Zhong
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2011, 43 (11): : 20 - 24
  • [29] Autonomous trajectory planning of free-floating robot for capturing space target
    Li, Cheng
    Liang, Bin
    Xu, Wenfu
    2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 1008 - +
  • [30] Trajectory planning of free-floating space robot with avoidance of dynamic singularity
    Zhang, Fuhai
    Fu, Yili
    Wang, Shuguo
    Jiqiren/Robot, 2012, 34 (01): : 38 - 43