Reinforcement learning in dual-arm trajectory planning for a free-floating space robot

Cited by: 95
Authors
Wu, Yun-Hua [1]
Yu, Zhi-Cheng [1]
Li, Chao-Yong [2]
He, Meng-Jie [1]
Hua, Bing [1]
Chen, Zhi-Ming [1]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Sch Astronaut, 29 Yudao St, Nanjing 210016, Peoples R China
[2] Zhejiang Univ, Coll Elect Engn, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
On-orbit servicing; Free-floating space robot; Dual-arm trajectory planning; Reinforcement learning; Fixed and moving targets; MANEUVER;
DOI
10.1016/j.ast.2019.105657
Chinese Library Classification number
V [Aeronautics and Astronautics]
Discipline classification code
08; 0825
Abstract
A free-floating space robot exhibits strong dynamic coupling between the arm and the base, so the position of the end of the arm depends not only on the joint angles but also on the state of the base. Dynamic modeling is complicated for multi-degree-of-freedom (DOF) manipulators, especially for a space robot with two arms. Trajectories are therefore typically planned offline and tracked online. However, this approach is not suitable if the target moves relative to the servicing space robot. To handle this issue, a model-free reinforcement learning strategy is proposed that trains a policy for online trajectory planning without establishing the dynamic and kinematic models of the space robot. The model-free learning algorithm learns a policy that maps states to actions via trial and error in a simulation environment. With the learned policy, which is represented by a feedforward neural network with two hidden layers, the space robot can schedule and perform actions quickly, making the approach suitable for real-time applications. The feasibility of the trained policy is demonstrated for both fixed and moving targets. © 2020 Elsevier Masson SAS. All rights reserved.
Pages: 10
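The abstract describes the learned policy as a feedforward neural network with two hidden layers that maps states to actions. The sketch below illustrates only that structure, in plain Python. Everything specific in it is an assumption made for illustration and is not taken from the paper: the class name `PolicyNetwork`, the hidden-layer width of 64, the ReLU and tanh activations, the `max_joint_rate` bound, and the state and action dimensions used in the example. The weights are random; the model-free training by trial and error in simulation is not shown.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases); weights[j][i] connects input i to output j."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, layer, activation):
    """Apply one fully connected layer followed by an element-wise activation."""
    weights, biases = layer
    return [activation(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, biases)]


def relu(z):
    return max(0.0, z)


class PolicyNetwork:
    """Hypothetical state-to-action policy with two hidden layers."""

    def __init__(self, state_dim, action_dim, hidden=(64, 64),
                 max_joint_rate=0.1, seed=0):
        rng = random.Random(seed)
        sizes = [state_dim, *hidden, action_dim]
        # Three weight layers: input->hidden1, hidden1->hidden2, hidden2->output.
        self.layers = [init_layer(a, b, rng)
                       for a, b in zip(sizes[:-1], sizes[1:])]
        self.max_joint_rate = max_joint_rate  # assumed action bound

    def act(self, state):
        """Map a state vector to a bounded action vector in one forward pass."""
        h = dense(state, self.layers[0], relu)
        h = dense(h, self.layers[1], relu)
        out = dense(h, self.layers[2], math.tanh)
        # tanh keeps each output in [-1, 1]; scale to the assumed bound.
        return [self.max_joint_rate * o for o in out]


# Example with assumed dimensions: an 18-element state, 12 joint commands.
policy = PolicyNetwork(state_dim=18, action_dim=12)
action = policy.act([0.05 * i for i in range(18)])
```

Because `act` is a single forward pass through three small layers, evaluating the policy is cheap, which is consistent with the abstract's point that a learned policy can be queried online at each planning step.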