Reinforcement learning in dual-arm trajectory planning for a free-floating space robot

被引：95

作者：

Wu, Yun-Hua ^{[1
]}

Yu, Zhi-Cheng ^{[1
]}

Li, Chao-Yong ^{[2
]}

He, Meng-Jie ^{[1
]}

Hua, Bing ^{[1
]}

Chen, Zhi-Ming ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Sch Astronaut, 29 Yudao St, Nanjing 210016, Peoples R China

[2] Zhejiang Univ, Coll Elect Engn, Hangzhou 310027, Peoples R China

来源：

AEROSPACE SCIENCE AND TECHNOLOGY | 2020年 / 98卷

基金：

中国国家自然科学基金;

关键词：

On-orbit servicing; Free-floating space robot; Dual-arm trajectory planning; Reinforcement learning; Fixed and moving targets; MANEUVER;

D O I：

10.1016/j.ast.2019.105657

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

A free-floating space robot exhibits strong dynamic coupling between the arm and the base, and the resulting position of the end of the arm depends not only on the joint angles but also on the state of the base. Dynamic modeling is complicated for multiple degree of freedom (DOF) manipulators, especially for a space robot with two arms. Therefore, the trajectories are typically planned offline and tracked online. However, this approach is not suitable if the target has relative motion with respect to the servicing space robot. To handle this issue, a model-free reinforcement learning strategy is proposed for training a policy for online trajectory planning without establishing the dynamic and kinematic models of the space robot. The model-free learning algorithm learns a policy that maps states to actions via trial and error in a simulation environment. With the learned policy, which is represented by a feedforward neural network with 2 hidden layers, the space robot can schedule and perform actions quickly and can be implemented for real-time applications. The feasibility of the trained policy is demonstrated for both fixed and moving targets. (C) 2020 Elsevier Masson SAS. All rights reserved.

引用

页数：10

共 50 条

[41] Free-floating flexible space robot trajectory tracking
Hu, Q.-L. (huqinglei@hit.edu.cn), 1600, Harbin Institute of Technology (44):
[42] Trajectory planning of a dual-arm space robot for target capturing with minimizing base disturbance
Xue, Zhihui
Zhang, Xin
Liu, Jinguo
ADVANCES IN SPACE RESEARCH, 2023, 72 (06) : 2091 - 2108
[43] Optimal trajectory planning for cooperative control of dual-arm robot
Park C.
Ha H.
Lee J.
Journal of Institute of Control, Robotics and Systems, 2010, 16 (09) : 891 - 897
[44] Trajectory planning of free-floating space robot using Particle Swarm Optimization (PSO)
Wang, Mingming
Luo, Jianjun
Walter, Ulrich
ACTA ASTRONAUTICA, 2015, 112 : 77 - 88
[45] Self-collision Avoidance Trajectory Planning and Robust Control of a Dual-arm Space Robot
Liu, Yicheng
Yu, Chunxiao
Sheng, Jingyuan
Zhang, Tao
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2018, 16 (06) : 2896 - 2905
[46] Self-collision Avoidance Trajectory Planning and Robust Control of a Dual-arm Space Robot
Yicheng Liu
Chunxiao Yu
Jingyuan Sheng
Tao Zhang
International Journal of Control, Automation and Systems, 2018, 16 : 2896 - 2905
[47] Coordinated trajectory planning of dual-arm space robot using constrained particle swarm optimization
Wang, Mingming
Luo, Jianjun
Yuan, Jianping
Walter, Ulrich
ACTA ASTRONAUTICA, 2018, 146 : 259 - 272
[48] Constrained motion planning of free-float dual-arm space manipulator via deep reinforcement learning
Li, Yinkang
Hao, Xiaolong
She, Yuchen
Li, Shuang
Yu, Meng
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 109
[49] Hybrid simulation of a dual-arm space robot colliding with a floating object
Takahashi, Ryohei
Ise, Hiroto
Konno, Atsushi
Uchiyama, Masaru
Sato, Daisuke
2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1201 - +
[50] Research on trajectory planning method of dual-arm robot based on ROS
Cong, Yongzheng
Jiang, Congrang
Liu, Hui
Du, Haibo
Gan, Yahui
Jiang, Canghua
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2616 - 2621

← 1 2 3 4 5 →