Reinforcement learning in dual-arm trajectory planning for a free-floating space robot

被引：95

作者：

Wu, Yun-Hua ^{[1
]}

Yu, Zhi-Cheng ^{[1
]}

Li, Chao-Yong ^{[2
]}

He, Meng-Jie ^{[1
]}

Hua, Bing ^{[1
]}

Chen, Zhi-Ming ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Sch Astronaut, 29 Yudao St, Nanjing 210016, Peoples R China

[2] Zhejiang Univ, Coll Elect Engn, Hangzhou 310027, Peoples R China

来源：

AEROSPACE SCIENCE AND TECHNOLOGY | 2020年 / 98卷

基金：

中国国家自然科学基金;

关键词：

On-orbit servicing; Free-floating space robot; Dual-arm trajectory planning; Reinforcement learning; Fixed and moving targets; MANEUVER;

D O I：

10.1016/j.ast.2019.105657

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

A free-floating space robot exhibits strong dynamic coupling between the arm and the base, and the resulting position of the end of the arm depends not only on the joint angles but also on the state of the base. Dynamic modeling is complicated for multiple degree of freedom (DOF) manipulators, especially for a space robot with two arms. Therefore, the trajectories are typically planned offline and tracked online. However, this approach is not suitable if the target has relative motion with respect to the servicing space robot. To handle this issue, a model-free reinforcement learning strategy is proposed for training a policy for online trajectory planning without establishing the dynamic and kinematic models of the space robot. The model-free learning algorithm learns a policy that maps states to actions via trial and error in a simulation environment. With the learned policy, which is represented by a feedforward neural network with 2 hidden layers, the space robot can schedule and perform actions quickly and can be implemented for real-time applications. The feasibility of the trained policy is demonstrated for both fixed and moving targets. (C) 2020 Elsevier Masson SAS. All rights reserved.

引用

页数：10

共 50 条

[21] Dynamic Modeling and Improved Nonlinear Model Predictive Control of a Free-Floating Dual-Arm Space Robot
Guo, Zhenhao
Ju, Hehua
Lu, Chenxin
Wang, Kaimeng
APPLIED SCIENCES-BASEL, 2024, 14 (08):
[22] Dual-Arm Robot Trajectory Planning Based on Deep Reinforcement Learning under Complex Environment
Tang, Wanxing
Cheng, Chuang
Ai, Haiping
Chen, Li
MICROMACHINES, 2022, 13 (04)
[23] Trajectory Tracking for a Dual-Arm Free-Floating Space Robot With a Class of General Nonsingular Predefined-Time Terminal Sliding Mode
Liu, Yicheng
Yan, Wen
Zhang, Tao
Yu, Chunxiao
Tu, Haiyan
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (05): : 3273 - 3286
[24] KINEMATICS, MOTION PLANNING AND DESIGN OF A FREE-FLOATING DUAL-ARM PLANAR MANIPULATOR
AGRAWAL, SK
SHIRUMALLA, S
MECHANISM AND MACHINE THEORY, 1994, 29 (05) : 691 - 700
[25] Acceleration-level trajectory planning for a dual-arm space robot
Xie, Kedi
Lan, Weiyao
IFAC PAPERSONLINE, 2019, 52 (24): : 243 - 248
[26] A Multi-Target Trajectory Planning of a 6-DoF Free-Floating Space Robot via Reinforcement Learning
Wang, Shengjie
Zheng, Xiang
Cao, Yuxue
Zhang, Tao
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3724 - 3730
[27] Dynamic control of a free-floating flexible dual-arm space robotic system
College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
Jixie Gongcheng Xuebao, 2007, 10 (196-200):
[28] Minimum torque trajectory planning algorithm for Free-Floating Space Robot
Hu, Qing-Lei
Wang, Yong-Zhi
Shi, Zhong
Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2011, 43 (11): : 20 - 24
[29] Autonomous trajectory planning of free-floating robot for capturing space target
Li, Cheng
Liang, Bin
Xu, Wenfu
2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 1008 - +
[30] Trajectory planning of free-floating space robot with avoidance of dynamic singularity
Zhang, Fuhai
Fu, Yili
Wang, Shuguo
Jiqiren/Robot, 2012, 34 (01): : 38 - 43

← 1 2 3 4 5 →