Robotic Grasp Detection With 6-D Pose Estimation Based on Graph Convolution and Refinement

Cited by: 4

Authors:

Yu, Sheng ^{[1]}

Zhai, Di-Hua ^{[1,2]}

Xia, Yuanqing ^{[1,3]}

Wang, Wei ^{[4]}

Zhang, Chengyu ^{[4]}

Zhao, Shiqi ^{[4]}

Affiliations:

[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China

[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314001, Peoples R China

[3] Zhongyuan Univ Technol, Zhengzhou 450007, Henan, Peoples R China

[4] China United Network Commun Corp Ltd, Res Inst, Beijing 100176, Peoples R China

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024 / Vol. 54 / No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

Convolution network; grasp detection; pose estimation; robot; transformer;

DOI:

10.1109/TSMC.2024.3371580

Chinese Library Classification number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Six-dimensional (6-D) object pose estimation plays a critical role in robotic grasping, which is widely used in manufacturing. Current state-of-the-art pose estimation techniques rely primarily on keypoint matching. Typically, these methods establish correspondences between 2-D keypoints in an image and their counterparts on a 3-D object model, and then use the PnP-RANSAC algorithm to determine the 6-D pose of the object. However, this approach is not end-to-end trainable and may encounter difficulties in scenarios that require differentiable poses, while direct end-to-end regression methods often yield inferior results. To tackle these problems, we present GR6D, a keypoint- and graph-convolution-based neural network for differentiable pose estimation from RGB-D data. First, we propose a multiscale fusion method that uses convolution and graph convolution to exploit the information contained in RGB and depth images. Additionally, we propose a transformer-based pose refinement module to further adjust features from RGB images and point clouds. We evaluate GR6D on three datasets: 1) LINEMOD; 2) Occlusion LINEMOD; and 3) YCB-Video, on which it outperforms most state-of-the-art methods. Finally, we apply GR6D to pose estimation and robotic grasping tasks in the real world, where it demonstrates superior performance.
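
Illustrative note: the correspondence-based pipeline mentioned in the abstract rests on a simple forward model. A candidate pose (R, t) projects 3-D model keypoints into the image, and a RANSAC-style solver counts how many projections land close to the detected 2-D keypoints. The following is a minimal sketch of that scoring step only, not the GR6D network and not a full PnP solver. The function names, the pinhole intrinsics, and the 2-pixel threshold are assumptions chosen for illustration.

```python
import math

def project(points_3d, R, t, fx, fy, cx, cy):
    """Project model-frame 3-D points to pixel coordinates under pose (R, t).

    Assumes a distortion-free pinhole camera and that every transformed
    point lies in front of the camera (positive depth).
    """
    pixels = []
    for X in points_3d:
        # Camera-frame point: Xc = R * X + t
        xc = [sum(R[i][j] * X[j] for j in range(3)) + t[i] for i in range(3)]
        u = fx * xc[0] / xc[2] + cx
        v = fy * xc[1] / xc[2] + cy
        pixels.append((u, v))
    return pixels

def count_inliers(points_3d, pixels_2d, R, t, intrinsics, thresh_px=2.0):
    """Count correspondences whose reprojection error is below thresh_px."""
    projected = project(points_3d, R, t, *intrinsics)
    return sum(
        1
        for (u, v), (uo, vo) in zip(projected, pixels_2d)
        if math.hypot(u - uo, v - vo) < thresh_px
    )

# Hypothetical data: three model keypoints (metres) seen from 1 m away.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
keypoints = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (0.0, 0.1, 0.0)]
intrinsics = (500.0, 500.0, 320.0, 240.0)  # fx, fy, cx, cy (assumed values)
observed = project(keypoints, identity, [0.0, 0.0, 1.0], *intrinsics)

good = count_inliers(keypoints, observed, identity, [0.0, 0.0, 1.0], intrinsics)
bad = count_inliers(keypoints, observed, identity, [0.1, 0.0, 1.0], intrinsics)
print(good, bad)
```

A RANSAC loop would repeat this count over poses fitted to random correspondence subsets and keep the pose with the most inliers. Because that selection step is discrete, it helps explain why the abstract describes such pipelines as not end-to-end trainable.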


Pages: 3783-3795

Page count: 13
