Robotic Grasp Detection With 6-D Pose Estimation Based on Graph Convolution and Refinement

被引:4
|
作者
Yu, Sheng [1 ]
Zhai, Di-Hua [1 ,2 ]
Xia, Yuanqing [1 ,3 ]
Wang, Wei [4 ]
Zhang, Chengyu [4 ]
Zhao, Shiqi [4 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314001, Peoples R China
[3] Zhongyuan Univ Technol, Zhengzhou 450007, Henan, Peoples R China
[4] China United Network Commun Corp Ltd, Res Inst, Beijing 100176, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 06期
基金
中国国家自然科学基金;
关键词
Convolution network; grasp detection; pose estimation; robot; transformer;
D O I
10.1109/TSMC.2024.3371580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Six-dimensional (6-D) object pose estimation plays a critical role in robotic grasp, which performs extensive usage in manufacturing. The current state-of-the-art pose estimation techniques primarily depend on matching keypoints. Typically, these methods establish a correspondence between 2-D keypoints in an image and the corresponding ones in a 3-D object model. And then they use the PnP-RANSAC algorithm to determine the 6-D pose of the object. However, this approach is not end-to-end trainable and may encounter difficulties when applied to scenarios necessitating differentiable poses. When employing a direct end-to-end regression method, the outcomes are often inferior. To tackle the mentioned problems, we present GR6D, which is a keypoint-and graph-convolution-based neural network for differentiable pose estimation based on RGB-D data. First, we propose a multiscale fusion method that utilizes convolution and graph convolution to exploit information contained in RGB and depth images. Additionally, we propose a transformer-based pose refinement module to further adjust features from RGB images and point clouds. We evaluate GR6D on three datasets: 1) LINEMOD; 2) occlusion LINEMOD; and 3) YCB-Video dataset, and it outperforms most state-of-the-art methods. Finally, we apply GR6D to pose estimation and the robotic grasping task in the real world, manifesting superior performance.
引用
收藏
页码:3783 / 3795
页数:13
相关论文
共 50 条
  • [21] Reconstruction-based 6D pose estimation for robotic assembly
    Shi, Zhongchen
    Xu, Kai
    Li, Zhang
    Guan, Banglei
    Wang, Gang
    Shang, Yang
    APPLIED OPTICS, 2020, 59 (31) : 9824 - 9835
  • [22] GHand: A Graph Convolution Network for 3D Hand Pose Estimation
    Wang, Pengsheng
    Xue, Guangtao
    Li, Pin
    Kim, Jinman
    Sheng, Bin
    Mao, Lijuan
    ADVANCES IN COMPUTER GRAPHICS, CGI 2020, 2020, 12221 : 374 - 381
  • [23] Robotic Grasp Pose Detection Using Deep Learning
    Caldera, Shehan
    Rassau, Alexander
    Chai, Douglas
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 1966 - 1972
  • [24] Conditional Directed Graph Convolution for 3D Human Pose Estimation
    Hu, Wenbo
    Zhang, Changgong
    Zhan, Fangneng
    Zhang, Lei
    Wong, Tien-Tsin
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 602 - 611
  • [25] SKGNet: Robotic Grasp Detection With Selective Kernel Convolution
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (04) : 2241 - 2252
  • [26] Robotic Grasp Detection Based on Category-Level Object Pose Estimation With Self-Supervised Learning
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (01) : 625 - 635
  • [27] ContourPose: Monocular 6-D Pose Estimation Method for Reflective Textureless Metal Parts
    He, Zaixing
    Li, Quanzhi
    Zhao, Xinyue
    Wang, Jin
    Shen, Huarong
    Zhang, Shuyou
    Tan, Jianrong
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (05) : 4037 - 4050
  • [28] A Pose Proposal and Refinement Network for Better 6D Object Pose Estimation
    Trabelsi, Ameni
    Chaabane, Mohamed
    Blanchard, Nathaniel
    Beveridge, Ross
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2381 - 2390
  • [29] Optimal Model-Based 6-D Object Pose Estimation With Structured-Light Depth Sensors
    Landau, Michael J.
    Beling, Peter A.
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2017, 3 (01) : 58 - 73
  • [30] 6-DoF grasp pose estimation based on instance reconstruction
    Huiyan Han
    Wenjun Wang
    Xie Han
    Xiaowen Yang
    Intelligent Service Robotics, 2024, 17 : 251 - 264