DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation

被引:3
|
作者
Zhou, Lei [1 ]
Liu, Zhiyang [1 ]
Gan, Runze [1 ]
Wang, Haozhe [1 ,2 ]
Ang, Marcelo H., Jr. [1 ]
机构
[1] Natl Univ Singapore, Dept Mech Engn, Singapore 117608, Singapore
[2] Natl Univ Singapore, Integrat Sci & Engn Programme, Grad Sch, Singapore 119077, Singapore
关键词
D O I
10.1109/IROS55552.2023.10341552
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level object pose estimation involves estimating the 6D pose and the 3D metric size of objects from predetermined categories. While recent approaches take categorical shape prior information as reference to improve pose estimation accuracy, the single-stage network design and training manner lead to sub-optimal performance since there are two distinct tasks in the pipeline. In this paper, the advantage of two-stage pipeline over single-stage design is discussed. To this end, we propose a two-stage deformation-and-registration pipeline called DR-Pose, which consists of completion-aided deformation stage and scaled registration stage. The first stage uses a point cloud completion method to generate unseen parts of target object, guiding subsequent deformation on the shape prior. In the second stage, a novel registration network is designed to extract pose-sensitive features and predict the representation of object partial point cloud in canonical space based on the deformation results from the first stage. DR-Pose produces superior results to the state-of-the-art shape prior-based methods on both CAMERA25 and REAL275 benchmarks. Codes are available at https://github.com/Zray26/DR-Pose.git.
引用
收藏
页码:1192 / 1199
页数:8
相关论文
共 50 条
  • [21] Category-Level Object Pose Estimation in Heavily Cluttered Scenes by Generalized Two-Stage Shape Reconstructor
    Tatemichi, Hiroki
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Ide, Ichiro
    Murase, Hiroshi
    IEEE ACCESS, 2024, 12 : 33440 - 33448
  • [22] Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
    Peng, Wanli
    Yan, Jianhang
    Wen, Hongtao
    Sun, Yi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2082 - 2090
  • [23] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871
  • [24] Attention-guided RGB-D Fusion Network for Category-level 6D Object Pose Estimation
    Wang, Hao
    Li, Weiming
    Kim, Jiyeon
    Wang, Qiang
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10651 - 10658
  • [25] Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    PATTERN RECOGNITION, 2024, 145
  • [26] Category-Level Object Pose Estimation with Statistic Attention
    Jiang, Changhong
    Mu, Xiaoqiao
    Zhang, Bingbing
    Liang, Chao
    Xie, Mujun
    SENSORS, 2024, 24 (16)
  • [27] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    IEEE Transactions on Multimedia, 2024, 26 : 1665 - 1680
  • [28] SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation
    Lin, Haitao
    Liu, Zichang
    Cheang, Chilam
    Fu, Yanwei
    Guo, Guodong
    Xue, Xiangyang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6697 - 6707
  • [29] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1665 - 1680
  • [30] Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
    Zhang, Mengchen
    Wu, Tong
    Wang, Tai
    Wang, Tengfei
    Liu, Ziwei
    Lin, Dahua
    COMPUTER VISION - ECCV 2024, PT XXV, 2025, 15083 : 216 - 232