Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds

被引:7
|
作者
Zou, Lu [1 ]
Huang, Zhangjin [1 ,2 ,3 ]
Gu, Naijie [1 ,2 ]
Wang, Guoping [4 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Anhui Prov Key Lab Software Comp & Commun, Hefei 230027, Peoples R China
[3] USTC, Deqing Alpha Innovat Res Inst, Huzhou 313299, Peoples R China
[4] Peking Univ, Beijing 100871, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
6D object pose estimation; 3D object detection; Point cloud processing; Shape recovery;
D O I
10.1016/j.patcog.2023.109896
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level 6D object pose estimation aims to predict the position and orientation of unseen object instances, which is a fundamental problem in robotic applications. Previous works mainly focused on exploiting visual cues from RGB images, while depth images received less attention. However, depth images contain rich geometric attributes about the object's shape, which are crucial for inferring the object's pose. This work achieves category-level 6D object pose estimation by performing sufficient geometric learning from depth images represented by point clouds. Specifically, we present a novel geometric consistency and geometric discrepancy learning framework called CD-Pose to resolve the intra-category variation, inter-category similarity, and objects with complex structures. Our network consists of a Pose-Consistent Module and a Pose-Discrepant Module. First, a simple MLP-based Pose-Consistent Module is utilized to extract geometrically consistent pose features of objects from the pre-computed object shape priors for each category. Then, the Pose Discrepant Module, designed as a multi-scale region-guided transformer network, is dedicated to exploring each instance's geometrically discrepant features. Next, the NOCS model of the object is reconstructed according to the integration of consistent and discrepant geometric representations. Finally, 6D object poses are obtained by solving the similarity transformation between the reconstruction and the observed point cloud. Experiments on the benchmark datasets show that our CD-Pose produces superior results to state-of-the-art competitors.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
    Peng, Wanli
    Yan, Jianhang
    Wen, Hongtao
    Sun, Yi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2082 - 2090
  • [22] Attention-guided RGB-D Fusion Network for Category-level 6D Object Pose Estimation
    Wang, Hao
    Li, Weiming
    Kim, Jiyeon
    Wang, Qiang
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10651 - 10658
  • [23] Learning latent geometric consistency for 6D object pose estimation in heavily cluttered scenes
    Li, Qingnan
    Hu, Ruimin
    Xiao, Jing
    Wang, Zhongyuan
    Chen, Yu
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 70
  • [24] Learning stereopsis from geometric synthesis for 6D object pose estimation
    State Key Laboratory of Industrial Control Technology and Institue of Cyber-Systems and Control, Zhejiang University, Zhejiang, China
    arXiv, 1600,
  • [25] Category-Level Articulated Object Pose Estimation
    Li, Xiaolong
    Wang, He
    Yi, Li
    Guibas, Leonidas
    Abbott, A. Lynn
    Song, Shuran
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3703 - 3712
  • [26] Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation
    Zhang, Mengchen
    Wu, Tong
    Wang, Tai
    Wang, Tengfei
    Liu, Ziwei
    Lin, Dahua
    COMPUTER VISION - ECCV 2024, PT XXV, 2025, 15083 : 216 - 232
  • [27] VI-Net: Boosting Category-level 6D Object Pose Estimation via Learning Decoupled Rotations on the Spherical Representations
    Lin, Jiehong
    Wei, Zewei
    Zhang, Yabin
    Jia, Kui
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13955 - 13965
  • [28] CATRE: Iterative Point Clouds Alignment for Category-Level Object Pose Refinement
    Liu, Xingyu
    Wang, Gu
    Li, Yi
    Ji, Xiangyang
    COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 499 - 516
  • [29] SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation
    Lin, Haitao
    Liu, Zichang
    Cheang, Chilam
    Fu, Yanwei
    Guo, Guodong
    Xue, Xiangyang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6697 - 6707
  • [30] DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation
    Zhou, Lei
    Liu, Zhiyang
    Gan, Runze
    Wang, Haozhe
    Ang, Marcelo H., Jr.
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1192 - 1199