Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds

被引:7
|
作者
Zou, Lu [1 ]
Huang, Zhangjin [1 ,2 ,3 ]
Gu, Naijie [1 ,2 ]
Wang, Guoping [4 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Anhui Prov Key Lab Software Comp & Commun, Hefei 230027, Peoples R China
[3] USTC, Deqing Alpha Innovat Res Inst, Huzhou 313299, Peoples R China
[4] Peking Univ, Beijing 100871, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
6D object pose estimation; 3D object detection; Point cloud processing; Shape recovery;
D O I
10.1016/j.patcog.2023.109896
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level 6D object pose estimation aims to predict the position and orientation of unseen object instances, which is a fundamental problem in robotic applications. Previous works mainly focused on exploiting visual cues from RGB images, while depth images received less attention. However, depth images contain rich geometric attributes about the object's shape, which are crucial for inferring the object's pose. This work achieves category-level 6D object pose estimation by performing sufficient geometric learning from depth images represented by point clouds. Specifically, we present a novel geometric consistency and geometric discrepancy learning framework called CD-Pose to resolve the intra-category variation, inter-category similarity, and objects with complex structures. Our network consists of a Pose-Consistent Module and a Pose-Discrepant Module. First, a simple MLP-based Pose-Consistent Module is utilized to extract geometrically consistent pose features of objects from the pre-computed object shape priors for each category. Then, the Pose Discrepant Module, designed as a multi-scale region-guided transformer network, is dedicated to exploring each instance's geometrically discrepant features. Next, the NOCS model of the object is reconstructed according to the integration of consistent and discrepant geometric representations. Finally, 6D object poses are obtained by solving the similarity transformation between the reconstruction and the observed point cloud. Experiments on the benchmark datasets show that our CD-Pose produces superior results to state-of-the-art competitors.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS
    Chen, Kai
    James, Stephen
    Sui, Congying
    Liu, Yun-Hui
    Abbeel, Pieter
    Dou, Qi
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2855 - 2861
  • [32] GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence
    Wang, Pengyuan
    Ikeda, Takuya
    Lee, Robert
    Nishiwaki, Koichi
    COMPUTER VISION - ECCV 2024, PT XXVII, 2025, 15085 : 108 - 126
  • [33] GeoPose: Dense Reconstruction Guided 6D Object Pose Estimation With Geometric Consistency
    Wang, Deming
    Zhou, Guangliang
    Yan, Yi
    Chen, Huiyi
    Chen, Qijun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4394 - 4408
  • [34] 6D Object Pose Regression via Supervised Learning on Point Clouds
    Gao, Ge
    Lauri, Mikko
    Wang, Yulong
    Hu, Xiaolin
    Zhang, Jianwei
    Frintrop, Simone
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 3643 - 3649
  • [35] TGNF-Net: Two-Stage Geometric Neighborhood Fusion Network for Category-Level 6D Pose Estimation
    Zhao, Xiaolong
    Yan, Feihu
    Zhao, Guangzhe
    Wang, Caiyong
    INFORMATION, 2025, 16 (02)
  • [36] Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image
    Fan, Zhaoxin
    Song, Zhenbo
    Xu, Jian
    Wang, Zhicheng
    Wu, Kejian
    Liu, Hongyan
    He, Jun
    COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 220 - 236
  • [37] Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks
    Lin, Jiehong
    Wei, Zewei
    Ding, Changxing
    Jia, Kui
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 19 - 34
  • [38] Category-Level Object Pose Estimation with Statistic Attention
    Jiang, Changhong
    Mu, Xiaoqiao
    Zhang, Bingbing
    Liang, Chao
    Xie, Mujun
    SENSORS, 2024, 24 (16)
  • [39] Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features
    Ma, Wufei
    Wang, Angtian
    Yuille, Alan
    Kortylewski, Adam
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 492 - 508
  • [40] Category-Level Articulated Object 9D Pose Estimation via Reinforcement Learning
    Liu, Liu
    Du, Jianming
    Wu, Hao
    Yang, Xun
    Liu, Zhenguang
    Hong, Richang
    Wang, Meng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 728 - 736