Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds

被引:7
|
作者
Zou, Lu [1 ]
Huang, Zhangjin [1 ,2 ,3 ]
Gu, Naijie [1 ,2 ]
Wang, Guoping [4 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Anhui Prov Key Lab Software Comp & Commun, Hefei 230027, Peoples R China
[3] USTC, Deqing Alpha Innovat Res Inst, Huzhou 313299, Peoples R China
[4] Peking Univ, Beijing 100871, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
6D object pose estimation; 3D object detection; Point cloud processing; Shape recovery;
D O I
10.1016/j.patcog.2023.109896
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level 6D object pose estimation aims to predict the position and orientation of unseen object instances, which is a fundamental problem in robotic applications. Previous works mainly focused on exploiting visual cues from RGB images, while depth images received less attention. However, depth images contain rich geometric attributes about the object's shape, which are crucial for inferring the object's pose. This work achieves category-level 6D object pose estimation by performing sufficient geometric learning from depth images represented by point clouds. Specifically, we present a novel geometric consistency and geometric discrepancy learning framework called CD-Pose to resolve the intra-category variation, inter-category similarity, and objects with complex structures. Our network consists of a Pose-Consistent Module and a Pose-Discrepant Module. First, a simple MLP-based Pose-Consistent Module is utilized to extract geometrically consistent pose features of objects from the pre-computed object shape priors for each category. Then, the Pose Discrepant Module, designed as a multi-scale region-guided transformer network, is dedicated to exploring each instance's geometrically discrepant features. Next, the NOCS model of the object is reconstructed according to the integration of consistent and discrepant geometric representations. Finally, 6D object poses are obtained by solving the similarity transformation between the reconstruction and the observed point cloud. Experiments on the benchmark datasets show that our CD-Pose produces superior results to state-of-the-art competitors.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] SOCS: Semantically-aware Object Coordinate Space for Category-Level 6D Object Pose Estimation under Large Shape Variations
    Wan, Boyan
    Shi, Yifei
    Xu, Kai
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14019 - 14028
  • [42] Refined Prior Guided Category-Level 6D Pose Estimation and Its Application on Robotic Grasping
    Sun, Huimin
    Zhang, Yilin
    Sun, Honglin
    Hashimoto, Kenji
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [43] iCaps: Iterative Category-Level Object Pose and Shape Estimation
    Deng, Xinke
    Geng, Junyi
    Bretl, Timothy
    Xiang, Yu
    Fox, Dieter
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 1784 - 1791
  • [44] A Visual Navigation Perspective for Category-Level Object Pose Estimation
    Guo, Jiaxin
    Zhong, Fangxun
    Xiong, Rong
    Liu, Yunhui
    Wang, Yue
    Liao, Yiyi
    COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 123 - 141
  • [45] Sca-pose: category-level 6D pose estimation with adaptive shape prior based on CNN and graph convolution
    Zuo, Guoyu
    Yu, Shan
    Yu, Shuangyue
    Liu, Hong
    Zhao, Min
    INTELLIGENT SERVICE ROBOTICS, 2025, 18 (02) : 351 - 361
  • [46] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871
  • [47] Zero-Shot Category-Level Object Pose Estimation
    Goodwin, Walter
    Vaze, Sagar
    Havoutis, Ioannis
    Posner, Ingmar
    COMPUTER VISION, ECCV 2022, PT XXXIX, 2022, 13699 : 516 - 532
  • [48] Category-Level Metric Scale Object Shape and Pose Estimation
    Lee, Taeyeop
    Lee, Byeong-Uk
    Kim, Myungchul
    Kweon, I. S.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 8575 - 8582
  • [49] CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    Li, Dong
    Zhao, Shiqi
    IEEE Transactions on Multimedia, 2024, 26 : 1665 - 1680
  • [50] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871