HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation

被引:21
|
作者
Zheng, Linfang [1 ,4 ]
Wang, Chen [1 ,2 ]
Sun, Yinghan [1 ]
Dasgupta, Esha [4 ]
Chen, Hua [1 ]
Leonardis, Ales [4 ]
Zhang, Wei [1 ,3 ]
Chang, Hyung Jin [4 ]
机构
[1] Southern Univ Sci & Technol, Dept Mech & Energy Engn, Shenzhen, Peoples R China
[2] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Univ Birmingham, Sch Comp Sci, Birmingham, England
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01646
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on the problem of category-level object pose estimation, which is challenging due to the large intra-category shape variation. 3D graph convolution (3D-GC) based methods have been widely used to extract local geometric features, but they have limitations for complex shaped objects and are sensitive to noise. Moreover, the scale and translation invariant properties of 3D-GC restrict the perception of an object's size and translation information. In this paper, we propose a simple network structure, the HS-layer, which extends 3D-GC to extract hybrid scope latent features from point cloud data for category-level object pose estimation tasks. The proposed HS-layer: 1) is able to perceive local-global geometric structure and global information, 2) is robust to noise, and 3) can encode size and translation information. Our experiments show that the simple replacement of the 3D-GC layer with the proposed HS-layer on the baseline method (GPV-Pose) achieves a significant improvement, with the performance increased by 14.5% on 5 degrees 2cm metric and 10.3% on IoU(75). Our method outperforms the state-of-the-art methods by a large margin (8.3% on 5 degrees 2cm, 6.9% on IoU(75)) on REAL275 dataset and runs in real-time (50 FPS)(1).
引用
收藏
页码:17163 / 17173
页数:11
相关论文
共 50 条
  • [41] Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation
    Lin, Yunzhi
    Tremblay, Jonathan
    Tyree, Stephen
    Vela, Patricio A.
    Birchfield, Stan
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
  • [42] SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation
    Chen, Yamei
    Di, Yan
    Zhai, Guangyao
    Manhardt, Fabian
    Zhang, Chenyangguang
    Zhang, Ruida
    Tombari, Federico
    Navab, Nassir
    Busam, Benjamin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 9959 - 9969
  • [43] Keypoint-Based Disentangled Pose Network for Category-Level 6-D Object Pose Tracking
    Sun, Shantong
    Liu, Rongke
    Sun, Shuqiao
    Park, Unsang
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2022, 42 (05) : 28 - 36
  • [44] CATRE: Iterative Point Clouds Alignment for Category-Level Object Pose Refinement
    Liu, Xingyu
    Wang, Gu
    Li, Yi
    Ji, Xiangyang
    COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 499 - 516
  • [45] Robotic Grasp Detection Based on Category-Level Object Pose Estimation With Self-Supervised Learning
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (01) : 625 - 635
  • [46] PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation with Photometrically Challenging Objects
    Wang, Pengyuan
    Jung, HyunJun
    Li, Yitong
    Shen, Siyuan
    Srikanth, Rahul Parthasarathy
    Garattoni, Lorenzo
    Meier, Sven
    Navab, Nassir
    Busam, Benjamin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 21190 - 21199
  • [47] You Only Look at One: Category-Level Object Representations for Pose Estimation From a Single Example
    Goodwin, Walter
    Havoutis, Ioannis
    Posner, Ingmar
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1435 - 1445
  • [48] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871
  • [49] Estimation method of category-level multi-object rigid body 6D pose
    Cheng, Shuo
    Jia, Di
    Yang, Liu
    He, Dekun
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2025, 40 (03) : 457 - 471
  • [50] Object Pose Estimation and Feature Extraction Based on PVNet
    Kao, Yi-Hsiang
    Chen, Ching-Kun
    Chen, Chih-Cheng
    Lan, Chen-Yen
    IEEE ACCESS, 2022, 10 : 122387 - 122398