HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation

Cited by: 21
Authors
Zheng, Linfang [1, 4]
Wang, Chen [1, 2]
Sun, Yinghan [1]
Dasgupta, Esha [4]
Chen, Hua [1]
Leonardis, Ales [4]
Zhang, Wei [1, 3]
Chang, Hyung Jin [4]
Affiliations
[1] Southern Univ Sci & Technol, Dept Mech & Energy Engn, Shenzhen, Peoples R China
[2] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] Univ Birmingham, Sch Comp Sci, Birmingham, England
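Funding
National Natural Science Foundation of China
Keywords
DOI
10.1109/CVPR52729.2023.01646
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract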
In this paper, we focus on the problem of category-level object pose estimation, which is challenging because of large intra-category shape variation. Methods based on 3D graph convolution (3D-GC) have been widely used to extract local geometric features, but they struggle with complex-shaped objects and are sensitive to noise. Moreover, the scale- and translation-invariant properties of 3D-GC restrict the perception of an object's size and translation. We propose a simple network structure, the HS-layer, which extends 3D-GC to extract hybrid-scope latent features from point cloud data for category-level object pose estimation. The proposed HS-layer 1) perceives local-global geometric structure and global information, 2) is robust to noise, and 3) can encode size and translation information. Our experiments show that simply replacing the 3D-GC layer with the proposed HS-layer in the baseline method (GPV-Pose) yields a significant improvement: performance increases by 14.5% on the 5°2cm metric and by 10.3% on IoU75. Our method outperforms state-of-the-art methods by a large margin (8.3% on 5°2cm, 6.9% on IoU75) on the REAL275 dataset and runs in real time (50 FPS).
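As a reading aid for the 5°2cm metric quoted in the abstract, the following is a minimal sketch of how such a threshold check is commonly computed: a predicted pose counts as correct when its rotation error is below 5 degrees and its translation error is below 2 cm. The function names, the metre-based translation units, and the strict inequality are assumptions made for illustration; this is not the evaluation code used by the authors, and it ignores the special handling that symmetric object categories usually need.

```python
import math


def rotation_error_deg(R_pred, R_gt):
    """Geodesic angle in degrees between two 3x3 rotation matrices (nested lists)."""
    # trace(R_pred^T @ R_gt) equals the sum of element-wise products.
    trace = sum(R_pred[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))


def translation_error_cm(t_pred, t_gt):
    """Euclidean distance between two translations given in metres, returned in cm."""
    return 100.0 * math.dist(t_pred, t_gt)


def within_threshold(R_pred, t_pred, R_gt, t_gt, max_deg=5.0, max_cm=2.0):
    """True if both errors fall below the thresholds (assumed strict inequality)."""
    return (rotation_error_deg(R_pred, R_gt) < max_deg
            and translation_error_cm(t_pred, t_gt) < max_cm)


def rot_z(deg):
    """Hypothetical helper: rotation about the z-axis by `deg` degrees."""
    a = math.radians(deg)
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


if __name__ == "__main__":
    # A prediction rotated 3 degrees and shifted 1 cm from the ground truth.
    print(within_threshold(rot_z(3.0), [0.0, 0.0, 0.01], rot_z(0.0), [0.0, 0.0, 0.0]))
```

A reported 5°2cm score is then the fraction (or a mean average precision over categories) of predictions that pass this check; the exact aggregation depends on the benchmark protocol.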
Pages: 17163 - 17173
Number of pages: 11
Related papers
50 in total
  • [31] GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting
    Di, Yan
    Zhang, Ruida
    Lou, Zhiqiang
    Manhardt, Fabian
    Ji, Xiangyang
    Navab, Nassir
    Tombari, Federico
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022 : 6771 - 6781
  • [32] Simultaneous Scene-independent Camera Localization and Category-level Object Pose Estimation via Multi-level Feature Fusion
    Wang, Junyi
    Qi, Yue
    2023 IEEE CONFERENCE VIRTUAL REALITY AND 3D USER INTERFACES, VR, 2023 : 254 - 264
  • [33] GarmentTracking: Category-Level Garment Pose Tracking
    Xue, Han
    Xu, Wenqiang
    Zhang, Jieyi
    Tang, Tutian
    Li, Yutong
    Du, Wenxin
    Ye, Ruolin
    Lu, Cewu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023 : 21233 - 21242
  • [34] Toward Real-World Category-Level Articulation Pose Estimation
    Liu, Liu
    Xue, Han
    Xu, Wenqiang
    Fu, Haoyuan
    Lu, Cewu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1072 - 1083
  • [35] GSNet: Model Reconstruction Network for Category-level 6D Object Pose and Size Estimation
    Liu, Penglei
    Zhang, Qieshi
    Cheng, Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023 : 2898 - 2904
  • [36] LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation
    Zhang, Ruida
    Huang, Ziqin
    Wang, Gu
    Zhang, Chenyangguang
    Di, Yan
    Zuo, Xingxing
    Tang, Jiwen
    Ji, Xiangyang
    COMPUTER VISION - ECCV 2024, PT XXV, 2025, 15083 : 467 - 484
  • [37] CLIPose: Category-Level Object Pose Estimation With Pre-Trained Vision-Language Knowledge
    Lin, Xiao
    Zhu, Minghao
    Dang, Ronghao
    Zhou, Guangliang
    Shu, Shaolong
    Lin, Feng
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9125 - 9138
  • [38] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1857 - 1871
  • [39] Category-Level 6D Object Pose Recovery in Depth Images
    Sahin, Caner
    Kim, Tae-Kyun
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 665 - 681
  • [40] GPT-COPE: A Graph-Guided Point Transformer for Category-Level Object Pose Estimation
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2385 - 2398