Median-shape Representation Learning for Category-level Object Pose Estimation in Cluttered Environments

被引:1
|
作者
Tatemichi, Hiroki [1 ]
Kawanishi, Yasutomo [1 ]
Deguchi, Daisuke [1 ]
Ide, Ichiro [1 ]
Amma, Ayako [2 ]
Murase, Hiroshi [1 ]
机构
[1] Nagoya Univ, Nagoya, Aichi, Japan
[2] Toyota Motor Co Ltd, Toyota, Japan
关键词
RECOGNITION;
D O I
10.1109/ICPR48806.2021.9412318
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an occlusion-robust pose estimation method of an unknown object instance in an object category from a depth image. In a cluttered environment, objects are often occluded mutually. For estimating the pose of an object in such a situation, a method that de-occludes the unobservable area of the object would be effective. However, there are two difficulties; occlusion causes the offset between the center of the actual object and its observable area, and different instances in a category may have different shapes. To cope with these difficulties, we propose a two-stage Encoder-Decoder model to extract features with objects whose centers are aligned to the image center. In the model, we also propose the Median-shape Reconstructor as the second stage to absorb shape variations in a category. By evaluating the method with both a large-scale virtual dataset and a real dataset, we confirmed the proposed method achieves good performance on pose estimation of an occluded object from a depth image.
引用
收藏
页码:4473 / 4480
页数:8
相关论文
共 50 条
  • [41] SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation
    Lin, Haitao
    Liu, Zichang
    Cheang, Chilam
    Fu, Yanwei
    Guo, Guodong
    Xue, Xiangyang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6697 - 6707
  • [42] 6D-ViT: Category-Level 6D Object Pose Estimation via Transformer-Based Instance Representation Learning
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6907 - 6921
  • [43] TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation
    Lee, Taeyeop
    Tremblay, Jonathan
    Blukis, Valts
    Wen, Bowen
    Lee, Byeong-Uk
    Shin, Inkyu
    Birchfield, Stan
    Kweon, In So
    Yoon, Kuk-Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21285 - 21295
  • [44] Leveraging SE(3) Equivariance for Self-Supervised Category-Level Object Pose Estimation
    Li, Xiaolong
    Weng, Yijia
    Yi, Li
    Guibas, Leonidas
    Abbott, A. Lynn
    Song, Shuran
    Wang, He
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
    Liu, Jierui
    Cao, Zhiqiang
    Tang, Yingbo
    Liu, Xilong
    Tan, Min
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6728 - 6740
  • [46] Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    PATTERN RECOGNITION, 2024, 145
  • [47] GSNet: Model Reconstruction Network for Category-level 6D Object Pose and Size Estimation
    Liu, Penglei
    Zhang, Qieshi
    Cheng, Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2898 - 2904
  • [48] CLIPose: Category-Level Object Pose Estimation With Pre-Trained Vision-Language Knowledge
    Lin, Xiao
    Zhu, Minghao
    Dang, Ronghao
    Zhou, Guangliang
    Shu, Shaolong
    Lin, Feng
    Liu, Chengju
    Chen, Qijun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9125 - 9138
  • [49] Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset
    Fu, Yang
    Wang, Xiaolong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] GPT-COPE: A Graph-Guided Point Transformer for Category-Level Object Pose Estimation
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2385 - 2398