Zero-Shot Category-Level Object Pose Estimation

Cited by: 18
Authors
Goodwin, Walter [1]
Vaze, Sagar [2]
Havoutis, Ioannis [1]
Posner, Ingmar [1]
Affiliations
[1] Univ Oxford, Oxford Robot Inst, Oxford, England
[2] Univ Oxford, Visual Geometry Grp, Oxford, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
DOI
10.1007/978-3-031-19842-7_30
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object pose estimation is an important component of most vision pipelines for embodied agents, as well as in 3D vision more generally. In this paper, we tackle the problem of estimating the pose of novel object categories in a zero-shot manner. This extends much of the existing literature by removing the need for pose-labelled datasets or category-specific CAD models for training or inference. Specifically, we make the following contributions. First, we formalise the zero-shot, category-level pose estimation problem and frame it in a way that is most applicable to real-world embodied agents. Second, we propose a novel method based on semantic correspondences from a self-supervised vision transformer to solve the pose estimation problem. Third, we re-purpose the recent CO3D dataset to present a controlled and realistic test setting. Finally, we demonstrate that all baselines for our proposed task perform poorly, and show that our method provides a six-fold improvement in average rotation accuracy at 30°. Our code is available at https://github.com/applied-ai-lab/zero-shot-pose.
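The abstract reports "rotation accuracy at 30°" without defining it. As a rough illustration only, the sketch below uses the definition that is common in the pose estimation literature: the fraction of predictions whose geodesic rotation error is below 30°. This is an assumption, not the authors' evaluation code; the function names (`rotation_z`, `geodesic_error_deg`, `accuracy_at`), the strict threshold comparison, and the toy predictions are all hypothetical.

```python
import math

def rotation_z(deg):
    """Rotation matrix (nested lists) for a rotation of `deg` degrees about z."""
    t = math.radians(deg)
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def geodesic_error_deg(r_pred, r_true):
    """Angle, in degrees, of the relative rotation r_pred^T r_true."""
    # trace(A^T B) equals the sum of element-wise products of A and B.
    trace = sum(r_pred[i][j] * r_true[i][j] for i in range(3) for j in range(3))
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))

def accuracy_at(errors_deg, threshold_deg=30.0):
    """Fraction of errors strictly below the threshold (assumed convention)."""
    return sum(e < threshold_deg for e in errors_deg) / len(errors_deg)

# Toy data (hypothetical): ground truth is the identity, predictions are
# off by 5, 20, 45 and 170 degrees about the z-axis.
truth = rotation_z(0.0)
predictions = [rotation_z(a) for a in (5.0, 20.0, 45.0, 170.0)]
errors = [geodesic_error_deg(p, truth) for p in predictions]
acc30 = accuracy_at(errors)  # two of the four errors fall below 30 degrees
```

How the paper averages this quantity over categories and instances is not stated in the abstract, so any aggregation on top of `accuracy_at` would be a further assumption.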
Pages: 516 - 532
Page count: 17
Related papers
50 in total
  • [1] Category-Level Articulated Object Pose Estimation
    Li, Xiaolong
    Wang, He
    Yi, Li
    Guibas, Leonidas
    Abbott, A. Lynn
    Song, Shuran
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020 : 3703 - 3712
  • [2] Category-Level Object Pose Estimation with Statistic Attention
    Jiang, Changhong
    Mu, Xiaoqiao
    Zhang, Bingbing
    Liang, Chao
    Xie, Mujun
    SENSORS, 2024, 24 (16)
  • [3] iCaps: Iterative Category-Level Object Pose and Shape Estimation
    Deng, Xinke
    Geng, Junyi
    Bretl, Timothy
    Xiang, Yu
    Fox, Dieter
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1784 - 1791
  • [4] A Visual Navigation Perspective for Category-Level Object Pose Estimation
    Guo, Jiaxin
    Zhong, Fangxun
    Xiong, Rong
    Liu, Yunhui
    Wang, Yue
    Liao, Yiyi
    COMPUTER VISION - ECCV 2022, PT VI, 2022, 13666 : 123 - 141
  • [5] Category-Level Metric Scale Object Shape and Pose Estimation
    Lee, Taeyeop
    Lee, Byeong-Uk
    Kim, Myungchul
    Kweon, I. S.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 8575 - 8582
  • [6] Open-Vocabulary Category-Level Object Pose and Size Estimation
    Cai, Junhao
    He, Yisheng
    Yuan, Weihao
    Zhu, Siyu
    Dong, Zilong
    Bo, Liefeng
    Chen, Qifeng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09) : 7661 - 7668
  • [7] Zero-shot Pose Estimation Using Image Translation to Maintain Object Pose
    Fujita K.
    Tasaki T.
    IEEJ Transactions on Electronics, Information and Systems, 2023, 143 (12) : 1113 - 1122
  • [8] TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation
    Zhan, Yue
    Wang, Xin
    Nie, Lang
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9749 - 9762
  • [9] GenPose: Generative Category-level Object Pose Estimation via Diffusion Models
    Zhang, Jiyao
    Wu, Mingdong
    Dong, Hao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence
    Wang, Pengyuan
    Ikeda, Takuya
    Lee, Robert
    Nishiwaki, Koichi
    COMPUTER VISION - ECCV 2024, PT XXVII, 2025, 15085 : 108 - 126