Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation

被引:0
|
作者
Lin, Yunzhi [1 ,2 ]
Tremblay, Jonathan [1 ]
Tyree, Stephen [1 ]
Vela, Patricio A. [2 ]
Birchfield, Stan [1 ]
机构
[1] NVIDIA, Santa Clara, CA 95051 USA
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
D O I
10.1109/ICRA.46639.2022.9811720
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a single-stage, category-level 6-DoF pose estimation algorithm that simultaneously detects and tracks instances of objects within a known category. Our method takes as input the previous and current frame from a monocular RGB video, as well as predictions from the previous frame, to predict the bounding cuboid and 6-DoF pose (up to scale). Internally, a deep network predicts distributions over object keypoints (vertices of the bounding cuboid) in image coordinates, after which a novel probabilistic filtering process integrates across estimates before computing the final pose using PnP. Our framework allows the system to take previous uncertainties into consideration when predicting the current frame, resulting in predictions that are more accurate and stable than single frame methods. Extensive experiments show that our method outperforms existing approaches on the challenging Objectron benchmark of annotated object videos. We also demonstrate the usability of our work in an augmented reality setting.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image
    Fan, Zhaoxin
    Song, Zhenbo
    Xu, Jian
    Wang, Zhicheng
    Wu, Kejian
    Liu, Hongyan
    He, Jun
    COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 220 - 236
  • [42] Best Next-Viewpoint Recommendation by Selecting Minimum Pose Ambiguity for Category-Level Object Pose Estimation
    Hashim N.M.Z.
    Kawanishi Y.
    Deguchi D.
    Ide I.
    Amma A.
    Kobori N.
    Murase H.
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2021, 87 (05): : 440 - 446
  • [43] TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation
    Lee, Taeyeop
    Tremblay, Jonathan
    Blukis, Valts
    Wen, Bowen
    Lee, Byeong-Uk
    Shin, Inkyu
    Birchfield, Stan
    Kweon, In So
    Yoon, Kuk-Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21285 - 21295
  • [44] Leveraging SE(3) Equivariance for Self-Supervised Category-Level Object Pose Estimation
    Li, Xiaolong
    Weng, Yijia
    Yi, Li
    Guibas, Leonidas
    Abbott, A. Lynn
    Song, Shuran
    Wang, He
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
    Liu, Jierui
    Cao, Zhiqiang
    Tang, Yingbo
    Liu, Xilong
    Tan, Min
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6728 - 6740
  • [46] Median-shape Representation Learning for Category-level Object Pose Estimation in Cluttered Environments
    Tatemichi, Hiroki
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Ide, Ichiro
    Amma, Ayako
    Murase, Hiroshi
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4473 - 4480
  • [47] Adversarial imitation learning-based network for category-level 6D object pose estimation
    Sun, Shantong
    Bao, Xu
    Kaushik, Aryan
    MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
  • [48] Category-Level Articulated Object 9D Pose Estimation via Reinforcement Learning
    Liu, Liu
    Du, Jianming
    Wu, Hao
    Yang, Xun
    Liu, Zhenguang
    Hong, Richang
    Wang, Meng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 728 - 736
  • [49] VoCAPTER: Voting-based Pose Tracking for Category-level Articulated Object via Inter-frame Priors
    Zhang, Li
    Han, Zean
    Zhong, Yan
    Yu, Qiaojun
    Wu, Xingyu
    Wang, Xue
    Wang, Rujing
    MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia, : 8942 - 8951
  • [50] Generative Category-Level Shape and Pose Estimation with Semantic Primitives
    Li, Guanglin
    Li, Yifeng
    Ye, Zhichao
    Zhang, Qihang
    Kong, Tao
    Cui, Zhaopeng
    Zhang, Guofeng
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1390 - 1400