Enriching Object Detection with 2D-3D Registration and Continuous Viewpoint Estimation

Authors
Choy, Christopher Bongsoo [1]
Stark, Michael [2]
Corbett-Davies, Sam [1]
Savarese, Silvio [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Max Planck Inst Informat, Saarbrucken, Germany
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A large body of recent work on object detection has focused on exploiting 3D CAD model databases to improve detection performance. Many of these approaches work by aligning exact 3D models to images using templates generated from renderings of the 3D models at a set of discrete viewpoints. However, the training procedures for these approaches are computationally expensive and require gigabytes of memory and storage, while the viewpoint discretization hampers pose estimation performance. We propose an efficient method for synthesizing templates from 3D models that runs on the fly; that is, it quickly produces detectors for an arbitrary viewpoint of a 3D model without expensive dataset-dependent training or template storage. Given a 3D model and an arbitrary continuous detection viewpoint, our method synthesizes a discriminative template by extracting features from a rendered view of the object and decorrelating spatial dependences among the features. Our decorrelation procedure relies on a gradient-based algorithm that is more numerically stable than standard decomposition-based procedures, and we efficiently search for candidate detections by computing FFT-based template convolutions. Because our template synthesis procedure is fast, we are able to jointly optimize scale, translation, continuous rotation, and focal length using the Metropolis-Hastings algorithm. We provide an efficient GPU implementation of our algorithm, and we validate its performance on the 3D Object Classes and PASCAL3D+ datasets.
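The FFT-based template search mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' GPU implementation. It assumes a single-channel 2D feature map and a single template, where a real detector would likely sum the scores over many feature channels. The function name `fft_correlate` and the toy data are hypothetical, chosen only for illustration.

```python
import numpy as np

def fft_correlate(feature_map, template):
    """Score every fully-contained placement of `template` in `feature_map`.

    Returns an array of shape (H - h + 1, W - w + 1) whose entry (y, x) is
    the sum of feature_map[y:y+h, x:x+w] * template.
    """
    H, W = feature_map.shape
    h, w = template.shape
    # Cross-correlation equals convolution with the template flipped on both axes.
    flipped = template[::-1, ::-1]
    # Zero-pad to the full linear-convolution size so the FFT does not wrap around.
    shape = (H + h - 1, W + w - 1)
    spectrum = np.fft.rfft2(feature_map, s=shape) * np.fft.rfft2(flipped, s=shape)
    full = np.fft.irfft2(spectrum, s=shape)
    # Keep only the offsets where the template lies entirely inside the map.
    return full[h - 1:H, w - 1:W]

# Toy data (an assumption for illustration): plant the template at offset (10, 20).
rng = np.random.default_rng(0)
template = rng.standard_normal((5, 4))
feature_map = np.zeros((32, 40))
feature_map[10:15, 20:24] = template

scores = fft_correlate(feature_map, template)
peak = np.unravel_index(np.argmax(scores), scores.shape)
print(peak)  # location of the highest-scoring placement
```

Computing the scores in the frequency domain costs roughly O(HW log HW) per template regardless of template size, which is one plausible reason such a search pairs well with templates that are synthesized on the fly.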
Pages: 2512-2520
Number of pages: 9