Enriching Object Detection with 2D-3D Registration and Continuous Viewpoint Estimation

Authors
Choy, Christopher Bongsoo [1]
Stark, Michael [2]
Corbett-Davies, Sam [1]
Savarese, Silvio [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Max Planck Inst Informat, Saarbrucken, Germany
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A large body of recent work on object detection has focused on exploiting 3D CAD model databases to improve detection performance. Many of these approaches work by aligning exact 3D models to images using templates generated from renderings of the 3D models at a set of discrete viewpoints. However, the training procedures for these approaches are computationally expensive and require gigabytes of memory and storage, while the viewpoint discretization hampers pose estimation performance. We propose an efficient method for synthesizing templates from 3D models that runs on the fly: it quickly produces detectors for an arbitrary viewpoint of a 3D model without expensive dataset-dependent training or template storage. Given a 3D model and an arbitrary continuous detection viewpoint, our method synthesizes a discriminative template by extracting features from a rendered view of the object and decorrelating spatial dependences among the features. Our decorrelation procedure relies on a gradient-based algorithm that is more numerically stable than standard decomposition-based procedures, and we efficiently search for candidate detections by computing FFT-based template convolutions. Due to the speed of our template synthesis procedure, we are able to perform joint optimization of scale, translation, continuous rotation, and focal length using the Metropolis-Hastings algorithm. We provide an efficient GPU implementation of our algorithm, and we validate its performance on the 3D Object Classes and PASCAL3D+ datasets.
Pages: 2512-2520
Number of pages: 9
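
The abstract mentions a joint search over scale, translation, continuous rotation, and focal length using the Metropolis-Hastings algorithm. The following is only a minimal sketch of a random-walk Metropolis-Hastings search over a continuous pose vector, not the authors' implementation. The function `toy_score`, the pose layout, the step sizes, and the temperature are hypothetical stand-ins; in the method described above the score would presumably come from evaluating a freshly synthesized template against the image.

```python
import math
import random


def metropolis_hastings(score, init, step_sizes, n_iters=2000,
                        temperature=1.0, seed=0):
    """Random-walk Metropolis-Hastings over a continuous parameter vector.

    `score` maps a parameter list to a number (higher is better). The chain
    targets the unnormalized density exp(score / temperature) and the best
    sample seen so far is returned together with its score.
    """
    rng = random.Random(seed)
    current = list(init)
    current_score = score(current)
    best, best_score = list(current), current_score
    for _ in range(n_iters):
        # Symmetric Gaussian proposal, so the Hastings correction cancels.
        proposal = [x + rng.gauss(0.0, s) for x, s in zip(current, step_sizes)]
        proposal_score = score(proposal)
        log_accept = (proposal_score - current_score) / temperature
        # Always accept improvements; accept worse moves with prob. exp(log_accept).
        if log_accept >= 0 or rng.random() < math.exp(log_accept):
            current, current_score = proposal, proposal_score
            if current_score > best_score:
                best, best_score = list(current), current_score
    return best, best_score


# Hypothetical stand-in for a detection score: a smooth bump centred on an
# assumed "true" pose [azimuth_deg, elevation_deg, scale, focal_length].
TRUE_POSE = [30.0, 10.0, 1.2, 800.0]
WIDTHS = [20.0, 20.0, 0.5, 200.0]


def toy_score(pose):
    return -sum(((p - t) / w) ** 2 for p, t, w in zip(pose, TRUE_POSE, WIDTHS))


INIT_POSE = [0.0, 0.0, 1.0, 600.0]
best_pose, best_score = metropolis_hastings(
    toy_score,
    init=INIT_POSE,
    step_sizes=[5.0, 5.0, 0.05, 25.0],
    temperature=0.1,
)
```

Because the sampler only needs score evaluations at arbitrary continuous parameter values, this kind of search is practical only when each evaluation is cheap, which is consistent with the abstract's emphasis on fast template synthesis.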