Enriching Object Detection with 2D-3D Registration and Continuous Viewpoint Estimation

被引:0
|
作者
Choy, Christopher Bongsoo [1 ]
Stark, Michael [2 ]
Corbett-Davies, Sam [1 ]
Savarese, Silvio [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Max Planck Inst Informat, Saarbrucken, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A large body of recent work on object detection has focused on exploiting 3D CAD model databases to improve detection performance. Many of these approaches work by aligning exact 3D models to images using templates generated from renderings of the 3D models at a set of discrete viewpoints. However, the training procedures for these approaches are computationally expensive and require gigabytes of memory and storage, while the viewpoint discretization hampers pose estimation performance. We propose an efficient method for synthesizing templates from 3D models that runs on the fly - that is, it quickly produces detectors for an arbitrary viewpoint of a 3D model without expensive dataset-dependent training or template storage. Given a 3D model and an arbitrary continuous detection viewpoint, our method synthesizes a discriminative template by extracting features from a rendered view of the object and decorrelating spatial dependences among the features. Our decorrelation procedure relies on a gradient-based algorithm that is more numerically stable than standard decomposition-based procedures, and we efficiently search for candidate detections by computing FFT-based template convolutions. Due to the speed of our template synthesis procedure, we are able to perform joint optimization of scale, translation, continuous rotation, and focal length using Metropolis-Hastings algorithm. We provide an efficient GPU implementation of our algorithm, and we validate its performance on 3D Object Classes and PASCAL3D+ datasets.
引用
收藏
页码:2512 / 2520
页数:9
相关论文
共 50 条
  • [1] A Combined 2D-3D Object Detection Framework
    Amara, Kahina
    Djekoune, Oualid
    Achour, Nouara
    Belhocine, Mahmoud
    Bellal, Rima Narimene
    IETE JOURNAL OF RESEARCH, 2017, 63 (05) : 607 - 615
  • [2] A robust technique for 2D-3D registration
    Gong, Ren Hui
    Abolinaesumi, Purang
    Stewart, James
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 714 - 717
  • [3] Standardized evaluation of 2D-3D registration
    van de Kraats, EB
    Penney, GP
    Tomazevic, D
    van Walsum, T
    Niessen, WJ
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2004, PT 1, PROCEEDINGS, 2004, 3216 : 574 - 581
  • [4] Efficient framework for deformable 2D-3D registration
    Fluck, Oliver
    Aharon, Shmuel
    Khamene, Ali
    MEDICAL IMAGING 2008: VISUALIZATION, IMAGE-GUIDED PROCEDURES, AND MODELING, PTS 1 AND 2, 2008, 6918
  • [5] Increasing the Automation of a 2D-3D Registration System
    Varnavas, Andreas
    Carrell, Tom
    Penney, Graeme
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2013, 32 (02) : 387 - 399
  • [6] 2D-3D Registration using Intensity Gradients
    Duraisamy, Prakash
    Belkhouche, Yassine
    Jackson, Stephen
    Namuduri, Kamesh
    Buckles, Bill
    SIGNAL AND DATA PROCESSING OF SMALL TARGETS 2011, 2011, 8137
  • [7] Fully automated 2D-3D registration and verification
    Varnavas, Andreas
    Carrell, Tom
    Penney, Graeme
    MEDICAL IMAGE ANALYSIS, 2015, 26 (01) : 108 - 119
  • [8] 2D-3D registration based on shape matching
    Cyr, CM
    Kamal, AF
    Sebastian, TB
    Kimia, BB
    IEEE WORKSHOP ON MATHEMATICAL METHODS IN BIOMEDICAL IMAGE ANALYSIS, PROCEEDINGS, 2000, : 198 - 203
  • [9] TemporalNet: Real-time 2D-3D Video Object Detection
    Chen, Meihong
    Lang, Jochen
    2022 19TH CONFERENCE ON ROBOTS AND VISION (CRV 2022), 2022, : 205 - 212
  • [10] An Open Platform for 2D-3D Image Registration Experiments
    Balter, J.
    Long, Y.
    Folkerts, M.
    Sharp, G.
    Bortfeld, T.
    Fessler, J.
    MEDICAL PHYSICS, 2011, 38 (06)