Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

Cited by: 441
Authors
Su, Hao [1]
Qi, Charles R. [1]
Li, Yangyan [1]
Guibas, Leonidas J. [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
Funding
National Science Foundation (USA)
Keywords
POSE
DOI
10.1109/ICCV.2015.308
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object viewpoint estimation from 2D images is an essential task in computer vision. However, two issues hinder its progress: scarcity of training data with viewpoint annotations, and a lack of powerful features. Inspired by the growing availability of 3D models, we propose a framework to address both issues by combining render-based image synthesis and CNNs (Convolutional Neural Networks). We believe that 3D models have the potential in generating a large number of images of high variation, which can be well exploited by deep CNN with a high learning capacity. Towards this goal, we propose a scalable and overfit-resistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task. Experimentally, we show that the viewpoint estimation from our pipeline can significantly outperform state-of-the-art methods on PASCAL 3D+ benchmark.
Pages: 2686-2694
Page count: 9
Related papers
50 in total
  • [1] Learning camera viewpoint using CNN to improve 3D body pose estimation
    Ghezelghieh, Mona Fathollahi
    Kasturi, Rangachar
    Sarkar, Sudeep
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 685 - 693
  • [2] Dual Viewpoint Passenger State Classification Using 3D CNNs
    Tu, Ian
    Bhalerao, Abhir
    Griffiths, Nathan
    Delgado, Mauricio Munoz
    Thomason, Alasdair
    Popham, Thomas
    Mouzakitis, Alex
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 2163 - 2169
  • [3] Force estimation from OCT volumes using 3D CNNs
    Nils Gessert
    Jens Beringhoff
    Christoph Otte
    Alexander Schlaefer
    International Journal of Computer Assisted Radiology and Surgery, 2018, 13 : 1073 - 1082
  • [4] 3D object tracking by using virtual viewpoint images
    Seimitsu Kogaku Kaishi, 12 (1194-1199):
  • [5] Force estimation from OCT volumes using 3D CNNs
    Gessert, Nils
    Beringhoff, Jens
    Otte, Christoph
    Schlaefer, Alexander
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2018, 13 (07) : 1073 - 1082
  • [6] Joint Denoising of Stereo Images Using 3D CNN
    Khamassi, Malek
    Kaaniche, Mounir
    Benazza-Benyahia, Amel
    2020 10TH INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC), 2021,
  • [7] Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs
    Ge, Liuhao
    Liang, Hui
    Yuan, Junsong
    Thalmann, Daniel
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3593 - 3601
  • [8] 3D CG Integral Photography Artwork Using Glittering Effects in the Post-processing of Multi-viewpoint Rendered Images
    Maki, Nahomi
    Yanaka, Kazuhisa
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: INFORMATION AND KNOWLEDGE IN APPLICATIONS AND SERVICES, PT II, 2014, 8522 : 546 - 554
  • [9] Age Estimation in Living Adults using 3D Volume Rendered CT Images of the Sternal Plastron and Lower Chest
    Oldrini, Guillaume
    Harter, Valentin
    Witte, Yannick
    Martrille, Laurent
    Blum, Alain
    JOURNAL OF FORENSIC SCIENCES, 2016, 61 (01) : 127 - 133
  • [10] 3D model estimation from multiple images
    Rövid, A
    Várkonyi-Kóczy, AR
    Várlaki, M
    2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 1661 - 1666