Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

被引:441
|
作者
Su, Hao [1 ]
Qi, Charles R. [1 ]
Li, Yangyan [1 ]
Guibas, Leonidas J. [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
POSE;
D O I
10.1109/ICCV.2015.308
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object viewpoint estimation from 2D images is an essential task in computer vision. However, two issues hinder its progress: scarcity of training data with viewpoint annotations, and a lack of powerful features. Inspired by the growing availability of 3D models, we propose a framework to address both issues by combining render-based image synthesis and CNNs (Convolutional Neural Networks). We believe that 3D models have the potential in generating a large number of images of high variation, which can be well exploited by deep CNN with a high learning capacity. Towards this goal, we propose a scalable and overfitresistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task. Experimentally, we show that the viewpoint estimation from our pipeline can significantly outperform state-of-the-art methods on PASCAL 3D+ benchmark.
引用
收藏
页码:2686 / 2694
页数:9
相关论文
共 50 条
  • [21] Refined Attitude Estimation of Ships in Photographs via Matching Images Rendered from 3D Models
    Wang, Hongxiang
    Xu, Xiaojian
    2017 SENSOR SIGNAL PROCESSING FOR DEFENCE CONFERENCE (SSPD), 2017, : 119 - 123
  • [22] Dynamic 3D shape from multi-viewpoint images using deformable mesh model
    Nobuhara, S
    Matsuyama, T
    ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 192 - 197
  • [23] Mathematical PSNR prediction model between compressed normal maps and rendered 3D images
    Yamasaki, T
    Hayase, K
    Aizawa, K
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2005, PT 2, 2005, 3768 : 584 - 594
  • [24] Human Pose Estimation in Space and Time Using 3D CNN
    Grinciunaite, Agne
    Gudi, Amogh
    Tasli, Emrah
    den Uyl, Marten
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 32 - 39
  • [25] 3D SURFACE RENDERED MR IMAGES OF THE BRAIN AND ITS VASCULATURE
    CLINE, HE
    LORENSEN, WE
    SOUZA, SP
    JOLESZ, FA
    KIKINIS, R
    GERIG, G
    KENNEDY, TE
    JOURNAL OF COMPUTER ASSISTED TOMOGRAPHY, 1991, 15 (02) : 344 - 351
  • [26] Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data
    Danielczuk, Michael
    Matl, Matthew
    Gupta, Saurabh
    Li, Andrew
    Lee, Andrew
    Mahler, Jeffrey
    Goldberg, Ken
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 7283 - 7290
  • [27] Estimating relative diffusion from 3D micro-CT images using CNNs
    Gaerttner, Stephan
    Frank, Florian
    Woller, Fabian
    Meier, Andreas
    Ray, Nadja
    ARTIFICIAL INTELLIGENCE IN GEOSCIENCES, 2023, 4 : 199 - 208
  • [28] Automatic Detection of Alzheimer Disease from 3D MRI Images using Deep CNNs
    Negied, Nermin
    SeragEldin, Ahmed
    International Journal of Advanced Computer Science and Applications, 2022, 13 (12): : 477 - 482
  • [29] Self-supervised 3D Shape and Viewpoint Estimation from Single Images for Robotics
    Mees, Oier
    Tatarchenko, Maxim
    Brox, Thomas
    Burgard, Wolfram
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6083 - 6089
  • [30] Lambs' live weight estimation using 3D images
    Samperio, E.
    Lidon, I
    Rebollar, R.
    Castejon-Limas, M.
    Alvarez-Aparicio, C.
    ANIMAL, 2021, 15 (05)