Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

被引:441
|
作者
Su, Hao [1 ]
Qi, Charles R. [1 ]
Li, Yangyan [1 ]
Guibas, Leonidas J. [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
POSE;
D O I
10.1109/ICCV.2015.308
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object viewpoint estimation from 2D images is an essential task in computer vision. However, two issues hinder its progress: scarcity of training data with viewpoint annotations, and a lack of powerful features. Inspired by the growing availability of 3D models, we propose a framework to address both issues by combining render-based image synthesis and CNNs (Convolutional Neural Networks). We believe that 3D models have the potential in generating a large number of images of high variation, which can be well exploited by deep CNN with a high learning capacity. Towards this goal, we propose a scalable and overfitresistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task. Experimentally, we show that the viewpoint estimation from our pipeline can significantly outperform state-of-the-art methods on PASCAL 3D+ benchmark.
引用
收藏
页码:2686 / 2694
页数:9
相关论文
共 50 条
  • [31] Automatic Detection of Alzheimer Disease from 3D MRI Images using Deep CNNs
    Negied, Nermin
    SeragEldin, Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 477 - 482
  • [32] 3D Model Reconstruction from Multi-views of 2D Images using Radon Transform
    Sobani, Siti Syazalina Mohd.
    Mahmood, Nasrul Humaimi
    Zakaria, Nor Aini
    Ariffin, Ismail
    JURNAL TEKNOLOGI, 2015, 74 (06): : 21 - 26
  • [33] 3D Object Detection and 6D Pose Estimation Using RGB-D Images and Mask R-CNN
    Tran, Van Luan
    Lin, Huei-Yung
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [34] Model indexing and object recognition using 3D viewpoint invariance
    Umasuthan, M
    Wallace, AM
    PATTERN RECOGNITION, 1997, 30 (09) : 1415 - 1434
  • [35] 3D CNN classification model for accurate diagnosis of coronavirus disease 2019 using computed tomography images
    Li, Yifan
    Pei, Xuan
    Guo, Yandong
    JOURNAL OF MEDICAL IMAGING, 2021, 8
  • [36] 3D model search and pose estimation from single images using VIP features
    Wu, Changchang
    Fraundorfer, Friedrich
    Frahm, Jan-Michael
    Pollefeys, Marc
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 659 - +
  • [37] Affine correspondence based head pose estimation for a sequence of images by using a 3D model
    Liang, GY
    Zha, HB
    Liu, H
    SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, : 632 - 637
  • [38] 3D CNN HAND POSE ESTIMATION WITH END-TO-END HIERARCHICAL MODEL AND PHYSICAL CONSTRAINTS FROM DEPTH IMAGES
    Xu, Z. Z.
    Zhang, W. J.
    NEURAL NETWORK WORLD, 2023, 33 (01) : 35 - 48
  • [39] Mathematical model for 3D object reconstruction using OccNet (CNN)
    Shruthiba, A.
    Deepu, R.
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2022, 25 (07) : 1961 - 1970
  • [40] Consistent 3D Background Model Estimation from Multi-Viewpoint Videos
    Tsekourakis, Iraklis
    Mordohai, Philippos
    2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 144 - 152