Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

Cited by: 441
Authors
Su, Hao [1]
Qi, Charles R. [1]
Li, Yangyan [1]
Guibas, Leonidas J. [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
Funding
National Science Foundation (USA);
Keywords
POSE;
DOI
10.1109/ICCV.2015.308
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Object viewpoint estimation from 2D images is an essential task in computer vision. However, two issues hinder its progress: scarcity of training data with viewpoint annotations, and a lack of powerful features. Inspired by the growing availability of 3D models, we propose a framework to address both issues by combining render-based image synthesis and CNNs (Convolutional Neural Networks). We believe that 3D models have the potential to generate a large number of images of high variation, which can be well exploited by deep CNNs with high learning capacity. Towards this goal, we propose a scalable and overfit-resistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task. Experimentally, we show that viewpoint estimation from our pipeline can significantly outperform state-of-the-art methods on the PASCAL 3D+ benchmark.
Pages: 2686-2694
Page count: 9
Related papers
50 in total
  • [41] Multi-scale CNNs for 3D model retrieval
    Weizhi Nie
    Shu Xiang
    Anan Liu
    Multimedia Tools and Applications, 2018, 77 : 22953 - 22963
  • [42] Multi-scale CNNs for 3D model retrieval
    Nie, Weizhi
    Xiang, Shu
    Liu, Anan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 22953 - 22963
  • [43] Classification of Hyperspectral Images Using 3D CNN Based ResNet50
    Firat, Huseyin
    Hanbay, Davut
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [44] Information Fusion based Quality Enhancement for 3D Stereo Images Using CNN
    Jin, Zhi
    Luo, Haili
    Luo, Lei
    Zou, Wenbin
    Li, Xia
    Steinbach, Eckehard
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1447 - 1451
  • [45] Direct 3D Detection of Vehicles in Monocular Images with a CNN based 3D Decoder
    Weber, Michael
    Fuerst, Michael
    Zoellner, J. Marius
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 417 - 423
  • [46] Comparison of 3D CNN based deep learning architectures using hyperspectral images
    Firat, Huseyin
    Hanbay, Davut
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2023, 38 (01) : 521 - 534
  • [47] 3D model generating method based on multi-views deblurring images
    Shi, Yu
    Hong, Hanyu
    Song, Jie
    Hua, Xia
    2015 INTERNATIONAL CONFERENCE ON OPTOELECTRONICS AND MICROELECTRONICS (ICOM), 2015, : 170 - 173
  • [48] Inpainting of Ring Artifacts on Microtomographic Images by 3D CNN
    Kornilov, Anton
    Safonov, Ilia
    Yakimchuk, Ivan
    PROCEEDINGS OF THE 26TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2020, : 200 - 206
  • [49] DAFT: A universal module to interweave tabular data and 3D images in CNNs
    Wolf, Tom Nuno
    Poelsterl, Sebastian
    Wachinger, Christian
    NEUROIMAGE, 2022, 260
  • [50] Weight Estimation of Broilers in Images Using 3D Prior Knowledge
    Jorgensen, Anders
    Dueholm, Jacob, V
    Fagertun, Jens
    Moeslund, Thomas B.
    IMAGE ANALYSIS, 2019, 11482 : 221 - 232