Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

Cited by: 441

Authors:

Su, Hao [1]

Qi, Charles R. [1]

Li, Yangyan [1]

Guibas, Leonidas J. [1]

Affiliation:

[1] Stanford Univ, Stanford, CA 94305 USA

Source:

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015

Funding:

National Science Foundation (USA);

Keywords:

POSE;

DOI:

10.1109/ICCV.2015.308

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Object viewpoint estimation from 2D images is an essential task in computer vision. However, two issues hinder its progress: scarcity of training data with viewpoint annotations, and a lack of powerful features. Inspired by the growing availability of 3D models, we propose a framework to address both issues by combining render-based image synthesis and CNNs (Convolutional Neural Networks). We believe that 3D models have the potential to generate a large number of images of high variation, which can be well exploited by deep CNNs with high learning capacity. Towards this goal, we propose a scalable and overfit-resistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task. Experimentally, we show that viewpoint estimation from our pipeline can significantly outperform state-of-the-art methods on the PASCAL 3D+ benchmark.


Pages: 2686-2694

Number of pages: 9
