Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

被引：441

作者：

Su, Hao ^{[1
]}

Qi, Charles R. ^{[1
]}

Li, Yangyan ^{[1
]}

Guibas, Leonidas J. ^{[1
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

来源：

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015年

基金：

美国国家科学基金会;

关键词：

POSE;

D O I：

10.1109/ICCV.2015.308

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object viewpoint estimation from 2D images is an essential task in computer vision. However, two issues hinder its progress: scarcity of training data with viewpoint annotations, and a lack of powerful features. Inspired by the growing availability of 3D models, we propose a framework to address both issues by combining render-based image synthesis and CNNs (Convolutional Neural Networks). We believe that 3D models have the potential in generating a large number of images of high variation, which can be well exploited by deep CNN with a high learning capacity. Towards this goal, we propose a scalable and overfitresistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task. Experimentally, we show that the viewpoint estimation from our pipeline can significantly outperform state-of-the-art methods on PASCAL 3D+ benchmark.

引用

页码：2686 / 2694

页数：9

共 50 条

[21] Refined Attitude Estimation of Ships in Photographs via Matching Images Rendered from 3D Models
Wang, Hongxiang
Xu, Xiaojian
2017 SENSOR SIGNAL PROCESSING FOR DEFENCE CONFERENCE (SSPD), 2017, : 119 - 123
[22] Dynamic 3D shape from multi-viewpoint images using deformable mesh model
Nobuhara, S
Matsuyama, T
ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 192 - 197
[23] Mathematical PSNR prediction model between compressed normal maps and rendered 3D images
Yamasaki, T
Hayase, K
Aizawa, K
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2005, PT 2, 2005, 3768 : 584 - 594
[24] Human Pose Estimation in Space and Time Using 3D CNN
Grinciunaite, Agne
Gudi, Amogh
Tasli, Emrah
den Uyl, Marten
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 32 - 39
[25] 3D SURFACE RENDERED MR IMAGES OF THE BRAIN AND ITS VASCULATURE
CLINE, HE
LORENSEN, WE
SOUZA, SP
JOLESZ, FA
KIKINIS, R
GERIG, G
KENNEDY, TE
JOURNAL OF COMPUTER ASSISTED TOMOGRAPHY, 1991, 15 (02) : 344 - 351
[26] Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data
Danielczuk, Michael
Matl, Matthew
Gupta, Saurabh
Li, Andrew
Lee, Andrew
Mahler, Jeffrey
Goldberg, Ken
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 7283 - 7290
[27] Estimating relative diffusion from 3D micro-CT images using CNNs
Gaerttner, Stephan
Frank, Florian
Woller, Fabian
Meier, Andreas
Ray, Nadja
ARTIFICIAL INTELLIGENCE IN GEOSCIENCES, 2023, 4 : 199 - 208
[28] Automatic Detection of Alzheimer Disease from 3D MRI Images using Deep CNNs
Negied, Nermin
SeragEldin, Ahmed
International Journal of Advanced Computer Science and Applications, 2022, 13 (12): : 477 - 482
[29] Self-supervised 3D Shape and Viewpoint Estimation from Single Images for Robotics
Mees, Oier
Tatarchenko, Maxim
Brox, Thomas
Burgard, Wolfram
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 6083 - 6089
[30] Lambs' live weight estimation using 3D images
Samperio, E.
Lidon, I
Rebollar, R.
Castejon-Limas, M.
Alvarez-Aparicio, C.
ANIMAL, 2021, 15 (05)

← 1 2 3 4 5 →