Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views

Cited by: 441

Authors:

Su, Hao [1]

Qi, Charles R. [1]

Li, Yangyan [1]

Guibas, Leonidas J. [1]

Affiliation:

[1] Stanford Univ, Stanford, CA 94305 USA

Source:

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015

Funding:

National Science Foundation (USA);

Keywords:

POSE;

DOI:

10.1109/ICCV.2015.308

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Object viewpoint estimation from 2D images is an essential task in computer vision. However, two issues hinder its progress: scarcity of training data with viewpoint annotations, and a lack of powerful features. Inspired by the growing availability of 3D models, we propose a framework to address both issues by combining render-based image synthesis and CNNs (Convolutional Neural Networks). We believe that 3D models have the potential in generating a large number of images of high variation, which can be well exploited by deep CNN with a high learning capacity. Towards this goal, we propose a scalable and overfit-resistant image synthesis pipeline, together with a novel CNN specifically tailored for the viewpoint estimation task. Experimentally, we show that the viewpoint estimation from our pipeline can significantly outperform state-of-the-art methods on PASCAL 3D+ benchmark.

Pages: 2686-2694

Page count: 9