Semantic Scene Completion from a Single Depth Image

被引：672

作者：

Song, Shuran ^{[1
]}

Yu, Fisher ^{[1
]}

Zeng, Andy ^{[1
]}

Chang, Angel X. ^{[1
]}

Savva, Manolis ^{[1
]}

Funkhouser, Thomas ^{[1
]}

机构：

[1] Princeton Univ, Princeton, NJ 08544 USA

来源：

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/CVPR.2017.28

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper focuses on semantic scene completion, a task for producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of these two tasks, we introduce the semantic scene completion network (SSCNet), an end-to-end 3D convolutional network that takes a single depth image as input and simultaneously outputs occupancy and semantic labels for all voxels in the camera view frustum. Our network uses a dilation-based 3D context module to efficiently expand the receptive field and enable 3D context learning. To train our network, we construct SUNCG - a manually created large-scale dataset of synthetic 3D scenes with dense volumetric annotations. Our experiments demonstrate that the joint model outperforms methods addressing each task in isolation and outperforms alternative approaches on the semantic scene completion task. The dataset and code is available at http://sscnet.cs.princeton.edu.

引用

页码：190 / 198

页数：9

共 50 条

[1] Semantic scene completion with dense CRF from a single depth image
Zhang, Liang
Wang, Le
Zhang, Xiangdong
Shen, Peiyi
Bennamoun, Mohammed
Zhu, Guangming
Shah, Syed Afaq Ali
Song, Juan
NEUROCOMPUTING, 2018, 318 : 182 - 195
[2] View-Volume Network for Semantic Scene Completion from a Single Depth Image
Guo, Yuxiao
Tong, Xin
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 726 - 732
[3] Semantic Scene Completion from a Single 360-Degree Image and Depth Map
Dourado, Aloisio
Kim, Hansung
de Campos, Teofilo E.
Hilton, Adrian
PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 36 - 46
[4] Adversarial Semantic Scene Completion from a Single Depth mage
Wang, Yida
Tan, David Joseph
Navab, Nassir
Tombari, Federico
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 426 - 434
[5] 3D SEMANTIC SCENE COMPLETION FROM A SINGLE DEPTH IMAGE USING ADVERSARIAL TRAINING
Chen, Yueh-Tung
Garbade, Martin
Gall, Juergen
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1835 - 1839
[6] In Depth Bayesian Semantic Scene Completion
Gillsjo, David
Astrom, Kalle
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6335 - 6342
[7] EdgeNet: Semantic Scene Completion from a Single RGB-D Image
Dourado, Aloisio
De Campos, Teofilo E.
Kim, Hansung
Hilton, Adrian
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 503 - 510
[8] ForkNet: Multi-branch Volumetric Semantic Completion from a Single Depth Image
Wang, Yida
Tan, David Joseph
Navab, Nassir
Tombari, Federico
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8607 - 8616
[9] Scene Intrinsics and Depth from a Single Image
Shelhamer, Evan
Barron, Jonathan T.
Darrell, Trevor
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 235 - 242
[10] Combining Semantic Scene Priors and Haze Removal for Single Image Depth Estimation
Wang, Ke
Dunn, Enrique
Tighe, Joseph
Frahm, Jan-Michael
2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 800 - 807

← 1 2 3 4 5 →