Semantic Scene Completion from a Single Depth Image

被引:672
|
作者
Song, Shuran [1 ]
Yu, Fisher [1 ]
Zeng, Andy [1 ]
Chang, Angel X. [1 ]
Savva, Manolis [1 ]
Funkhouser, Thomas [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2017.28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on semantic scene completion, a task for producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of these two tasks, we introduce the semantic scene completion network (SSCNet), an end-to-end 3D convolutional network that takes a single depth image as input and simultaneously outputs occupancy and semantic labels for all voxels in the camera view frustum. Our network uses a dilation-based 3D context module to efficiently expand the receptive field and enable 3D context learning. To train our network, we construct SUNCG - a manually created large-scale dataset of synthetic 3D scenes with dense volumetric annotations. Our experiments demonstrate that the joint model outperforms methods addressing each task in isolation and outperforms alternative approaches on the semantic scene completion task. The dataset and code is available at http://sscnet.cs.princeton.edu.
引用
收藏
页码:190 / 198
页数:9
相关论文
共 50 条
  • [1] Semantic scene completion with dense CRF from a single depth image
    Zhang, Liang
    Wang, Le
    Zhang, Xiangdong
    Shen, Peiyi
    Bennamoun, Mohammed
    Zhu, Guangming
    Shah, Syed Afaq Ali
    Song, Juan
    NEUROCOMPUTING, 2018, 318 : 182 - 195
  • [2] View-Volume Network for Semantic Scene Completion from a Single Depth Image
    Guo, Yuxiao
    Tong, Xin
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 726 - 732
  • [3] Semantic Scene Completion from a Single 360-Degree Image and Depth Map
    Dourado, Aloisio
    Kim, Hansung
    de Campos, Teofilo E.
    Hilton, Adrian
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 36 - 46
  • [4] Adversarial Semantic Scene Completion from a Single Depth mage
    Wang, Yida
    Tan, David Joseph
    Navab, Nassir
    Tombari, Federico
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 426 - 434
  • [5] 3D SEMANTIC SCENE COMPLETION FROM A SINGLE DEPTH IMAGE USING ADVERSARIAL TRAINING
    Chen, Yueh-Tung
    Garbade, Martin
    Gall, Juergen
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1835 - 1839
  • [6] In Depth Bayesian Semantic Scene Completion
    Gillsjo, David
    Astrom, Kalle
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6335 - 6342
  • [7] EdgeNet: Semantic Scene Completion from a Single RGB-D Image
    Dourado, Aloisio
    De Campos, Teofilo E.
    Kim, Hansung
    Hilton, Adrian
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 503 - 510
  • [8] ForkNet: Multi-branch Volumetric Semantic Completion from a Single Depth Image
    Wang, Yida
    Tan, David Joseph
    Navab, Nassir
    Tombari, Federico
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8607 - 8616
  • [9] Scene Intrinsics and Depth from a Single Image
    Shelhamer, Evan
    Barron, Jonathan T.
    Darrell, Trevor
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 235 - 242
  • [10] Combining Semantic Scene Priors and Haze Removal for Single Image Depth Estimation
    Wang, Ke
    Dunn, Enrique
    Tighe, Joseph
    Frahm, Jan-Michael
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 800 - 807