Semantic Scene Completion from a Single Depth Image

被引:672
|
作者
Song, Shuran [1 ]
Yu, Fisher [1 ]
Zeng, Andy [1 ]
Chang, Angel X. [1 ]
Savva, Manolis [1 ]
Funkhouser, Thomas [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2017.28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on semantic scene completion, a task for producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of these two tasks, we introduce the semantic scene completion network (SSCNet), an end-to-end 3D convolutional network that takes a single depth image as input and simultaneously outputs occupancy and semantic labels for all voxels in the camera view frustum. Our network uses a dilation-based 3D context module to efficiently expand the receptive field and enable 3D context learning. To train our network, we construct SUNCG - a manually created large-scale dataset of synthetic 3D scenes with dense volumetric annotations. Our experiments demonstrate that the joint model outperforms methods addressing each task in isolation and outperforms alternative approaches on the semantic scene completion task. The dataset and code is available at http://sscnet.cs.princeton.edu.
引用
收藏
页码:190 / 198
页数:9
相关论文
共 50 条
  • [41] Point Cloud Semantic Scene Completion from RGB-D Images
    Zhang, Shoulong
    Li, Shuai
    Hao, Aimin
    Qin, Hong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3385 - 3393
  • [42] Semantic attention and relative scene depth-guided network for underwater image enhancement
    Chen, Tingkai
    Wang, Ning
    Chen, Yanzheng
    Kong, Xiangjun
    Lin, Yejin
    Zhao, Hong
    Karimi, Hamid Reza
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [43] 3D Semantic Scene Completion: A Survey
    Luis Roldão
    Raoul de Charette
    Anne Verroust-Blondet
    International Journal of Computer Vision, 2022, 130 : 1978 - 2005
  • [44] FFNet: Frequency Fusion Network for Semantic Scene Completion
    Wang, Xuzhi
    Lin, Di
    Wan, Liang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2550 - 2557
  • [45] Depth estimation from a single RGB image using target foreground and background scene variations
    Alphonse, P. J. A.
    Sriharsha, K., V
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 94
  • [46] 3D Semantic Scene Completion: A Survey
    Roldao, Luis
    de Charette, Raoul
    Verroust-Blondet, Anne
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (08) : 1978 - 2005
  • [47] AUTOMATIC 3-D DEPTH RECOVERY FROM A SINGLE URBAN-SCENE IMAGE
    Tseng, Chen-yu
    Wang, Sheng-Jyh
    2012 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2012,
  • [48] Robust depth completion based on Semantic Aggregation
    Zhichao Fu
    Xin Li
    Tianyu Huai
    Weijie Li
    Daoguo Dong
    Liang He
    Applied Intelligence, 2024, 54 : 3825 - 3840
  • [49] Robust depth completion based on Semantic Aggregation
    Fu, Zhichao
    Li, Xin
    Huai, Tianyu
    Li, Weijie
    Dong, Daoguo
    He, Liang
    APPLIED INTELLIGENCE, 2024, 54 (05) : 3825 - 3840
  • [50] Scene change detection: semantic and depth information
    Jianjun Li
    Peiqi Tang
    Yong Wu
    Mian Pan
    Zheng Tang
    Guobao Hui
    Multimedia Tools and Applications, 2022, 81 : 19301 - 19319