Indoor Scene Understanding with Geometric and Semantic Contexts

被引:31
|
作者
Choi, Wongun [1 ]
Chao, Yu-Wei [2 ]
Pantofaru, Caroline [3 ]
Savarese, Silvio [4 ]
机构
[1] NEC Labs Amer, Cupertino, CA 95014 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
[3] Google Inc, Mountain View, CA USA
[4] Stanford Univ, Stanford, CA 94305 USA
关键词
Scene understanding; Scene parsing; Object recognition; 3D layout;
D O I
10.1007/s11263-014-0779-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.
引用
收藏
页码:204 / 220
页数:17
相关论文
共 50 条
  • [41] Semantic map construction for indoor home environment based on adjustable scene semantic annotation scope
    Zhang S.
    He Z.
    Zha F.
    Hou Z.
    Ma Y.
    Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2024, 32 (04): : 371 - 378
  • [42] Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
    Seichter, Daniel
    Koehler, Mona
    Lewandowski, Benjamin
    Wengefeld, Tim
    Gross, Horst-Michael
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13525 - 13531
  • [43] Semantic Scene Segmentation for Indoor Robot Navigation via Deep Learning
    Yeboah, Yao
    Cai Yanguang
    Wei Wu
    Farisi, Zeyad
    PROCEEDINGS OF ICRCA 2018: 2018 THE 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION / ICRMV 2018: 2018 THE 3RD INTERNATIONAL CONFERENCE ON ROBOTICS AND MACHINE VISION, 2018, : 112 - 118
  • [44] Semantic scene analysis of scanned 3D indoor environments
    Nüchter, A
    Surmann, H
    Lingemann, K
    Hertzberg, J
    VISION, MODELING, AND VISUALIZATION 2003, 2003, : 215 - +
  • [45] Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts
    Hou, Ji
    Graham, Benjamin
    Niesner, Matthias
    Xie, Saining
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15582 - 15592
  • [46] Facilitating and Exploring Planar Homogeneous Texture for Indoor Scene Understanding
    Ahmad, Shahzor
    Cheong, Loong-Fah
    COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 : 35 - 51
  • [47] A Search-Classify Approach for Cluttered Indoor Scene Understanding
    Nan, Liangliang
    Xie, Ke
    Sharf, Andrei
    ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (06):
  • [48] Learning depth-aware features for indoor scene understanding
    Suting Chen
    Dongwei Shao
    Liangchen Zhang
    Chuang Zhang
    Multimedia Tools and Applications, 2022, 81 : 42573 - 42590
  • [49] Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
    Roberts, Mike
    Ramapuram, Jason
    Ranjan, Anurag
    Kumar, Atulit
    Bautista, Miguel Angel
    Paczan, Nathan
    Webb, Russ
    Susskind, Joshua M.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10892 - 10902
  • [50] Discriminative Learning with Latent Variables for Cluttered Indoor Scene Understanding
    Wang, Huayan
    Gould, Stephen
    Koller, Daphne
    COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 435 - +