Indoor Scene Understanding with Geometric and Semantic Contexts

被引:31
|
作者
Choi, Wongun [1 ]
Chao, Yu-Wei [2 ]
Pantofaru, Caroline [3 ]
Savarese, Silvio [4 ]
机构
[1] NEC Labs Amer, Cupertino, CA 95014 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
[3] Google Inc, Mountain View, CA USA
[4] Stanford Univ, Stanford, CA 94305 USA
关键词
Scene understanding; Scene parsing; Object recognition; 3D layout;
D O I
10.1007/s11263-014-0779-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.
引用
收藏
页码:204 / 220
页数:17
相关论文
共 50 条
  • [11] DRG-SLAM: A Semantic RGB-D SLAM using Geometric Features for Indoor Dynamic Scene
    Wang, Yanan
    Xu, Kun
    Tian, Yaobin
    Ding, Xilun
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 1352 - 1359
  • [12] Indoor Scene Recognition via Probabilistic Semantic Map
    Li, Kun
    Meng, Max Q-H
    2012 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS (ICAL), 2012, : 352 - 357
  • [13] Semantic Segmentation and Reconstruction of Indoor Scene Point Clouds
    Hao, Wen
    Wei, Hainan
    Wang, Yang
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2024, 24 (03) : 3 - 12
  • [14] GEOMETRIC INVARIANTS CONSTRUCTION FOR SEMANTIC SCENE UNDERSTANDING FROM MULTIPLE VIEWS INSPIRED BY THE HUMAN VISUAL SYSTEM
    Fan, N.
    Jin, Cheng
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2012, 12 (02)
  • [15] INFERENCE WITH MODEL UNCERTAINTY ON INDOOR SCENE FOR SEMANTIC SEGMENTATION
    Isobe, Shuya
    Arai, Shuichi
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1170 - 1174
  • [16] Support surfaces prediction for indoor scene understanding
    Guo, Ruiqi
    Hoiem, Derek
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2144 - 2151
  • [17] Geometric and Semantic Improvement for Unbiased Scene Graph Generation
    Zhang, Ruhui
    Xu, Pengcheng
    Kang, Kang
    Yang, You
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2023, 17 (10): : 2643 - 2657
  • [18] Adopting Abstract Images for Semantic Scene Understanding
    Zitnick, C. Lawrence
    Vedantam, Ramakrishna
    Parikh, Devi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (04) : 627 - 638
  • [19] Semantic Foggy Scene Understanding with Synthetic Data
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (09) : 973 - 992
  • [20] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223