Indoor Scene Understanding with Geometric and Semantic Contexts

被引:31
|
作者
Choi, Wongun [1 ]
Chao, Yu-Wei [2 ]
Pantofaru, Caroline [3 ]
Savarese, Silvio [4 ]
机构
[1] NEC Labs Amer, Cupertino, CA 95014 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
[3] Google Inc, Mountain View, CA USA
[4] Stanford Univ, Stanford, CA 94305 USA
关键词
Scene understanding; Scene parsing; Object recognition; 3D layout;
D O I
10.1007/s11263-014-0779-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.
引用
收藏
页码:204 / 220
页数:17
相关论文
共 50 条
  • [21] Exploiting context for semantic scene content understanding
    Luo, Jiebo
    ICIS '06: INTERNATIONAL CONGRESS OF IMAGING SCIENCE, FINAL PROGRAM AND PROCEEDINGS: LINKING THE EXPLOSION OF IMAGING APPLICATIONS WITH THE SCIENCE AND TECHNOLOGY OF IMAGING, 2006, : 479 - 479
  • [22] Open Vocabulary Semantic Scene Sketch Understanding
    Bourouis, Ahmed
    Fan, Judith E.
    Gryaditskaya, Yulia
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 4176 - 4186
  • [23] Semantic Foggy Scene Understanding with Synthetic Data
    Christos Sakaridis
    Dengxin Dai
    Luc Van Gool
    International Journal of Computer Vision, 2018, 126 : 973 - 992
  • [24] Cost-Efficient Image Semantic Segmentation for Indoor Scene Understanding Using Weakly Supervised Learning and BIM
    Yang, Liu
    Cai, Hubo
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2023, 37 (02)
  • [25] Semantic Geometric Modelling of Unstructured Indoor Point Cloud
    Shi, Wenzhong
    Ahmed, Wael
    Li, Na
    Fan, Wenzheng
    Xiang, Haodong
    Wang, Muyang
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (01)
  • [26] Semantic embedding for indoor scene recognition by weighted hypergraph learning
    Yu, Jun
    Hong, Chaoqun
    Tao, Dapeng
    Wang, Meng
    SIGNAL PROCESSING, 2015, 112 : 129 - 136
  • [27] LEARNABLE CONTEXTUAL REGULARIZATION FOR SEMANTIC SEGMENTATION OF INDOOR SCENE IMAGES
    Chu, Jun
    Xiao, Xu
    Meng, Gaofeng
    Wang, Lingfeng
    Pan, Chunhong
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1267 - 1271
  • [28] SRRM: Semantic Region Relation Model for Indoor Scene Recognition
    Song, Chuanxin
    Ma, Xin
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [29] GeoSynth: A Photorealistic Synthetic Indoor Dataset for Scene Understanding
    Pugh, Brian
    Chernak, Davin
    Jiddi, Salma
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, 29 (05) : 2586 - 2595
  • [30] SceneNet: an Annotated Model Generator for Indoor Scene Understanding
    Handa, Ankur
    Patraucean, Viorica
    Stent, Simon
    Cipolla, Roberto
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 5737 - 5743