Indoor Scene Understanding with Geometric and Semantic Contexts

Cited by: 31
Authors
Choi, Wongun [1]
Chao, Yu-Wei [2]
Pantofaru, Caroline [3]
Savarese, Silvio [4]
Affiliations
[1] NEC Labs Amer, Cupertino, CA 95014 USA
[2] Univ Michigan, Ann Arbor, MI 48109 USA
[3] Google Inc, Mountain View, CA USA
[4] Stanford Univ, Stanford, CA 94305 USA
Keywords
Scene understanding; Scene parsing; Object recognition; 3D layout
DOI
10.1007/s11263-014-0779-4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Truly understanding a scene involves integrating information at multiple levels as well as studying the interactions between scene elements. Individual object detectors, layout estimators and scene classifiers are powerful but ultimately confounded by complicated real-world scenes with high variability, different viewpoints and occlusions. We propose a method that can automatically learn the interactions among scene elements and apply them to the holistic understanding of indoor scenes from a single image. This interpretation is performed within a hierarchical interaction model which describes an image by a parse graph, thereby fusing together object detection, layout estimation and scene classification. At the root of the parse graph is the scene type and layout while the leaves are the individual detections of objects. In between is the core of the system, our 3D Geometric Phrases (3DGP). We conduct extensive experimental evaluations on single image 3D scene understanding using both 2D and 3D metrics. The results demonstrate that our model with 3DGPs can provide robust estimation of scene type, 3D space, and 3D objects by leveraging the contextual relationships among the visual elements.
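The hierarchy described in the abstract (scene type and layout at the root, 3D Geometric Phrases in between, individual object detections at the leaves) can be pictured with a small data-structure sketch. This is only an illustration, not the authors' implementation: the class names, the field names, the example values, and the choice to let ungrouped detections attach directly to the root are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Detection:
    """Leaf node: one object hypothesis (field names are assumed)."""
    label: str
    score: float


@dataclass
class GeometricPhrase:
    """Intermediate node: a group of objects in a shared 3D configuration."""
    name: str
    parts: List[Detection] = field(default_factory=list)


@dataclass
class ParseGraph:
    """Root node: scene type and layout, with phrases and objects beneath it."""
    scene_type: str
    layout: str
    phrases: List[GeometricPhrase] = field(default_factory=list)
    # Assumption: detections that belong to no phrase hang off the root.
    objects: List[Detection] = field(default_factory=list)

    def leaves(self) -> List[Detection]:
        """Collect every detection, grouped ones first, then standalone ones."""
        grouped = [d for p in self.phrases for d in p.parts]
        return grouped + self.objects


# Hypothetical example values, chosen only to exercise the structure.
graph = ParseGraph(
    scene_type="dining room",
    layout="layout hypothesis A",
    phrases=[
        GeometricPhrase(
            "table with chair",
            [Detection("table", 0.9), Detection("chair", 0.7)],
        )
    ],
    objects=[Detection("sofa", 0.4)],
)

print([d.label for d in graph.leaves()])  # ['table', 'chair', 'sofa']
```

In this sketch a scene interpretation is a single `ParseGraph` value, so scoring or comparing alternative interpretations would amount to comparing such values; how the paper actually scores a parse graph is not stated in the abstract and is not modelled here.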
Pages: 204-220
Page count: 17
Related papers
50 in total
  • [1] Indoor Scene Understanding with Geometric and Semantic Contexts
    Wongun Choi
    Yu-Wei Chao
    Caroline Pantofaru
    Silvio Savarese
    International Journal of Computer Vision, 2015, 112 : 204 - 220
  • [2] Task-oriented Grasping with Semantic and Geometric Scene Understanding
    Detry, Renaud
    Papon, Jeremie
    Matthies, Larry
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3266 - 3273
  • [4] RGB-D joint modelling with scene geometric information for indoor semantic segmentation
    Liu, Hong
    Wu, Wenshan
    Wang, Xiangdong
    Qian, Yueliang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 22475 - 22488
  • [5] Semantic-Relation-First Active Learning for Scene Understanding in Indoor Environments
    Gan, Rundong
    Su, Longfei
    Chen, Haotian
    Yuan, Jing
    Liu, Jie
    Sun, Fengchi
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6964 - 6969
  • [6] Indoor Scene Segmentation with Semantic Cuboids
    Fang, Zhuoqun
    Wu, Chengdong
    Jia, Tong
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 2545 - 2550
  • [7] Indoor Semantic Scene Understanding Using 2D-3D Fusion
    Gopinathan, Muraleekrishna
    Truong, Giang
    Abu-Khalaf, Jumana
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 133 - 140
  • [8] DSNET: ACCELERATE INDOOR SCENE SEMANTIC SEGMENTATION
    Jiang, Feng
    Guo, Feng
    Ji, Rongrong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3317 - 3321
  • [9] Representation Learning for Semantic Scene Understanding
    Farshad, Azade
    HHAI 2023: AUGMENTING HUMAN INTELLECT, 2023, 368 : 445 - 458
  • [10] Rainy Night Scene Understanding With Near Scene Semantic Adaptation
    Di, Shuai
    Feng, Qi
    Li, Chun-Guang
    Zhang, Mei
    Zhang, Honggang
    Elezovikj, Semir
    Tan, Chiu C.
    Ling, Haibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) : 1594 - 1602