Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images

被引:16
|
作者
Khan, Salman H. [1 ]
Bennamoun, Mohammed [1 ]
Sohel, Ferdous [1 ]
Togneri, Roberto [2 ]
Naseem, Imran [3 ]
机构
[1] Univ Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
[2] Univ Western Australia, Sch EECE, 35 Stirling Highway, Crawley, WA 6009, Australia
[3] Karachi Inst Econ & Technol, Dept Engn, Karachi 75190, Pakistan
基金
澳大利亚研究理事会;
关键词
Scene parsing; Graphical models; Geometric reasoning; Structured learning; OBJECT RECOGNITION; FEATURES; SCALE; TEXTURE;
D O I
10.1007/s11263-015-0843-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inexpensive structured light sensors can capture rich information from indoor scenes, and scene labeling problems provide a compelling opportunity to make use of this information. In this paper we present a novel conditional random field (CRF) model to effectively utilize depth information for semantic labeling of indoor scenes. At the core of the model, we propose a novel and efficient plane detection algorithm which is robust to erroneous depthmaps. Our CRF formulation defines local, pairwise and higher order interactions between image pixels. At the local level, we propose a novel scheme to combine energies derived from appearance, depth and geometry-based cues. The proposed local energy also encodes the location of each object class by considering the approximate geometry of a scene. For the pairwise interactions, we learn a boundary measure which defines the spatial discontinuity of object classes across an image. To model higher-order interactions, the proposed energy treats smooth surfaces as cliques and encourages all the pixels on a surface to take the same label. We show that the proposed higher-order energies can be decomposed into pairwise submodular energies and efficient inference can be made using the graph-cuts algorithm. We follow a systematic approach which uses structured learning to fine-tune the model parameters. We rigorously test our approach on SUN3D and both versions of the NYU-Depth database. Experimental results show that our work achieves superior performance to state-of-the-art scene labeling techniques.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 44 条
  • [1] Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images
    Salman H. Khan
    Mohammed Bennamoun
    Ferdous Sohel
    Roberto Togneri
    Imran Naseem
    International Journal of Computer Vision, 2016, 117 : 1 - 20
  • [2] An Interactive Approach to Semantic Modeling of Indoor Scenes with an RGBD Camera
    Shao, Tianjia
    Xu, Weiwei
    Zhou, Kun
    Wang, Jingdong
    Li, Dongping
    Guo, Baining
    ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (06):
  • [3] Geometry Driven Semantic Labeling of Indoor Scenes
    Khan, Salman Hameed
    Bennamoun, Mohammed
    Sohel, Ferdous
    Togneri, Roberto
    COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 679 - 694
  • [4] Semantic Labeling of Indoor Scenes from RGB-D Images with Discriminative Learning
    Liu, Bo
    Fan, Haoqi
    SIXTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2013), 2013, 9067
  • [5] Labeling Images by Integrating Sparse Multiple Distance Learning and Semantic Context Modeling
    Ji, Chuanjun
    Zhou, Xiangdong
    Lin, Lan
    Yang, Weidong
    COMPUTER VISION - ECCV 2012, PT IV, 2012, 7575 : 688 - 701
  • [6] Context Aggregation Network for Semantic Labeling in Aerial Images
    Cheng, Wensheng
    Yang, Wen
    Wang, Min
    Wang, Gang
    Chen, Jinyong
    REMOTE SENSING, 2019, 11 (10)
  • [7] CONTEXT AGGREGATION NETWORK FOR SEMANTIC LABELING IN HISTOPATHOLOGY IMAGES
    Samanta, Pranab
    Raipuria, Geetank
    Singhal, Nitin
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 673 - 676
  • [8] CONTEXT AGGREGATION NETWORK FOR SEMANTIC LABELING IN AERIAL IMAGES
    Cheng, Wensheng
    Yang, Wen
    Pan, Youqi
    Guo, Haowen
    Cheng, Yu
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4484 - 4488
  • [9] Object-aware Semantic Mapping of Indoor Scenes using Octomap
    Liu, Kaijian
    Fan, Zhen
    Liu, Meiqin
    Zhang, Senlin
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8671 - 8676
  • [10] SEMANTIC DESCRIPTION OF AERIAL IMAGES USING STOCHASTIC LABELING
    FAUGERAS, OD
    PRICE, KE
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1981, 3 (06) : 633 - 642