Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images

被引:16
|
作者
Khan, Salman H. [1 ]
Bennamoun, Mohammed [1 ]
Sohel, Ferdous [1 ]
Togneri, Roberto [2 ]
Naseem, Imran [3 ]
机构
[1] Univ Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
[2] Univ Western Australia, Sch EECE, 35 Stirling Highway, Crawley, WA 6009, Australia
[3] Karachi Inst Econ & Technol, Dept Engn, Karachi 75190, Pakistan
基金
澳大利亚研究理事会;
关键词
Scene parsing; Graphical models; Geometric reasoning; Structured learning; OBJECT RECOGNITION; FEATURES; SCALE; TEXTURE;
D O I
10.1007/s11263-015-0843-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inexpensive structured light sensors can capture rich information from indoor scenes, and scene labeling problems provide a compelling opportunity to make use of this information. In this paper we present a novel conditional random field (CRF) model to effectively utilize depth information for semantic labeling of indoor scenes. At the core of the model, we propose a novel and efficient plane detection algorithm which is robust to erroneous depthmaps. Our CRF formulation defines local, pairwise and higher order interactions between image pixels. At the local level, we propose a novel scheme to combine energies derived from appearance, depth and geometry-based cues. The proposed local energy also encodes the location of each object class by considering the approximate geometry of a scene. For the pairwise interactions, we learn a boundary measure which defines the spatial discontinuity of object classes across an image. To model higher-order interactions, the proposed energy treats smooth surfaces as cliques and encourages all the pixels on a surface to take the same label. We show that the proposed higher-order energies can be decomposed into pairwise submodular energies and efficient inference can be made using the graph-cuts algorithm. We follow a systematic approach which uses structured learning to fine-tune the model parameters. We rigorously test our approach on SUN3D and both versions of the NYU-Depth database. Experimental results show that our work achieves superior performance to state-of-the-art scene labeling techniques.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 44 条
  • [31] RGB-D Images for Object Segmentation, Localization and Recognition in Indoor Scenes using Feature Descriptor and Hough Voting
    Ahmed, Abrar
    Jalal, Ahmad
    Kim, Kibum
    PROCEEDINGS OF 2020 17TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2020, : 290 - 295
  • [32] Semantic Labeling of Globally Distributed Urban and Nonurban Satellite Images Using High-Resolution SAR Data
    Dumitru, Corneliu Octavian
    Schwarz, Gottfried
    Datcu, Mihai
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 6009 - 6068
  • [33] Context Aware Indoor Route Planning Using Semantic 3D Building Models with Cloud Computing
    Khan, Aftab Ahmed
    Yao, Zhihang
    Kolbe, Thomas H.
    3D GEOINFORMATION SCIENCE, 3D GEOINFO 2014, 2015, : 175 - 192
  • [34] Single Image Reflection Removal Using Non-Linearly Synthesized Glass Images and Semantic Context
    Han, Byeong-Ju
    Sim, Jae-Young
    IEEE ACCESS, 2019, 7 : 170796 - 170806
  • [35] Semantic Labeling and Instance Segmentation of 3D Point Clouds Using Patch Context Analysis and Multiscale Processing
    Hu, Shi-Min
    Cai, Jun-Xiong
    Lai, Yu-Kun
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (07) : 2485 - 2498
  • [36] DocSemMap 2.0: Semantic Labeling based on Textual Data Documentations Using Seq2Seq Context Learner
    Burgdorf, Andreas
    Paulus, Alexander
    Pomp, Andre
    Meisen, Tobias
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 98 - 107
  • [37] Context vector-based visual mapless navigation in indoor using hierarchical semantic information and meta-learning
    Li, Fei
    Guo, Chi
    Zhang, Huyin
    Luo, Binhan
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (02) : 2031 - 2041
  • [38] Context vector-based visual mapless navigation in indoor using hierarchical semantic information and meta-learning
    Fei Li
    Chi Guo
    Huyin Zhang
    Binhan Luo
    Complex & Intelligent Systems, 2023, 9 : 2031 - 2041
  • [39] Navigational Aid for Open-Ended Surveillance, by Fusing Estimated Depth and Scene Segmentation Maps, Using RGB Images of Indoor Scenes
    Saha, Binoy
    Shah, Neha
    Das, Sukhendu
    2022 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM, 2022,
  • [40] Semantic Labeling of Large-Area Geographic Regions Using Multiview and Multidate Satellite Images and Noisy OSM Training Labels
    Comandur, Bharath
    Kak, Avinash C.
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 4573 - 4594