Multi-class indoor semantic segmentation with deep structured model

Cited by: 0
Authors
Chuanxia Zheng
Jianhua Wang
Weihai Chen
Xingming Wu
Affiliation
[1] Beihang University, School of Automation Science and Electrical Engineering
Source
The Visual Computer | 2018 / Vol. 34
Keywords
Semantic segmentation; Scene classification; Convolutional neural network; Graph-RNN; Conditional random field;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition is still challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. We first present a multi-information fusion model that utilizes the scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of the CNN, the RNN is applied to the final CNN layer so that we can build an end-to-end trainable system. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling that can utilize flexible contextual information of different neighboring regions. The experimental results on the recent large SUN RGB-D dataset demonstrate that the proposed model outperforms existing state-of-the-art methods on the challenging 40 dominant classes task (40.8% mean IU accuracy and 69.1% pixel accuracy). We also evaluate our model on the public NYU Depth V2 dataset and achieve remarkable performance.
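The two figures quoted in the abstract, mean IU (mean intersection over union) and pixel accuracy, are commonly derived from a per-class confusion matrix. The following is a minimal sketch of those standard definitions in plain Python. The function names and the toy labels are illustrative assumptions, and the paper's exact evaluation protocol (for example, how unlabeled pixels are treated) is not stated in this record.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def pixel_accuracy(cm):
    """Fraction of all pixels whose predicted class equals the ground truth."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total


def mean_iu(cm):
    """Average over classes of intersection / union (TP / (GT + Pred - TP))."""
    n = len(cm)
    ious = []
    for i in range(n):
        tp = cm[i][i]
        gt = sum(cm[i])                          # pixels labeled class i
        pred = sum(cm[j][i] for j in range(n))   # pixels predicted as class i
        union = gt + pred - tp
        if union > 0:  # assumption: skip classes absent from both label and prediction
            ious.append(tp / union)
    return sum(ious) / len(ious)


# Toy example (hypothetical labels for six pixels and three classes).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, 3)
print(pixel_accuracy(cm), mean_iu(cm))
```

Mean IU penalizes errors on rare classes more heavily than pixel accuracy does, which is one reason the two numbers can differ widely on a 40-class task.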
Pages: 735-747
Page count: 12
Related papers
50 in total
  • [1] Multi-class indoor semantic segmentation with deep structured model
    Zheng, Chuanxia
    Wang, Jianhua
    Chen, Weihai
    Wu, Xingming
    VISUAL COMPUTER, 2018, 34 (05): : 735 - 747
  • [2] MULTI-CLASS SEMANTIC SEGMENTATION OF FACES
    Khan, Khalil
    Mauro, Massimo
    Leonardi, Riccardo
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 827 - 831
  • [3] Deep Learning Approach for Multi-class Semantic Segmentation of UAV Images
    Chouhan, Avinash
    Chutia, Dibyajyoti
    Aggarwal, Shiv Prasad
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2023, 32 (07)
  • [4] A Combined Method for Multi-class Image Semantic Segmentation
    Gao, Chao
    Zhang, Xin
    Wang, Hui
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (02) : 596 - 604
  • [5] Multi-class semantic segmentation of pediatric chest radiographs
    Holste, Gregory
    Sullivan, Ryan P.
    Bindschadler, Michael
    Nagy, Nicholas
    Alessio, Adam
    MEDICAL IMAGING 2020: IMAGE PROCESSING, 2021, 11313
  • [6] Semantic Segmentation for Brain Injury Using Multi-Class Deep Level Convolution Networks
    Roselin Mary, S.
    Singh, Manmohan
    Aparna, N.
    Rosy Salomi Victoria, D.
    CYBERNETICS AND SYSTEMS, 2023,
  • [7] Deep Semantic Segmentation and Multi-Class Skin Lesion Classification Based on Convolutional Neural Network
    Anjum, Muhammad Almas
    Amin, Javaria
    Sharif, Muhammad
    Khan, Habib Ullah
    Malik, Muhammad Sheraz Arshad
    Kadry, Seifedine
    IEEE ACCESS, 2020, 8 : 129668 - 129678
  • [8] Segmentation-based multi-class semantic object detection
    Vieux, Remi
    Benois-Pineau, Jenny
    Domenger, Jean-Philippe
    Braquelaire, Achille
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
  • [9] Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Ouyang, Wanli
    Bennamoun, Mohammed
    Boussaid, Farid
    Xu, Dan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4300 - 4309