Multi-class indoor semantic segmentation with deep structured model

Cited by: 0
Authors
Chuanxia Zheng
Jianhua Wang
Weihai Chen
Xingming Wu
Affiliation
[1] Beihang University, School of Automation Science and Electrical Engineering
Source
The Visual Computer | 2018 / Vol. 34
Keywords
Semantic segmentation; Scene classification; Convolutional neural network; Graph-RNN; Conditional random field;
DOI
Not available
Abstract
Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition is still challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. We first present a multi-information fusion model that utilizes the scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of CNN, the RNN is applied to the final CNN layer so that we can build an end-to-end trainable system. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling that can utilize flexible contextual information of different neighboring regions. The experimental results on the recent large SUN RGB-D dataset demonstrate that the proposed model outperforms existing state-of-the-art methods on the challenging 40 dominant classes task (40.8% mean IU accuracy and 69.1% pixel accuracy). We also evaluate our model on the public NYU depth V2 dataset and achieve remarkable performance.
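The abstract reports two figures, mean IU accuracy and pixel accuracy. As a minimal sketch, assuming the conventional definitions of these metrics (this record does not state the paper's exact evaluation protocol), both can be derived from a confusion matrix over per-pixel labels. The function names and the toy label maps below are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def confusion_matrix(gt, pred, num_classes):
    """Count pixels per (ground-truth, predicted) class pair.

    Rows index the ground-truth class, columns the predicted class.
    Assumes predictions already lie in [0, num_classes).
    """
    gt = np.asarray(gt).ravel()
    pred = np.asarray(pred).ravel()
    # Skip pixels whose ground-truth label is outside the class range
    # (for example an "unlabeled" marker).
    valid = (gt >= 0) & (gt < num_classes)
    idx = num_classes * gt[valid] + pred[valid]
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def pixel_accuracy(cm):
    """Fraction of labeled pixels whose predicted class is correct."""
    return np.diag(cm).sum() / cm.sum()

def mean_iu(cm):
    """Mean intersection over union, averaged over classes that occur."""
    intersection = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - intersection
    present = union > 0  # avoid dividing by zero for absent classes
    return (intersection[present] / union[present]).mean()

# Toy 2x3 label maps with three classes (hypothetical data).
gt = [[0, 0, 1], [1, 2, 2]]
pred = [[0, 1, 1], [1, 2, 0]]
cm = confusion_matrix(gt, pred, num_classes=3)
print(pixel_accuracy(cm))  # 4 of 6 pixels correct
print(mean_iu(cm))         # mean of 1/3, 2/3 and 1/2
```

Pixel accuracy is dominated by large, frequent classes, while mean IU weights every class equally, which is why segmentation results are usually reported with both.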
Pages: 735-747
Page count: 12