Multi-class indoor semantic segmentation with deep structured model

被引:0
|
作者
Chuanxia Zheng
Jianhua Wang
Weihai Chen
Xingming Wu
机构
[1] Beihang University,School of Automation Science and Electrical Engineering
来源
The Visual Computer | 2018年 / 34卷
关键词
Semantic segmentation; Scene classification; Convolutional neural network; Graph-RNN; Conditional random field;
D O I
暂无
中图分类号
学科分类号
摘要
Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition is still challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. We first present a multi-information fusion model that utilizes the scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of CNN, the RNN is applied to the final CNN layer so that we can build an end-to-end trainable system. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling that can utilize flexible contextual information of different neighboring regions. The experimental results on the recent large SUN RGB-D dataset demonstrate that the proposed model outperforms existing state-of-the-art methods on the challenging 40 dominant classes task (40.8%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$40.8\%$$\end{document} mean IU accuracy and 69.1%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$69.1\%$$\end{document} pixel accuracy). We also evaluate our model on the public NYU depth V2 dataset and achieve remarkable performance.
引用
收藏
页码:735 / 747
页数:12
相关论文
共 50 条
  • [41] Enhancing weakly supervised semantic segmentation through multi-class token attention learning
    Luo, Huilan
    Zeng, Zhen
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [42] Semantic Correlations Loss: Improving Model Interpretability for Multi-class Classification
    Tong, Xuezhi
    Wang, Rui
    Cao, Xiaochun
    Ren, Wenqi
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5070 - 5079
  • [43] Multi-Class Segmentation of Lung Immunofluorescence Confocal Images Using Deep Learning
    Isaka, Shu
    Kawanaka, Hiroharu
    Aronow, Bruce J.
    Prasath, V. B. Surya
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2362 - 2368
  • [44] The Effect of Class-Weighted Penalization in Deep Neural Networks for Multi-Class Cell Segmentation
    Aydin, Musa
    Kus, Zeki
    Kiraz, Berna
    Hosavci, Reyhan
    Kiraz, Alper
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [45] Multi-class Segmentation of Neuronal Electron Microscopy Images Using Deep Learning
    Khobragade, Nivedita
    Agarwal, Chirag
    MEDICAL IMAGING 2018: IMAGE PROCESSING, 2018, 10574
  • [46] Multi-class video segmentation based on temporally consistent energy model
    Bing, Liu
    Advances in Information Sciences and Service Sciences, 2012, 4 (01): : 85 - 92
  • [47] Topological Deep Learning Model for Thyroid Multi-Class Categorization
    Priya, T. Selva Banu
    Rajabhushanam, C.
    Sriram, M.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 701 - 711
  • [48] Multi-class Classification for Semantic Labeling of Places
    Shi, Lei
    Kodagoda, Sarath
    Dissanayake, Gamini
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2307 - 2312
  • [49] Multi-class segmentation with relative location prior
    Gould, Stephen
    Rodgers, Jim
    Cohen, David
    Elidan, Gal
    Koller, Daphne
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 80 (03) : 300 - 316
  • [50] Layered Object Detection for Multi-Class Segmentation
    Yang, Yi
    Hallman, Sam
    Ramanan, Deva
    Fowlkes, Charless
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3113 - 3120