Multi-class indoor semantic segmentation with deep structured model

被引：0

作者：

Chuanxia Zheng

Jianhua Wang

Weihai Chen

Xingming Wu

机构：

[1] Beihang University,School of Automation Science and Electrical Engineering

来源：

The Visual Computer | 2018年 / 34卷

关键词：

Semantic segmentation; Scene classification; Convolutional neural network; Graph-RNN; Conditional random field;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition is still challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. We first present a multi-information fusion model that utilizes the scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of CNN, the RNN is applied to the final CNN layer so that we can build an end-to-end trainable system. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling that can utilize flexible contextual information of different neighboring regions. The experimental results on the recent large SUN RGB-D dataset demonstrate that the proposed model outperforms existing state-of-the-art methods on the challenging 40 dominant classes task (40.8%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$40.8\%$$\end{document} mean IU accuracy and 69.1%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$69.1\%$$\end{document} pixel accuracy). We also evaluate our model on the public NYU depth V2 dataset and achieve remarkable performance.

引用

页码：735 / 747

页数：12

共 50 条

[41] Enhancing weakly supervised semantic segmentation through multi-class token attention learning
Luo, Huilan
Zeng, Zhen
JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
[42] Semantic Correlations Loss: Improving Model Interpretability for Multi-class Classification
Tong, Xuezhi
Wang, Rui
Cao, Xiaochun
Ren, Wenqi
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5070 - 5079
[43] Multi-Class Segmentation of Lung Immunofluorescence Confocal Images Using Deep Learning
Isaka, Shu
Kawanaka, Hiroharu
Aronow, Bruce J.
Prasath, V. B. Surya
2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2362 - 2368
[44] The Effect of Class-Weighted Penalization in Deep Neural Networks for Multi-Class Cell Segmentation
Aydin, Musa
Kus, Zeki
Kiraz, Berna
Hosavci, Reyhan
Kiraz, Alper
32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
[45] Multi-class Segmentation of Neuronal Electron Microscopy Images Using Deep Learning
Khobragade, Nivedita
Agarwal, Chirag
MEDICAL IMAGING 2018: IMAGE PROCESSING, 2018, 10574
[46] Multi-class video segmentation based on temporally consistent energy model
Bing, Liu
Advances in Information Sciences and Service Sciences, 2012, 4 (01): : 85 - 92
[47] Topological Deep Learning Model for Thyroid Multi-Class Categorization
Priya, T. Selva Banu
Rajabhushanam, C.
Sriram, M.
JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 701 - 711
[48] Multi-class Classification for Semantic Labeling of Places
Shi, Lei
Kodagoda, Sarath
Dissanayake, Gamini
11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2307 - 2312
[49] Multi-class segmentation with relative location prior
Gould, Stephen
Rodgers, Jim
Cohen, David
Elidan, Gal
Koller, Daphne
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 80 (03) : 300 - 316
[50] Layered Object Detection for Multi-Class Segmentation
Yang, Yi
Hallman, Sam
Ramanan, Deva
Fowlkes, Charless
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3113 - 3120

← 1 2 3 4 5 →