Multi-class indoor semantic segmentation with deep structured model

Cited by: 0

Authors:

Chuanxia Zheng

Jianhua Wang

Weihai Chen

Xingming Wu

Affiliation:

[1] Beihang University, School of Automation Science and Electrical Engineering

Source:

The Visual Computer | 2018 / Vol. 34, No. 5

Keywords:

Semantic segmentation; Scene classification; Convolutional neural network; Graph-RNN; Conditional random field

DOI:

Not available

Abstract:

Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition is still challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. We first present a multi-information fusion model that utilizes scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of the CNN, the RNN is applied to the final CNN layer so that we can build an end-to-end trainable system. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling, and it can utilize flexible contextual information from different neighboring regions. The experimental results on the recent large SUN RGB-D dataset demonstrate that the proposed model outperforms existing state-of-the-art methods on the challenging task with 40 dominant classes (40.8% mean IU accuracy and 69.1% pixel accuracy). We also evaluate our model on the public NYU Depth V2 dataset and achieve remarkable performance.
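The abstract reports results as mean IU accuracy and pixel accuracy. As a rough illustration of what these two numbers measure, the sketch below computes both from a confusion matrix on a toy label map. It assumes the standard definitions (per-class intersection over union averaged across classes, and the fraction of correctly labeled pixels); the function names, the toy data, and the choice to skip classes that never occur are assumptions for illustration, not the authors' evaluation code.

```python
def confusion_matrix(truth, pred, num_classes):
    """Count pixels: rows index the true class, columns the predicted class."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(truth, pred):
        matrix[t][p] += 1
    return matrix


def pixel_accuracy(matrix):
    """Fraction of all pixels whose predicted label equals the true label."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


def mean_iu(matrix):
    """Average over classes of TP / (TP + FP + FN).

    Assumption: classes with no true and no predicted pixels are skipped.
    """
    n = len(matrix)
    scores = []
    for c in range(n):
        tp = matrix[c][c]
        fn = sum(matrix[c]) - tp                        # true c, predicted other
        fp = sum(matrix[r][c] for r in range(n)) - tp   # predicted c, true other
        union = tp + fp + fn
        if union > 0:
            scores.append(tp / union)
    return sum(scores) / len(scores)


# Hypothetical toy example: six pixels, three classes (flattened label maps).
truth = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(truth, pred, 3)
print(pixel_accuracy(cm), mean_iu(cm))
```

Mean IU penalizes errors on rare classes more heavily than pixel accuracy does, which is why the two figures in the abstract can differ substantially on a 40-class task.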

Pages: 735-747

Page count: 12
