Multi-class indoor semantic segmentation with deep structured model

被引：0

作者：

Chuanxia Zheng

Jianhua Wang

Weihai Chen

Xingming Wu

机构：

[1] Beihang University,School of Automation Science and Electrical Engineering

来源：

The Visual Computer | 2018年 / 34卷

关键词：

Semantic segmentation; Scene classification; Convolutional neural network; Graph-RNN; Conditional random field;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition is still challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs) is proposed. We first present a multi-information fusion model that utilizes the scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of CNN, the RNN is applied to the final CNN layer so that we can build an end-to-end trainable system. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling that can utilize flexible contextual information of different neighboring regions. The experimental results on the recent large SUN RGB-D dataset demonstrate that the proposed model outperforms existing state-of-the-art methods on the challenging 40 dominant classes task (40.8%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$40.8\%$$\end{document} mean IU accuracy and 69.1%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$69.1\%$$\end{document} pixel accuracy). We also evaluate our model on the public NYU depth V2 dataset and achieve remarkable performance.

引用

页码：735 / 747

页数：12

共 50 条

[21] MCTformer plus : Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
Xu, Lian
Bennamoun, Mohammed
Boussaid, Farid
Laga, Hamid
Ouyang, Wanli
Xu, Dan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8380 - 8395
[22] Multi-class Semantic Video Segmentation with Exemplar-based Object Reasoning
Liu, Buyu
He, Xuming
Gould, Stephen
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 1014 - 1021
[23] PROTOTYPE QUEUE LEARNING FOR MULTI-CLASS FEW-SHOT SEMANTIC SEGMENTATION
Wang, Zichao
Jiang, Zhiyu
Yuan, Yuan
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1721 - 1725
[24] Active Bayesian Multi-class Mapping from Range and Semantic Segmentation Observations
Asgharivaskasi, Arash
Atanasov, Nikolay
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1 - 7
[25] Multi-Class Lane Semantic Segmentation of Expressway Dataset Based on Aerial View
Fan, Yongnian
Wang, Zhiguang
Chen, Cheng
Zhang, Xue
Lu, Qiang
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 200 - 211
[26] Deep Structured Features for Semantic Segmentation
Tschannen, Michael
Cavigelli, Lukas
Mentzer, Fabian
Wiatowski, Thomas
Benini, Luca
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 61 - 65
[27] Bayesian Polytrees With Learned Deep Features for Multi-Class Cell Segmentation
Fehri, Hamid
Gooya, Ali
Lu, Yuanjun
Meijering, Erik
Johnston, Simon A.
Frangi, Alejandro F.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3246 - 3260
[28] Multi-class segmentation of temporomandibular joint using ensemble deep learning
Yoon, Kyubaek
Kim, Jae-Young
Kim, Sun-Jong
Huh, Jong-Ki
Kim, Jin-Woo
Choi, Jongeun
SCIENTIFIC REPORTS, 2024, 14 (01):
[29] A Comparative Study of Deep Learning Methods for Multi-Class Semantic Segmentation of 2D Kidney Ultrasound Images
Valente, Simao
Morais, Pedro
Tones, Helena R.
Oliveira, Bruno
Buschle, L. R.
Fritz, A.
Correia-Pinto, Jorge
Lima, Estevao
Vilaca, Joao L.
2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
[30] Dual attention-based deep learning network for multi-class object semantic segmentation of tunnel point clouds
Ji, Ankang
Zhang, Limao
Fan, Hongqin
Xue, Xiaolong
Dou, Yudan
AUTOMATION IN CONSTRUCTION, 2023, 156

← 1 2 3 4 5 →