Multi-class indoor semantic segmentation with deep structured model

Cited by: 0
Authors
Chuanxia Zheng
Jianhua Wang
Weihai Chen
Xingming Wu
Affiliation
[1] Beihang University, School of Automation Science and Electrical Engineering
Source
The Visual Computer | 2018 / Vol. 34
Keywords
Semantic segmentation; Scene classification; Convolutional neural network; Graph-RNN; Conditional random field;
DOI
Not available
Abstract
Indoor semantic segmentation plays a critical role in many applications, such as intelligent robots. However, multi-class recognition remains challenging, especially for pixel-level indoor semantic labeling. In this paper, a novel deep structured model is proposed that combines the strengths of the widely used convolutional neural networks (CNNs) and recurrent neural networks (RNNs). We first present a multi-information fusion model that uses scene category information to fine-tune the fully convolutional network. Then, to refine the coarse outputs of the CNN, an RNN is applied to the final CNN layer so that the whole system is end-to-end trainable. This Graph-RNN is transformed from a conditional random field based on superpixel segmentation graphical modeling, and can exploit flexible contextual information from different neighboring regions. The experimental results on the recent large SUN RGB-D dataset demonstrate that the proposed model outperforms existing state-of-the-art methods on the challenging 40 dominant classes task (40.8% mean IU accuracy and 69.1% pixel accuracy). We also evaluate our model on the public NYU Depth V2 dataset and achieve remarkable performance.
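The two figures quoted in the abstract, mean IU (intersection over union) and pixel accuracy, are standard segmentation metrics. The following is a minimal sketch of how they are commonly computed from a confusion matrix. It is not taken from the paper: the function names, the treatment of out-of-range ground-truth labels as unlabeled, and the choice to average IU only over classes that occur are all assumptions, and the authors' evaluation protocol may differ in these details.

```python
import numpy as np

def confusion_matrix(gt, pred, num_classes):
    """Count (ground-truth, predicted) label pairs over all labeled pixels.

    Assumption: ground-truth labels outside [0, num_classes) mark
    unlabeled pixels and are skipped; predictions are always in range.
    """
    gt = np.asarray(gt).ravel().astype(np.int64)
    pred = np.asarray(pred).ravel().astype(np.int64)
    valid = (gt >= 0) & (gt < num_classes)
    idx = num_classes * gt[valid] + pred[valid]
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def pixel_accuracy(cm):
    """Fraction of labeled pixels whose predicted class is correct."""
    return np.trace(cm) / cm.sum()

def mean_iu(cm):
    """Average per-class intersection over union.

    Assumption: classes that appear in neither the ground truth nor
    the prediction are left out of the average.
    """
    tp = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    present = union > 0
    return (tp[present] / union[present]).mean()

# Toy 2x3 label maps with three classes (illustrative data only).
gt = [[0, 0, 1], [1, 2, 2]]
pred = [[0, 1, 1], [1, 2, 0]]
cm = confusion_matrix(gt, pred, num_classes=3)
acc = pixel_accuracy(cm)
miu = mean_iu(cm)
```

On the toy maps, four of six pixels are correct, and the per-class IU values are 1/3, 2/3 and 1/2. Mean IU penalizes errors on rare classes more heavily than pixel accuracy does, which is why the two numbers reported for the 40-class task differ so much.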
Pages: 735-747
Page count: 12
Related papers
50 in total
  • [21] MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Bennamoun, Mohammed
    Boussaid, Farid
    Laga, Hamid
    Ouyang, Wanli
    Xu, Dan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8380 - 8395
  • [22] Multi-class Semantic Video Segmentation with Exemplar-based Object Reasoning
    Liu, Buyu
    He, Xuming
    Gould, Stephen
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 1014 - 1021
  • [23] PROTOTYPE QUEUE LEARNING FOR MULTI-CLASS FEW-SHOT SEMANTIC SEGMENTATION
    Wang, Zichao
    Jiang, Zhiyu
    Yuan, Yuan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1721 - 1725
  • [24] Active Bayesian Multi-class Mapping from Range and Semantic Segmentation Observations
    Asgharivaskasi, Arash
    Atanasov, Nikolay
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1 - 7
  • [25] Multi-Class Lane Semantic Segmentation of Expressway Dataset Based on Aerial View
    Fan, Yongnian
    Wang, Zhiguang
    Chen, Cheng
    Zhang, Xue
    Lu, Qiang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 200 - 211
  • [26] Deep Structured Features for Semantic Segmentation
    Tschannen, Michael
    Cavigelli, Lukas
    Mentzer, Fabian
    Wiatowski, Thomas
    Benini, Luca
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 61 - 65
  • [27] Bayesian Polytrees With Learned Deep Features for Multi-Class Cell Segmentation
    Fehri, Hamid
    Gooya, Ali
    Lu, Yuanjun
    Meijering, Erik
    Johnston, Simon A.
    Frangi, Alejandro F.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3246 - 3260
  • [28] Multi-class segmentation of temporomandibular joint using ensemble deep learning
    Yoon, Kyubaek
    Kim, Jae-Young
    Kim, Sun-Jong
    Huh, Jong-Ki
    Kim, Jin-Woo
    Choi, Jongeun
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [29] A Comparative Study of Deep Learning Methods for Multi-Class Semantic Segmentation of 2D Kidney Ultrasound Images
    Valente, Simao
    Morais, Pedro
    Torres, Helena R.
    Oliveira, Bruno
    Buschle, L. R.
    Fritz, A.
    Correia-Pinto, Jorge
    Lima, Estevao
    Vilaca, Joao L.
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [30] Dual attention-based deep learning network for multi-class object semantic segmentation of tunnel point clouds
    Ji, Ankang
    Zhang, Limao
    Fan, Hongqin
    Xue, Xiaolong
    Dou, Yudan
    AUTOMATION IN CONSTRUCTION, 2023, 156