Encoder-Decoder Structure Fusing Depth Information for Outdoor Semantic Segmentation

被引:1
|
作者
Chen, Songnan [1 ]
Tang, Mengxia [2 ]
Dong, Ruifang [2 ]
Kan, Jiangming [2 ]
机构
[1] Wuhan Polytech Univ, Sch Math & Comp Sci, 36 Huanhu Middle Rd, Wuhan 430048, Peoples R China
[2] Beijing Forestry Univ, Sch Technol, 35 Qinghua East Rd, Beijing 100083, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
关键词
semantic segmentation; RGB-D image; predicted depth map; fusion structure; feature pyramid; NETWORK;
D O I
10.3390/app13179924
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The semantic segmentation of outdoor images is the cornerstone of scene understanding and plays a crucial role in the autonomous navigation of robots. Although RGB-D images can provide additional depth information for improving the performance of semantic segmentation tasks, current state-of-the-art methods directly use ground truth depth maps for depth information fusion, which relies on highly developed and expensive depth sensors. Aiming to solve such a problem, we proposed a self-calibrated RGB-D image semantic segmentation neural network model based on an improved residual network without relying on depth sensors, which utilizes multi-modal information from depth maps predicted with depth estimation models and RGB image fusion for image semantic segmentation to enhance the understanding of a scene. First, we designed a novel convolution neural network (CNN) with an encoding and decoding structure as our semantic segmentation model. The encoder was constructed using IResNet to extract the semantic features of the RGB image and the predicted depth map and then effectively fuse them with the self-calibration fusion structure. The decoder restored the resolution of the output features with a series of successive upsampling structures. Second, we presented a feature pyramid attention mechanism to extract the fused information at multiple scales and obtain features with rich semantic information. The experimental results using the publicly available Cityscapes dataset and collected forest scene images show that our model trained with the estimated depth information can achieve comparable performance to the ground truth depth map in improving the accuracy of the semantic segmentation task and even outperforming some competitive methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Semantic segmentation method of underwater images based on encoder-decoder architecture
    Wang, Jinkang
    He, Xiaohui
    Shao, Faming
    Lu, Guanlin
    Hu, Ruizhe
    Jiang, Qunyan
    PLOS ONE, 2022, 17 (08):
  • [22] Deep Convolutional Encoder-Decoder Network with Model Uncertainty for Semantic Segmentation
    Isobe, Shuya
    Arai, Shuichi
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 365 - 370
  • [23] DeepLab-Rail: semantic segmentation network for railway scenes based on encoder-decoder structure
    Zeng, Qingsong
    Zhang, Linxuan
    Wang, Yuan
    Luo, Xiaolong
    Chen, Yannan
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (04)
  • [24] Real-time semantic segmentation of microvascular decompression images based on encoder-decoder structure
    Bai Rui-feng
    Jiang Shan
    Sun Hai-jiang
    Liu Xin-rui
    CHINESE OPTICS, 2022, 15 (05) : 1055 - 1065
  • [25] Rethinking the Encoder-decoder Structure in Medical Image Segmentation from Releasing Decoder Structure
    Ni, Jiajia
    Mu, Wei
    Pan, An
    Chen, Zhengming
    JOURNAL OF BIONIC ENGINEERING, 2024, 21 (03) : 1511 - 1521
  • [26] Fusing Brilliance: Evaluating the Encoder-Decoder Hybrids With CNN and Swin Transformer for Medical Segmentation
    Lee, Seunghyuk
    Kim, Songkuk
    IEEE ACCESS, 2024, 12 : 81842 - 81852
  • [27] Deep Convolutional Encoder-Decoder Architecture for Neuronal Structure Segmentation
    Cui, Qingqing
    Pu, Peng
    Chen, Lu
    Zhao, Wenzheng
    Liu, Yu
    2018 INTERNATIONAL CONFERENCE ON CONTROL, ARTIFICIAL INTELLIGENCE, ROBOTICS & OPTIMIZATION (ICCAIRO), 2018, : 242 - 247
  • [28] Semantic Segmentation of Crop and Weed using an Encoder-Decoder Network and Image Enhancement Method under Uncontrolled Outdoor Illumination
    Wang, Aichen
    Xu, Yifei
    Wei, Xinhua
    Cui, Bingbo
    IEEE ACCESS, 2020, 8 : 81724 - 81734
  • [29] SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGERY USING AN ENHANCED ENCODER-DECODER ARCHITECTURE
    Aburaed, N.
    Al-Saad, M.
    Alkhatib, M. Q.
    Zitouni, M. S.
    Almansoori, S.
    Al-Ahmad, H.
    GEOSPATIAL WEEK 2023, VOL. 10-1, 2023, : 1015 - 1020
  • [30] LEDNET: A LIGHTWEIGHT ENCODER-DECODER NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Wang, Yu
    Zhou, Quan
    Liu, Jia
    Xiong, Jian
    Gao, Guangwei
    Wu, Xiaofu
    Latecki, Longin Jan
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1860 - 1864