Encoder-Decoder Structure Fusing Depth Information for Outdoor Semantic Segmentation

被引:1
|
作者
Chen, Songnan [1 ]
Tang, Mengxia [2 ]
Dong, Ruifang [2 ]
Kan, Jiangming [2 ]
机构
[1] Wuhan Polytech Univ, Sch Math & Comp Sci, 36 Huanhu Middle Rd, Wuhan 430048, Peoples R China
[2] Beijing Forestry Univ, Sch Technol, 35 Qinghua East Rd, Beijing 100083, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
关键词
semantic segmentation; RGB-D image; predicted depth map; fusion structure; feature pyramid; NETWORK;
D O I
10.3390/app13179924
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The semantic segmentation of outdoor images is the cornerstone of scene understanding and plays a crucial role in the autonomous navigation of robots. Although RGB-D images can provide additional depth information for improving the performance of semantic segmentation tasks, current state-of-the-art methods directly use ground truth depth maps for depth information fusion, which relies on highly developed and expensive depth sensors. Aiming to solve such a problem, we proposed a self-calibrated RGB-D image semantic segmentation neural network model based on an improved residual network without relying on depth sensors, which utilizes multi-modal information from depth maps predicted with depth estimation models and RGB image fusion for image semantic segmentation to enhance the understanding of a scene. First, we designed a novel convolution neural network (CNN) with an encoding and decoding structure as our semantic segmentation model. The encoder was constructed using IResNet to extract the semantic features of the RGB image and the predicted depth map and then effectively fuse them with the self-calibration fusion structure. The decoder restored the resolution of the output features with a series of successive upsampling structures. Second, we presented a feature pyramid attention mechanism to extract the fused information at multiple scales and obtain features with rich semantic information. The experimental results using the publicly available Cityscapes dataset and collected forest scene images show that our model trained with the estimated depth information can achieve comparable performance to the ground truth depth map in improving the accuracy of the semantic segmentation task and even outperforming some competitive methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Retinal vessel image segmentation algorithm based on encoder-decoder structure
    Zhai, ZhengLi
    Feng, Shu
    Yao, Luyao
    Li, Penghui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33361 - 33373
  • [42] CoAtUNet: A symmetric encoder-decoder with hybrid transformers for semantic segmentation of breast ultrasound images
    Zaidkilani, Nadeem
    Garcia, Miguel Angel
    Puig, Domenec
    NEUROCOMPUTING, 2025, 629
  • [43] Semantic Segmentation for Identifying Road Surface Damages Using Lightweight Encoder-Decoder Network
    Abdussyukur, Hafizh
    Sulistiyo, Mahmud Dwi
    Rachmawati, Ema
    Arief, Mansur Maturidi
    Kosala, Gamma
    Adiwijaya
    2022 INTERNATIONAL CONFERENCE ON ADVANCED CREATIVE NETWORKS AND INTELLIGENT SYSTEMS, ICACNIS, 2022, : 165 - 170
  • [44] Image Semantic Segmentation Method Based on Context and Shallow Space Encoder-decoder Network
    Luo, Hui-Lan
    Li, Xiao
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (07): : 1834 - 1846
  • [45] Semantic segmentation of retinal exudates using a residual encoder-decoder architecture in diabetic retinopathy
    Manan, Malik Abdul
    Jinchao, Feng
    Khan, Tariq M. M.
    Yaqub, Muhammad
    Ahmed, Shahzad
    Chuhan, Imran Shabir
    MICROSCOPY RESEARCH AND TECHNIQUE, 2023, 86 (11) : 1443 - 1460
  • [46] SegNetRes-CRF: A Deep Convolutional Encoder-Decoder Architecture for Semantic Image Segmentation
    de Oliveira Junior, Luiz Antonio
    Medeiros, Heitor R.
    Macedo, David
    Zanchettin, Cleber
    Oliveira, Adriano L., I
    Ludermir, Teresa
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [47] A lightweight efficient semantic segmentation with encoder-decoder for arc interference in robotic arc welding
    Chen, Xinyu
    He, Zhuzhen
    Ma, Qihao
    Ren, Yan
    Cui, Tong
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (02)
  • [48] Semantic Segmentation of Anaemic RBCs Using Multilevel Deep Convolutional Encoder-Decoder Network
    Shahzad, Muhammad
    Umar, Arif Iqbal
    Shirazi, Syed Hamad
    Shaikh, Israr Ahmed
    IEEE ACCESS, 2021, 9 : 161326 - 161341
  • [49] PPEDNet: Pyramid Pooling Encoder-Decoder Network for Real-Time Semantic Segmentation
    Tan, Zhentao
    Liu, Bin
    Yu, Nenghai
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 328 - 339
  • [50] Semantic Segmentation of Remote Sensing Image Based on Encoder-Decoder Convolutional Neural Network
    Zhang Zhehan
    Fang Wei
    Du Lili
    Qiao Yanli
    Zhang Dongying
    Ding Guoshen
    ACTA OPTICA SINICA, 2020, 40 (03)