Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

Authors
Liu, Weitao [1]
Wu, Junjun [1]
Affiliations
[1] Foshan Univ, Guangdong Prov Key Lab Ind Intelligent Inspection, Foshan, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China;
Keywords
Semantic segmentation; Channel2DTransformer; Self-attention; Deep learning;
DOI
10.1007/s44196-024-00630-5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, real-world scenes often contain diverse multi-scale objects, which makes accurate semantic segmentation challenging. To address this challenge, we propose a multi-level feature self-attention fusion module called Channel2DTransformer. The module uses self-attention to fuse multi-level features dynamically by computing self-attention weights between their channels, yielding a consistent and comprehensive representation of scene features. We evaluate the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results show that the module improves segmentation accuracy for multi-scale objects and overall semantic segmentation performance in complex scenes.
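
The abstract gives only the general idea of fusing multi-level features through self-attention weights computed between channels. The following is a minimal, dependency-free sketch of that general idea, not the authors' Channel2DTransformer: it assumes all levels have already been resized to the same spatial size, omits learned query/key/value projections, and adds a residual connection as an illustrative choice. The names `softmax` and `channel_attention_fuse` are hypothetical.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def channel_attention_fuse(levels):
    """Fuse multi-level features with channel-to-channel self-attention.

    levels: list of feature maps; each map is a list of channels, and each
    channel is a flat list of N spatial values (assumption: every level has
    already been resized to the same N).
    Returns (fused_channels, attention_weights).
    """
    # Stack the channels of every level into one C x N matrix.
    x = [channel for level in levels for channel in level]
    n = len(x[0])
    if any(len(channel) != n for channel in x):
        raise ValueError("all channels must share the same spatial size")

    # Scaled dot-product scores between every pair of channels (C x C).
    scale = math.sqrt(n)
    scores = [[sum(a * b for a, b in zip(ci, cj)) / scale for cj in x] for ci in x]
    weights = [softmax(row) for row in scores]

    # Each output channel is a weighted mix of all channels, plus a residual
    # (the residual is an illustrative assumption, not taken from the paper).
    fused = [
        [sum(w * x[j][k] for j, w in enumerate(row)) + x[i][k] for k in range(n)]
        for i, row in enumerate(weights)
    ]
    return fused, weights

# Toy example: a 2-channel shallow level and a 1-channel deep level, 4 pixels each.
shallow = [[1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0]]
deep = [[0.5, 0.5, 0.5, 0.5]]
fused, weights = channel_attention_fuse([shallow, deep])
```

In this toy input, each row of `weights` sums to 1, and the first shallow channel attends most strongly to itself, then to the deep channel, and least to the orthogonal shallow channel. A practical module would use learned projections and tensor operations in a deep learning framework instead of nested lists.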
Pages: 11