Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation

被引:1
|
作者
Zhou, Yan [1 ]
Zhou, Haibin [2 ]
Yang, Yin [2 ]
Li, Jianxun [3 ]
Irampaye, Richard [2 ]
Wang, Dongli [1 ]
Zhang, Zhengpeng [1 ]
机构
[1] Xiangtan Univ, Sch Automat & Elect Informat, Xiangtan, Peoples R China
[2] Xiangtan Univ, Sch Math & Computat Sci, Xiangtan, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Lightweight model; Upsampling fusion; Self-attention; U-NET;
D O I
10.1007/s00371-024-03590-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Semantic segmentation is an essential aspect of many computer vision tasks. Self-attention (SA)-based deep learning methods have shown impressive results in semantic segmentation by capturing long-range dependencies and contextual information. However, the standard SA module has high computational complexity, which limits its use in resource-constrained scenarios. This paper proposes a novel LUNet to improve semantic segmentation performance while addressing the computational challenges of SA. The lightweight self-attention plus (LSA++) module is introduced as a lightweight and efficient variant of the SA module. LSA++ uses compact feature representation and local position embedding to significantly reduce computational complexity while surpassing the accuracy of the standard SA module. Furthermore, to address the loss of edge details during decoding, we propose the enhanced upsampling fusion module (EUP-FM). This module comprises an enhanced upsampling module and a semantic vector-guided fusion mechanism. EUP-FM effectively recovers edge information and improves the precision of the segmentation map. Comprehensive experiments on PASCAL VOC 2012, Cityscapes, COCO, and SegPC 2021 demonstrate that LUNet outperforms all compared methods. It achieves superior runtime performance and accurate segmentation with excellent model generalization ability. The code is available at https://github.com/hbzhou530/LUNet.
引用
收藏
页码:3109 / 3128
页数:20
相关论文
共 50 条
  • [21] Investigating Self-Attention Network for Chinese Word Segmentation
    Gan, Leilei
    Zhang, Yue
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2933 - 2941
  • [22] Fusion Attention Network for Autonomous Cars Semantic Segmentation
    Wang, Chuyao
    Aouf, Nabil
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1525 - 1530
  • [23] CFSA-Net: Efficient Large-Scale Point Cloud Semantic Segmentation Based on Cross-Fusion Self-Attention
    Shu, Jun
    Wang, Shuai
    Yu, Shiqi
    Zhang, Jie
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (03): : 2677 - 2697
  • [24] Cascaded Semantic and Positional Self-Attention Network for Document Classification
    Jiang, Juyong
    Zhang, Jie
    Zhang, Kai
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 669 - 677
  • [25] TSNet: Three-Stream Self-Attention Network for RGB-D Indoor Semantic Segmentation
    Zhou, Wujie
    Yuan, Jianzhong
    Lei, Jingsheng
    Luo, Ting
    IEEE INTELLIGENT SYSTEMS, 2021, 36 (04) : 73 - 78
  • [26] 1D Self-Attention Network for Point Cloud Semantic Segmentation Using Omnidirectional LiDAR
    Suzuki, Takahiro
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 257 - 270
  • [27] An Attention Enhanced Graph Convolutional Network for Semantic Segmentation
    Chen, Ao
    Zhou, Yue
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2020, 2020, 12305 : 734 - 745
  • [28] Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation
    Liu, Weitao
    Wu, Junjun
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [29] JS']JSMNet: Improving Indoor Point Cloud Semantic and Instance Segmentation through Self-Attention and Multiscale Fusion
    Xu, Shuochen
    Zhang, Zhenxin
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 195 - 201
  • [30] Progressively Normalized Self-Attention Network for Video Polyp Segmentation
    Ji, Ge-Peng
    Chou, Yu-Cheng
    Fan, Deng-Ping
    Chen, Geng
    Fu, Huazhu
    Jha, Debesh
    Shao, Ling
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 142 - 152