Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation

被引:1
|
作者
Zhou, Yan [1 ]
Zhou, Haibin [2 ]
Yang, Yin [2 ]
Li, Jianxun [3 ]
Irampaye, Richard [2 ]
Wang, Dongli [1 ]
Zhang, Zhengpeng [1 ]
机构
[1] Xiangtan Univ, Sch Automat & Elect Informat, Xiangtan, Peoples R China
[2] Xiangtan Univ, Sch Math & Computat Sci, Xiangtan, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Semantic segmentation; Lightweight model; Upsampling fusion; Self-attention; U-NET;
D O I
10.1007/s00371-024-03590-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Semantic segmentation is an essential aspect of many computer vision tasks. Self-attention (SA)-based deep learning methods have shown impressive results in semantic segmentation by capturing long-range dependencies and contextual information. However, the standard SA module has high computational complexity, which limits its use in resource-constrained scenarios. This paper proposes a novel LUNet to improve semantic segmentation performance while addressing the computational challenges of SA. The lightweight self-attention plus (LSA++) module is introduced as a lightweight and efficient variant of the SA module. LSA++ uses compact feature representation and local position embedding to significantly reduce computational complexity while surpassing the accuracy of the standard SA module. Furthermore, to address the loss of edge details during decoding, we propose the enhanced upsampling fusion module (EUP-FM). This module comprises an enhanced upsampling module and a semantic vector-guided fusion mechanism. EUP-FM effectively recovers edge information and improves the precision of the segmentation map. Comprehensive experiments on PASCAL VOC 2012, Cityscapes, COCO, and SegPC 2021 demonstrate that LUNet outperforms all compared methods. It achieves superior runtime performance and accurate segmentation with excellent model generalization ability. The code is available at https://github.com/hbzhou530/LUNet.
引用
收藏
页码:3109 / 3128
页数:20
相关论文
共 50 条
  • [31] A lightweight segmentation network for endoscopic surgical instruments based on edge refinement and efficient self-attention
    Zhou, Mengyu
    Han, Xiaoxiang
    Liu, Zhoujin
    Chen, Yitong
    Sun, Liping
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [32] Image Editing via Segmentation Guided Self-Attention Network
    Zhang, Jianfu
    Yang, Peiming
    Wang, Wentao
    Hong, Yan
    Zhang, Liqing
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1605 - 1609
  • [33] A lightweight segmentation network for endoscopic surgical instruments based on edge refinement and efficient self-attention
    Zhou M.
    Han X.
    Liu Z.
    Chen Y.
    Sun L.
    PeerJ Computer Science, 2023, 9
  • [34] Lightweight Semantic Segmentation Network based on Attention Feature Fusion
    Kuang, Xianyan
    Liu, Ping
    Chen, Yixi
    Zhang, Jianhua
    ENGINEERING LETTERS, 2023, 31 (04) : 1584 - 1591
  • [35] Attention fusion network for multi-spectral semantic segmentation
    Xu, Jiangtao
    Lu, Kaige
    Wang, Han
    PATTERN RECOGNITION LETTERS, 2021, 146 : 179 - 184
  • [36] Feature Fusion Network Based on Hybrid Attention for Semantic Segmentation
    Xie Xinchen
    Li, Chen
    Tian, Lihua
    2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 9 - 14
  • [37] Self-Attention Progressive Network for Infrared and Visible Image Fusion
    Li, Shuying
    Han, Muyi
    Qin, Yuemei
    Li, Qiang
    REMOTE SENSING, 2024, 16 (18)
  • [38] UNSUPERVISED DOMAIN ADAPTION FOR REMOTE SENSING SEMANTIC SEGMENTATION WITH SELF-ATTENTION MECHANISM
    Liu, Keming
    Liu, Fang
    Liu, Jia
    Xiao, Liang
    Tang, Xu
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6916 - 6919
  • [39] Multi-layered self-attention mechanism for weakly supervised semantic segmentation
    Yaganapu, Avinash
    Kang, Mingon
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 239
  • [40] SEMANTIC IMAGES SEGMENTATION FOR AUTONOMOUS DRIVING USING SELF-ATTENTION KNOWLEDGE DISTILLATION
    Karine, Ayoub
    Napoleon, Thibault
    Jridi, Maher
    2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022, : 198 - 202