Semantic segmentation using cross-stage feature reweighting and efficient self-attention

被引:0
|
作者
Ma, Yingdong [1 ]
Lan, Xiaobin [1 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, 235 West Daxue Rd, Hohhot, Peoples R China
关键词
Semantic segmentation; Convolutional neural networks; Transformer; Feature fusion and reweighting; NETWORK;
D O I
10.1016/j.imavis.2024.104996
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, vision transformers have demonstrated strong performance in various computer vision tasks. The success of ViTs can be attribute to the ability of capturing long-range dependencies. However, transformer-based approaches often yield segmentation maps with incomplete object structures because of restricted cross-stage information propagation and lack of low-level details. To address these problems, we introduce a CNNtransformer semantic segmentation architecture which adopts a CNN backbone for multi-level feature extraction and a transformer encoder that focuses on global perception learning. Transformer embeddings of all stages are integrated to compute feature weights for dynamic cross-stage feature reweighting. As a result, high-level semantic context and low-level spatial details can be embedded into each stage to preserve multi-level information. An efficient attention-based feature fusion mechanism is developed to combine reweighted transformer embeddings with CNN features to generate segmentation maps with more complete object structure. Different from regular self-attention that has quadratic computational complexity, our efficient self-attention method achieves similar performance with linear complexity. Experimental results on ADE20K and Cityscapes datasets show that the proposed segmentation approach demonstrates superior performance against most state-of-the-art networks.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Cross-Modal Self-Attention Network for Referring Image Segmentation
    Ye, Linwei
    Rochan, Mrigank
    Liu, Zhi
    Wang, Yang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10494 - 10503
  • [32] Cross-Modal Self-Attention Distillation for Prostate Cancer Segmentation
    Zhang, Guokai
    Shen, Xiaoang
    Luo, Ye
    Luo, Jihao
    Wang, Zeju
    Wang, Weigang
    Zhao, Binghui
    Lu, Jianwei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 909 - 914
  • [33] Research of Self-Attention in Image Segmentation
    Cao, Fude
    Zheng, Chunguang
    Huang, Limin
    Wang, Aihua
    Zhang, Jiong
    Zhou, Feng
    Ju, Haoxue
    Guo, Haitao
    Du, Yuxia
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [34] Self-Attention Technology in Image Segmentation
    Cao, Fude
    Lu, Xueyun
    INTERNATIONAL CONFERENCE ON INTELLIGENT TRAFFIC SYSTEMS AND SMART CITY (ITSSC 2021), 2022, 12165
  • [35] Using Guided Self-Attention with Local Information for Polyp Segmentation
    Cai, Linghan
    Wu, Meijing
    Chen, Lijiang
    Bai, Wenpei
    Yang, Min
    Lyu, Shuchang
    Zhao, Qi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 629 - 638
  • [36] A global reweighting approach for cross-domain semantic segmentation
    Zhang, Yuhang
    Tian, Shishun
    Liao, Muxin
    Hua, Guoguang
    Zou, Wenbin
    Xu, Chen
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 130
  • [37] Semantic segmentation of 3D point cloud based on self-attention feature fusion group convolutional neural network
    Yang J.
    Li B.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (07): : 840 - 853
  • [38] Self-Attention Prediction Correction with Channel Suppression for Weakly-Supervised Semantic Segmentation
    Sun, Guoying
    Yang, Meng
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 846 - 851
  • [39] Robust Semi-Supervised Semantic Segmentation Based on Self-Attention and Spectral Normalization
    Zhang, Jia
    Li, Zhixin
    Zhang, Canlong
    Ma, Huifang
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [40] Saliency Guided Self-Attention Network for Weakly and Semi-Supervised Semantic Segmentation
    Yao, Qi
    Gong, Xiaojin
    IEEE ACCESS, 2020, 8 : 14413 - 14423