Semantic segmentation using cross-stage feature reweighting and efficient self-attention

被引:0
|
作者
Ma, Yingdong [1 ]
Lan, Xiaobin [1 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, 235 West Daxue Rd, Hohhot, Peoples R China
关键词
Semantic segmentation; Convolutional neural networks; Transformer; Feature fusion and reweighting; NETWORK;
D O I
10.1016/j.imavis.2024.104996
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, vision transformers have demonstrated strong performance in various computer vision tasks. The success of ViTs can be attribute to the ability of capturing long-range dependencies. However, transformer-based approaches often yield segmentation maps with incomplete object structures because of restricted cross-stage information propagation and lack of low-level details. To address these problems, we introduce a CNNtransformer semantic segmentation architecture which adopts a CNN backbone for multi-level feature extraction and a transformer encoder that focuses on global perception learning. Transformer embeddings of all stages are integrated to compute feature weights for dynamic cross-stage feature reweighting. As a result, high-level semantic context and low-level spatial details can be embedded into each stage to preserve multi-level information. An efficient attention-based feature fusion mechanism is developed to combine reweighted transformer embeddings with CNN features to generate segmentation maps with more complete object structure. Different from regular self-attention that has quadratic computational complexity, our efficient self-attention method achieves similar performance with linear complexity. Experimental results on ADE20K and Cityscapes datasets show that the proposed segmentation approach demonstrates superior performance against most state-of-the-art networks.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Efficient brain tumor segmentation using Swin transformer and enhanced local self-attention
    Fethi Ghazouani
    Pierre Vera
    Su Ruan
    International Journal of Computer Assisted Radiology and Surgery, 2024, 19 : 273 - 281
  • [22] Efficient brain tumor segmentation using Swin transformer and enhanced local self-attention
    Ghazouani, Fethi
    Vera, Pierre
    Ruan, Su
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 19 (2) : 273 - 281
  • [23] UNSUPERVISED DOMAIN ADAPTION FOR REMOTE SENSING SEMANTIC SEGMENTATION WITH SELF-ATTENTION MECHANISM
    Liu, Keming
    Liu, Fang
    Liu, Jia
    Xiao, Liang
    Tang, Xu
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6916 - 6919
  • [24] Multi-layered self-attention mechanism for weakly supervised semantic segmentation
    Yaganapu, Avinash
    Kang, Mingon
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 239
  • [25] Non-bias self-attention learning for weakly supervised semantic segmentation
    Sun, Wanchun
    Feng, Xin
    Liu, Jingyao
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 105
  • [26] Semantic Segmentation of Remote Sensing Image Based on Regional Self-Attention Mechanism
    Zhao, Danpei
    Wang, Chenxu
    Gao, Yue
    Shi, Zhenwei
    Xie, Fengying
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [27] CaSaFormer: A cross- and self-attention based lightweight network for large-scale building semantic segmentation
    Li, Jiayi
    Hu, Yuping
    Huang, Xin
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 130
  • [28] CSNet: Cross-Stage Subtraction Network for Real-Time Semantic Segmentation in Autonomous Driving
    Elhassan, Mohammed A. M.
    Zhou, Changjun
    Zhu, Donglin
    Adam, Abuzar B. M.
    Benabid, Amina
    Khan, Ali
    Mehmood, Atif
    Zhang, Jun
    Jin, Hu
    Jeon, Sang-Woon
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 4093 - 4108
  • [29] Real-Time Semantic Segmentation Network Based on Regional Self-Attention
    Bao Hailong
    Wan Min
    Liu Zhongxian
    Qin Mian
    Cui Haoyu
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [30] 1D Self-Attention Network for Point Cloud Semantic Segmentation Using Omnidirectional LiDAR
    Suzuki, Takahiro
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 257 - 270