UNestFormer: Enhancing Decoders and Skip Connections With Nested Transformers for Medical Image Segmentation

被引:0
|
作者
Tayeb, Adnan Md [1 ]
Kim, Tae-Hyong [1 ]
机构
[1] Kumoh Natl Inst Technol, Gumi Si 39177, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Decoding; Computational modeling; Semantics; Accuracy; Feature extraction; Computer architecture; Medical diagnostic imaging; Standards; Medical image segmentation; U-Net; CNN-transformer hybrid framework; omni-attention; nested skip connections;
D O I
10.1109/ACCESS.2024.3516079
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Precise identification of organs and lesions in medical images is essential for accurate disease diagnosis and analysis of organ structures. Deep convolutional neural network (CNN)-based U-shaped networks are among the most popular and promising approaches for this task. Recently, full-Transformer or hybrid CNN-Transformer structures have gained traction in medical image segmentation due to their effectiveness. However, current approaches face at least one of three limitations: 1) CNN-based models struggle to capture long-range dependencies, 2) Transformer-based models have limitations in extracting local features, resulting in a loss of low-level details, and 3) hybrid models are computationally complex. In this paper, we propose UNestFormer, a novel CNN-Transformer hybrid framework where densely connected nested transformers link the encoder and decoder to achieve more precise segmentation results by reducing the semantic gap between them. The traditional transformer, which uses the multi-head self-attention (MHSA), primarily addresses global attention but misses other forms of attention. In contrast, we designed an omni-attention, which incorporates four forms of attention: local, global, channel, and spatial, and introduced the omni-attention transformer block (OmniBlock). UNestFormer is designed to be comparatively lightweight yet robust and accurate. We argue that nested transformers with the proposed OmniBlock serve as strong decoders with efficient skip connections for medical image segmentation, enhancing feature aggregation and minimizing information loss. Extensive experiments validate UNestFormer's superiority across benchmark medical datasets. It outperforms its closest competitors by 2.1% on Synapse and 1.29% on ISIC 2018 in terms of the Dice similarity coefficient (DSC), and achieves a 1.45% improvement in the Hausdorff distance (HD) on Synapse, all while maintaining lower computational costs.
引用
收藏
页码:190996 / 191009
页数:14
相关论文
共 50 条
  • [1] The Importance of Skip Connections in Biomedical Image Segmentation
    Drozdzal, Michal
    Vorontsov, Eugene
    Chartrand, Gabriel
    Kadoury, Samuel
    Pal, Chris
    DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS, 2016, 10008 : 179 - 187
  • [2] Transformers in medical image segmentation: A review
    Xiao, Hanguang
    Li, Li
    Liu, Qiyuan
    Zhu, Xiuhong
    Zhang, Qihang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84
  • [3] LFU-Net: A Lightweight U-Net with Full Skip Connections for Medical Image Segmentation
    Deng, Yunjiao
    Wang, Hui
    Hou, Yulei
    Liang, Shunpan
    Zeng, Daxing
    CURRENT MEDICAL IMAGING, 2023, 19 (04) : 347 - 360
  • [4] Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation
    Wang, Haonan
    Cao, Peng
    Yang, Jinzhu
    Zaiane, Osmar
    NEURAL NETWORKS, 2024, 178
  • [5] SK-VM plus plus : Mamba assists skip-connections for medical image segmentation
    Wu, Renkai
    Pan, Liuyue
    Liang, Pengchen
    Chang, Qing
    Wang, Xianjin
    Fang, Weihuan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
  • [6] Transformers in medical image segmentation: a narrative review
    Khan, Rabeea Fatma
    Lee, Byoung-Dai
    Lee, Mu Sook
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2023, 13 (12) : 8747 - 8767
  • [7] Skip Connections for Medical Image Synthesis with Generative Adversarial Networks
    Mirza, Muhammad Usama
    Dalmaz, Onat
    Cukur, Tolga
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [8] Multi-scale context UNet-like network with redesigned skip connections for medical image segmentation
    Qian, Ledan
    Wen, Caiyun
    Li, Yi
    Hu, Zhongyi
    Zhou, Xiao
    Xia, Xiaonyu
    Kim, Soo-Hyung
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
  • [9] DEEP CONVOLUTIONAL ENCODER-DECODERS WITH AGGREGATED MULTI-RESOLUTION SKIP CONNECTIONS FOR SKIN LESION SEGMENTATION
    Shahin, Ahmed H.
    Amer, Karim
    Elattar, Mustafa A.
    2019 IEEE 16TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2019), 2019, : 451 - 454
  • [10] Deep convolutional encoder-decoders with aggregated multi-resolution skip connections for skin lesion segmentation
    Medical Imaging and Image Processing Group, Center for Informatics Sciences, Nile University, Egypt
    IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recogn., 1945, (451-454): : 451 - 454