UNestFormer: Enhancing Decoders and Skip Connections With Nested Transformers for Medical Image Segmentation

被引:0
|
作者
Tayeb, Adnan Md [1 ]
Kim, Tae-Hyong [1 ]
机构
[1] Kumoh Natl Inst Technol, Gumi Si 39177, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Decoding; Computational modeling; Semantics; Accuracy; Feature extraction; Computer architecture; Medical diagnostic imaging; Standards; Medical image segmentation; U-Net; CNN-transformer hybrid framework; omni-attention; nested skip connections;
D O I
10.1109/ACCESS.2024.3516079
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Precise identification of organs and lesions in medical images is essential for accurate disease diagnosis and analysis of organ structures. Deep convolutional neural network (CNN)-based U-shaped networks are among the most popular and promising approaches for this task. Recently, full-Transformer or hybrid CNN-Transformer structures have gained traction in medical image segmentation due to their effectiveness. However, current approaches face at least one of three limitations: 1) CNN-based models struggle to capture long-range dependencies, 2) Transformer-based models have limitations in extracting local features, resulting in a loss of low-level details, and 3) hybrid models are computationally complex. In this paper, we propose UNestFormer, a novel CNN-Transformer hybrid framework where densely connected nested transformers link the encoder and decoder to achieve more precise segmentation results by reducing the semantic gap between them. The traditional transformer, which uses the multi-head self-attention (MHSA), primarily addresses global attention but misses other forms of attention. In contrast, we designed an omni-attention, which incorporates four forms of attention: local, global, channel, and spatial, and introduced the omni-attention transformer block (OmniBlock). UNestFormer is designed to be comparatively lightweight yet robust and accurate. We argue that nested transformers with the proposed OmniBlock serve as strong decoders with efficient skip connections for medical image segmentation, enhancing feature aggregation and minimizing information loss. Extensive experiments validate UNestFormer's superiority across benchmark medical datasets. It outperforms its closest competitors by 2.1% on Synapse and 1.29% on ISIC 2018 in terms of the Dice similarity coefficient (DSC), and achieves a 1.45% improvement in the Hausdorff distance (HD) on Synapse, all while maintaining lower computational costs.
引用
收藏
页码:190996 / 191009
页数:14
相关论文
共 50 条
  • [21] Medical Image Segmentation Using Squeeze-and-Expansion Transformers
    Li, Shaohua
    Sui, Xiuchao
    Luo, Xiangde
    Xu, Xinxing
    Liu, Yong
    Goh, Rick
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 807 - 815
  • [22] Class-Aware Adversarial Transformers for Medical Image Segmentation
    You, Chenyu
    Zhao, Ruihan
    Liu, Fenglin
    Dong, Siyuan
    Chinchali, Sandeep
    Topcu, Ufuk
    Staib, Lawrence
    Duncan, James S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [23] UNETR: Transformers for 3D Medical Image Segmentation
    Hatamizadeh, Ali
    Tang, Yucheng
    Nath, Vishwesh
    Yang, Dong
    Myronenko, Andriy
    Landman, Bennett
    Roth, Holger R.
    Xu, Daguang
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1748 - 1758
  • [24] CTBANet: Convolution transformers and bidirectional attention for medical image segmentation
    Luo, Sha
    Pan, Li
    Jian, Yuanming
    Lu, Yunjiao
    Luo, Sisi
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 88 : 133 - 143
  • [25] BFNet: a full-encoder skip connect way for medical image segmentation
    Zhan, Siyu
    Yuan, Quan
    Lei, Xin
    Huang, Rui
    Guo, Lu
    Liu, Ke
    Chen, Rong
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [26] TransUNet plus : Redesigning the skip connection to enhance features in medical image segmentation
    Liu, Yuhang
    Wang, Han
    Chen, Zugang
    Huangliang, Kehan
    Zhang, Haixian
    KNOWLEDGE-BASED SYSTEMS, 2022, 256
  • [27] Pact-Net: Parallel CNNs and Transformers for medical image segmentation
    Chen, Weilin
    Zhang, Rui
    Zhang, Yunfeng
    Bao, Fangxun
    Lv, Haixia
    Li, Longhao
    Zhang, Caiming
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 242
  • [28] Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation
    Ou, Yanglan
    Yuan, Ye
    Huang, Xiaolei
    Wong, Stephen T. C.
    Volpi, John
    Wang, James Z.
    Wong, Kelvin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 475 - 484
  • [29] 3D Medical image segmentation using parallel transformers
    Yan, Qingsen
    Liu, Shengqiang
    Xu, Songhua
    Dong, Caixia
    Li, Zongfang
    Shi, Javen Qinfeng
    Zhang, Yanning
    Dai, Duwei
    PATTERN RECOGNITION, 2023, 138
  • [30] Medical Image Segmentation Use Convolutional Attention Augmentation TransUNet with Skip Connection Enhancement
    Zou, Qingzhi
    Zhao, Jing
    Li, Ming
    Chen, Ling
    Yuan, Lin
    Zhang, Ronghuan
    Hu, Yushuai
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1710 - 1715