UNestFormer: Enhancing Decoders and Skip Connections With Nested Transformers for Medical Image Segmentation

被引:0
|
作者
Tayeb, Adnan Md [1 ]
Kim, Tae-Hyong [1 ]
机构
[1] Kumoh Natl Inst Technol, Gumi Si 39177, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Decoding; Computational modeling; Semantics; Accuracy; Feature extraction; Computer architecture; Medical diagnostic imaging; Standards; Medical image segmentation; U-Net; CNN-transformer hybrid framework; omni-attention; nested skip connections;
D O I
10.1109/ACCESS.2024.3516079
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Precise identification of organs and lesions in medical images is essential for accurate disease diagnosis and analysis of organ structures. Deep convolutional neural network (CNN)-based U-shaped networks are among the most popular and promising approaches for this task. Recently, full-Transformer or hybrid CNN-Transformer structures have gained traction in medical image segmentation due to their effectiveness. However, current approaches face at least one of three limitations: 1) CNN-based models struggle to capture long-range dependencies, 2) Transformer-based models have limitations in extracting local features, resulting in a loss of low-level details, and 3) hybrid models are computationally complex. In this paper, we propose UNestFormer, a novel CNN-Transformer hybrid framework where densely connected nested transformers link the encoder and decoder to achieve more precise segmentation results by reducing the semantic gap between them. The traditional transformer, which uses the multi-head self-attention (MHSA), primarily addresses global attention but misses other forms of attention. In contrast, we designed an omni-attention, which incorporates four forms of attention: local, global, channel, and spatial, and introduced the omni-attention transformer block (OmniBlock). UNestFormer is designed to be comparatively lightweight yet robust and accurate. We argue that nested transformers with the proposed OmniBlock serve as strong decoders with efficient skip connections for medical image segmentation, enhancing feature aggregation and minimizing information loss. Extensive experiments validate UNestFormer's superiority across benchmark medical datasets. It outperforms its closest competitors by 2.1% on Synapse and 1.29% on ISIC 2018 in terms of the Dice similarity coefficient (DSC), and achieves a 1.45% improvement in the Hausdorff distance (HD) on Synapse, all while maintaining lower computational costs.
引用
收藏
页码:190996 / 191009
页数:14
相关论文
共 50 条
  • [31] Multi-modal medical Transformers: A meta-analysis for medical image segmentation in oncology
    Andrade-Miranda, Gustavo
    Jaouen, Vincent
    Tankyevych, Olena
    Le Rest, Catherine Cheze
    Visvikis, Dimitris
    Conze, Pierre-Henri
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 110
  • [32] LL-UNet++:UNet++ Based Nested Skip Connections Network for Low-Light Image Enhancement
    Shi, Pengfei
    Xu, Xiwang
    Fan, Xinnan
    Yang, Xudong
    Xin, Yuanxue
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2024, 10 : 510 - 521
  • [33] SGBTransNet: Bridging the semantic gap in medical image segmentation models using Transformers
    Zhou, Yunlai
    Wang, Bing
    Yang, Jie
    Yang, Ying
    Tian, Xuedong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 98
  • [34] INSTRAS: INfrared Spectroscopic imaging-based TRAnsformers for medical image Segmentation
    Lin, Hangzheng
    Falahkheirkhah, Kianoush
    Kindratenko, Volodymyr
    Bhargava, Rohit
    MACHINE LEARNING WITH APPLICATIONS, 2024, 16
  • [35] TC-Fuse: A Transformers Fusing CNNs Network for Medical Image Segmentation
    Geng, Peng
    Lu, Ji
    Zhang, Ying
    Ma, Simin
    Tang, Zhanzhong
    Liu, Jianhua
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 137 (02): : 2001 - 2023
  • [36] TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers
    Liu, Di
    Gao, Yunhe
    Zhangli, Qilong
    Han, Ligong
    He, Xiaoxiao
    Xia, Zhaoyang
    Wen, Song
    Chang, Qi
    Yan, Zhennan
    Zhou, Mu
    Metaxas, Dimitris
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 485 - 495
  • [37] TransLevelSet: Integrating vision transformers with level-sets for medical image segmentation
    Koutsiou, Dimitra-Christina C.
    Savelonas, Michalis A.
    Iakovidis, Dimitris K.
    NEUROCOMPUTING, 2024, 599
  • [38] The Lighter the Better: Rethinking Transformers in Medical Image Segmentation Through Adaptive Pruning
    Lin, Xian
    Yu, Li
    Cheng, Kwang-Ting
    Yan, Zengqiang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (08) : 2325 - 2337
  • [39] Enhancing the ability of convolutional neural networks for remote sensing image segmentation using transformers
    Barr M.
    Neural Computing and Applications, 2024, 36 (22) : 13605 - 13616
  • [40] Nested Dilation Network (NDN) for Multi-Task Medical Image Segmentation
    Wang, Liansheng
    Chen, Rongzhen
    Wang, Shuxin
    Zeng, Nianyin
    Huang, Xiaoyang
    Liu, Changhua
    IEEE ACCESS, 2019, 7 : 44676 - 44685