UNestFormer: Enhancing Decoders and Skip Connections With Nested Transformers for Medical Image Segmentation

被引:0
|
作者
Tayeb, Adnan Md [1 ]
Kim, Tae-Hyong [1 ]
机构
[1] Kumoh Natl Inst Technol, Gumi Si 39177, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Transformers; Image segmentation; Decoding; Computational modeling; Semantics; Accuracy; Feature extraction; Computer architecture; Medical diagnostic imaging; Standards; Medical image segmentation; U-Net; CNN-transformer hybrid framework; omni-attention; nested skip connections;
D O I
10.1109/ACCESS.2024.3516079
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Precise identification of organs and lesions in medical images is essential for accurate disease diagnosis and analysis of organ structures. Deep convolutional neural network (CNN)-based U-shaped networks are among the most popular and promising approaches for this task. Recently, full-Transformer or hybrid CNN-Transformer structures have gained traction in medical image segmentation due to their effectiveness. However, current approaches face at least one of three limitations: 1) CNN-based models struggle to capture long-range dependencies, 2) Transformer-based models have limitations in extracting local features, resulting in a loss of low-level details, and 3) hybrid models are computationally complex. In this paper, we propose UNestFormer, a novel CNN-Transformer hybrid framework where densely connected nested transformers link the encoder and decoder to achieve more precise segmentation results by reducing the semantic gap between them. The traditional transformer, which uses the multi-head self-attention (MHSA), primarily addresses global attention but misses other forms of attention. In contrast, we designed an omni-attention, which incorporates four forms of attention: local, global, channel, and spatial, and introduced the omni-attention transformer block (OmniBlock). UNestFormer is designed to be comparatively lightweight yet robust and accurate. We argue that nested transformers with the proposed OmniBlock serve as strong decoders with efficient skip connections for medical image segmentation, enhancing feature aggregation and minimizing information loss. Extensive experiments validate UNestFormer's superiority across benchmark medical datasets. It outperforms its closest competitors by 2.1% on Synapse and 1.29% on ISIC 2018 in terms of the Dice similarity coefficient (DSC), and achieves a 1.45% improvement in the Hausdorff distance (HD) on Synapse, all while maintaining lower computational costs.
引用
收藏
页码:190996 / 191009
页数:14
相关论文
共 50 条
  • [41] Long-Range Decoder Skip Connections: Exploiting Multi-Context Information for Cardiac Image Segmentation
    Gutierrez-Castilla, Nicolas
    Torres, Ricardo da S.
    Falcao, Alexandre X.
    Kozerke, Sebastian
    Schwitter, Jurg
    Masci, Pier-Giorgio
    Montoya-Zegarra, Javier A.
    2019 32ND SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2019, : 60 - 67
  • [42] EU-Net: Efficient Dense Skip-Connected Autoencoder for Medical Image Segmentation
    Liu, Lizhuang
    Qiu, Jiacun
    Wang, Ke
    Zhan, Qiao
    Jiang, Jiaxi
    Han, Zhenqi
    Qiu, Jianxin
    Wu, Tian
    Xu, Jinghang
    Zeng, Zheng
    IEEE ACCESS, 2023, 11 : 135959 - 135967
  • [43] Nested transformer decoder using dense skip-connections for change detection in high-resolution remote sensing image
    Meng, Xiangjun
    Song, Yonghong
    Li, Guofu
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (22) : 8061 - 8083
  • [44] Joint Modeling of Image and Label Statistics for Enhancing Model Generalizability of Medical Image Segmentation
    Gao, Shangqi
    Zhou, Hangqi
    Gao, Yibo
    Zhuang, Xiahai
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 360 - 369
  • [45] HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation
    Heidari, Moein
    Kazerouni, Amirhossein
    Soltany, Milad
    Azad, Reza
    Aghdam, Ehsan Khodapanah
    Cohen-Adad, Julien
    Merhof, Dorit
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6191 - 6201
  • [46] Hybrid Ladder Transformers with Efficient Parallel-Cross Attention for Medical Image Segmentation
    Luo Haozhe
    Yu Changdong
    Selvan, Raghavendra
    INTERNATIONAL CONFERENCE ON MEDICAL IMAGING WITH DEEP LEARNING, VOL 172, 2022, 172 : 808 - 819
  • [47] A Novel Adaptive Hypergraph Neural Network for Enhancing Medical Image Segmentation
    Chai, Shurong
    Jain, Rahul K.
    Mo, Shaocong
    Liu, Jiaqing
    Yang, Yulin
    Li, Yinhao
    Tateyama, Tomoko
    Lin, Lanfen
    Chen, Yen-Wei
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 23 - 33
  • [48] Enhancing Cross-Modal Medical Image Segmentation Through Compositionality
    Eijpe, Aniek
    Corbetta, Valentina
    Chupetlovska, Kalina
    Beets-Tan, Regina
    Silva, Wilson
    DEEP GENERATIVE MODELS, DGM4MICCAI 2024, 2025, 15224 : 43 - 53
  • [49] UNet plus plus : A Nested U-Net Architecture for Medical Image Segmentation
    Zhou, Zongwei
    Siddiquee, Md Mahfuzur Rahman
    Tajbakhsh, Nima
    Liang, Jianming
    DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, DLMIA 2018, 2018, 11045 : 3 - 11
  • [50] ConvFormer: Plug-and-Play CNN-Style Transformers for Improving Medical Image Segmentation
    Lin, Xian
    Yan, Zengqiang
    Deng, Xianbo
    Zheng, Chuansheng
    Yu, Li
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 642 - 651