UNestFormer: Enhancing Decoders and Skip Connections With Nested Transformers for Medical Image Segmentation

Cited by: 0

Authors:

Tayeb, Adnan Md [1]

Kim, Tae-Hyong [1]

Affiliations:

[1] Kumoh Natl Inst Technol, Gumi Si 39177, South Korea

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Transformers; Image segmentation; Decoding; Computational modeling; Semantics; Accuracy; Feature extraction; Computer architecture; Medical diagnostic imaging; Standards; Medical image segmentation; U-Net; CNN-transformer hybrid framework; omni-attention; nested skip connections

DOI:

10.1109/ACCESS.2024.3516079

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Precise identification of organs and lesions in medical images is essential for accurate disease diagnosis and analysis of organ structures. Deep convolutional neural network (CNN)-based U-shaped networks are among the most popular and promising approaches for this task. Recently, full-Transformer or hybrid CNN-Transformer structures have gained traction in medical image segmentation due to their effectiveness. However, current approaches face at least one of three limitations: 1) CNN-based models struggle to capture long-range dependencies, 2) Transformer-based models have limitations in extracting local features, resulting in a loss of low-level details, and 3) hybrid models are computationally complex. In this paper, we propose UNestFormer, a novel CNN-Transformer hybrid framework where densely connected nested transformers link the encoder and decoder to achieve more precise segmentation results by reducing the semantic gap between them. The traditional transformer, which uses the multi-head self-attention (MHSA), primarily addresses global attention but misses other forms of attention. In contrast, we designed an omni-attention, which incorporates four forms of attention: local, global, channel, and spatial, and introduced the omni-attention transformer block (OmniBlock). UNestFormer is designed to be comparatively lightweight yet robust and accurate. We argue that nested transformers with the proposed OmniBlock serve as strong decoders with efficient skip connections for medical image segmentation, enhancing feature aggregation and minimizing information loss. Extensive experiments validate UNestFormer's superiority across benchmark medical datasets. It outperforms its closest competitors by 2.1% on Synapse and 1.29% on ISIC 2018 in terms of the Dice similarity coefficient (DSC), and achieves a 1.45% improvement in the Hausdorff distance (HD) on Synapse, all while maintaining lower computational costs.
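
The abstract reports its gains in terms of the Dice similarity coefficient (DSC). For a predicted mask P and a ground-truth mask G, the DSC is 2|P ∩ G| / (|P| + |G|). The following is a minimal sketch of that formula on flattened binary masks, not the authors' evaluation code; the function name `dice_coefficient`, the toy masks, and the choice to return 1.0 when both masks are empty are illustrative assumptions.

```python
def dice_coefficient(pred, target):
    """Dice similarity coefficient for two flat binary masks (sequences of 0/1)."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    # |P ∩ G|: pixels marked as foreground in both masks
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # |P| + |G|: foreground pixel counts of each mask
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    if total == 0:
        # Assumed convention: two empty masks count as perfect agreement
        return 1.0
    return 2.0 * intersection / total


# Hypothetical 6-pixel masks: 2 overlapping foreground pixels, 3 foreground pixels each
pred = [1, 1, 0, 0, 1, 0]
target = [1, 0, 0, 0, 1, 1]
print(round(dice_coefficient(pred, target), 4))  # 2*2 / (3+3) -> 0.6667
```

A DSC of 1.0 means the two masks coincide exactly and 0.0 means they share no foreground pixel. Multi-organ benchmarks typically average a per-class DSC, a detail this sketch leaves out.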

Pages: 190996-191009

Number of pages: 14
