seUNet-Trans: A Simple Yet Effective UNet-Transformer Model for Medical Image Segmentation

Cited by: 4

Authors:

Pham, Tan-Hanh [1]

Li, Xianqi [2]

Nguyen, Kim-Doang [1]

Affiliations:

[1] Florida Inst Technol, Dept Mech & Aerosp Engn, Melbourne, FL 32901 USA

[2] Florida Inst Technol, Dept Math & Syst Engn, Melbourne, FL 32901 USA

Source:

IEEE Access | 2024 / Vol. 12

Funding:

U.S. Department of Agriculture

Keywords:

Transformers; Image segmentation; Medical diagnostic imaging; Decoding; Computer architecture; Colonoscopy; Biomedical imaging; Deep learning; Polyps; Medical image analysis; Vision transformers; Attention

DOI:

10.1109/ACCESS.2024.3451304

Chinese Library Classification number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Medical image segmentation plays a crucial role in modern clinical practice, enabling accurate diagnosis and personalized treatment plans. Advancements in machine learning, particularly deep learning techniques, have significantly driven this progress. While Convolutional Neural Networks (CNNs) dominate the field, transformer-based models are emerging as powerful alternatives for computer vision tasks. However, most existing CNN-Transformer models underutilize the full potential of Transformers, often relegating them to assistant modules. To address this issue, we propose a novel and efficient UNet-Transformer (seUNet-Trans) model for medical image segmentation. The seUNet-Trans framework leverages a UNet architecture for feature extraction, generating rich representations from input images. These features are then passed through a bridge layer that connects the UNet to a transformer module. To improve efficiency, we employ a novel pixel-wise embedding method that eliminates the need for position embedding vectors. We utilize spatially reduced attention within the transformer to reduce computational complexity. By combining the strengths of UNet's localization capabilities and the transformer's ability to capture long-range dependencies, seUNet-Trans effectively captures both local and global information within medical images. This holistic understanding enables the model to achieve superior segmentation performance. The efficacy of our model is demonstrated through extensive experimentation on seven medical image segmentation datasets. The seUNet-Trans model outperforms several state-of-the-art segmentation models, achieving impressive mean Dice Coefficient (mDC) and mean Intersection over Union (mIoU) scores. On the CVC-ClinicDB dataset, it achieves scores of 0.945 and 0.895, respectively; on the GlaS dataset, it scores 0.899 and 0.823, respectively; on the ISIC 2018 dataset, it achieves 0.922 and 0.854, respectively; and on the Data Science Bowl dataset, it scores 0.928 and 0.867, respectively. The code is available on seUnet-Trans.
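
The abstract reports results as mean Dice Coefficient (mDC) and mean Intersection over Union (mIoU). For readers unfamiliar with these metrics, the following is a minimal sketch of how they are commonly computed for binary masks. It is not the authors' evaluation code: the function names `dice_and_iou` and `mean_scores`, the nested-list mask format, and the convention of scoring two empty masks as 1.0 are all assumptions made for illustration.

```python
def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks given as nested lists of 0/1.

    Hypothetical helper for illustration; not taken from the paper.
    """
    flat_pred = [p for row in pred for p in row]
    flat_target = [t for row in target for t in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must contain the same number of pixels")

    # Pixels labelled foreground in both masks.
    intersection = sum(1 for p, t in zip(flat_pred, flat_target) if p and t)
    pred_area = sum(1 for p in flat_pred if p)
    target_area = sum(1 for t in flat_target if t)
    union = pred_area + target_area - intersection

    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0

    dice = 2 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


def mean_scores(pairs):
    """Average Dice and IoU over a list of (pred, target) mask pairs."""
    scores = [dice_and_iou(pred, target) for pred, target in pairs]
    mean_dice = sum(d for d, _ in scores) / len(scores)
    mean_iou = sum(i for _, i in scores) / len(scores)
    return mean_dice, mean_iou


if __name__ == "__main__":
    pred = [[1, 1, 0],
            [0, 1, 0]]
    target = [[1, 0, 0],
              [0, 1, 1]]
    # Overlap is 2 pixels; each mask has 3 foreground pixels; union is 4.
    print(dice_and_iou(pred, target))
    print(mean_scores([(pred, target), ([[1, 0]], [[1, 0]])]))
```

For a single pair of masks the two metrics are related by Dice = 2·IoU / (1 + IoU), so Dice is never lower than IoU; this identity does not hold exactly for the per-dataset means, because each metric is averaged separately over images.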

Pages: 122139-122154

Page count: 16
