seUNet-Trans: A Simple Yet Effective UNet-Transformer Model for Medical Image Segmentation

被引:4
|
作者
Pham, Tan-Hanh [1 ]
Li, Xianqi [2 ]
Nguyen, Kim-Doang [1 ]
机构
[1] Florida Inst Technol, Dept Mech & Aerosp Engn, Melbourne, FL 32901 USA
[2] Florida Inst Technol, Dept Math & Syst Engn, Melbourne, FL 32901 USA
来源
IEEE ACCESS | 2024年 / 12卷
基金
美国农业部;
关键词
Transformers; Image segmentation; Medical diagnostic imaging; Decoding; Computer architecture; Colonoscopy; Biomedical imaging; Deep learning; Polyps; colonoscopy; medical image analysis; deep learning; vision transformers; ATTENTION;
D O I
10.1109/ACCESS.2024.3451304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical image segmentation plays a crucial role in modern clinical practice, enabling accurate diagnosis and personalized treatment plans. Advancements in machine learning, particularly deep learning techniques, have significantly driven this progress. While Convolutional Neural Networks (CNNs) dominate the field, transformer-based models are emerging as powerful alternatives for computer vision tasks. However, most existing CNN-Transformer models underutilize the full potential of Transformers, often relegating them to assistant modules. To address this issue, we propose a novel and efficient UNet-Transformer (seUNet-Trans) model for medical image segmentation. The seUNet-Trans framework leverages a UNet architecture for feature extraction, generating rich representations from input images. These features are then passed through a bridge layer that connects the UNet to a transformer module. To improve efficiency, we employ a novel pixel-wise embedding method that eliminates the need for position embedding vectors. We utilize spatially reduced attention within the transformer to reduce computational complexity. By combining the strengths of UNet's localization capabilities and the transformer's ability to capture long-range dependencies, seUNet-Trans effectively captures both local and global information within medical images. This holistic understanding enables the model to achieve superior segmentation performance. The efficacy of our model is demonstrated through extensive experimentation on seven medical image segmentation datasets. The seUNet-Trans model outperforms several state-of-the-art segmentation models, achieving impressive mean Dice Coefficient (mDC) and mean Intersection over Union (mIoU) scores. On the CVC-ClinicDB dataset, it achieves scores of 0.945 and 0.895, respectively; on the GlaS dataset, it scores 0.899 and 0.823, respectively; on the ISIC 2018 dataset, it achieves 0.922 and 0.854, respectively; and on the Data Science Bowl dataset, it scores 0.928 and 0.867, respectively. The code is available on seUnet-Trans.
引用
收藏
页码:122139 / 122154
页数:16
相关论文
共 50 条
  • [41] Swin Unet3D: a three-dimensional medical image segmentation network combining vision transformer and convolution
    Yimin Cai
    Yuqing Long
    Zhenggong Han
    Mingkun Liu
    Yuchen Zheng
    Wei Yang
    Liming Chen
    BMC Medical Informatics and Decision Making, 23
  • [42] Dilated-UNet: A Fast and Accurate Medical Image Segmentation Approach using a Dilated Transformer and U-Net Architecture
    Saadati, Davoud
    Manzari, Omid Nejati
    Mirzakuchaki, Sattar
    arXiv, 2023,
  • [43] STM-UNet: An Efficient U-shaped Architecture Based on Swin Transformer and Multiscale MLP for Medical Image Segmentation
    Shi, Lei
    Gao, Tianyu
    Zhang, Zheng
    Zhang, Junxing
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2003 - 2008
  • [44] MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation
    Xi, Heran
    Dong, Haoji
    Sheng, Yue
    Cui, Hui
    Huang, Chengying
    Li, Jinbao
    Zhu, Jinghua
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (01):
  • [45] Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation
    Zhou, Wuyi
    Zeng, Xianhua
    Zhou, Mingkun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [46] A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer
    Zhang, Zhuo
    Wu, Hongbing
    Zhao, Huan
    Shi, Yicheng
    Wang, Jifang
    Bai, Hua
    Sun, Baoshan
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (04) : 663 - 677
  • [47] A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer
    Zhuo Zhang
    Hongbing Wu
    Huan Zhao
    Yicheng Shi
    Jifang Wang
    Hua Bai
    Baoshan Sun
    Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 663 - 677
  • [48] TransDiff: medical image segmentation method based on Swin Transformer with diffusion probabilistic model
    Liu, Xiaoxiao
    Zhao, Yan
    Wang, Shigang
    Wei, Jian
    APPLIED INTELLIGENCE, 2024, 54 (08) : 6543 - 6557
  • [49] ST-Unet: Swin Transformer boosted U-Net with Cross-Layer Feature Enhancement for medical image segmentation
    Zhang, Jing
    Qin, Qiuge
    Ye, Qi
    Ruan, Tong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [50] SIB-UNet: A dual encoder medical image segmentation model with selective fusion and information bottleneck fusion
    Li, Guangju
    Qi, Meng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252