seUNet-Trans: A Simple Yet Effective UNet-Transformer Model for Medical Image Segmentation

被引:4
|
作者
Pham, Tan-Hanh [1 ]
Li, Xianqi [2 ]
Nguyen, Kim-Doang [1 ]
机构
[1] Florida Inst Technol, Dept Mech & Aerosp Engn, Melbourne, FL 32901 USA
[2] Florida Inst Technol, Dept Math & Syst Engn, Melbourne, FL 32901 USA
来源
IEEE ACCESS | 2024年 / 12卷
基金
美国农业部;
关键词
Transformers; Image segmentation; Medical diagnostic imaging; Decoding; Computer architecture; Colonoscopy; Biomedical imaging; Deep learning; Polyps; colonoscopy; medical image analysis; deep learning; vision transformers; ATTENTION;
D O I
10.1109/ACCESS.2024.3451304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical image segmentation plays a crucial role in modern clinical practice, enabling accurate diagnosis and personalized treatment plans. Advancements in machine learning, particularly deep learning techniques, have significantly driven this progress. While Convolutional Neural Networks (CNNs) dominate the field, transformer-based models are emerging as powerful alternatives for computer vision tasks. However, most existing CNN-Transformer models underutilize the full potential of Transformers, often relegating them to assistant modules. To address this issue, we propose a novel and efficient UNet-Transformer (seUNet-Trans) model for medical image segmentation. The seUNet-Trans framework leverages a UNet architecture for feature extraction, generating rich representations from input images. These features are then passed through a bridge layer that connects the UNet to a transformer module. To improve efficiency, we employ a novel pixel-wise embedding method that eliminates the need for position embedding vectors. We utilize spatially reduced attention within the transformer to reduce computational complexity. By combining the strengths of UNet's localization capabilities and the transformer's ability to capture long-range dependencies, seUNet-Trans effectively captures both local and global information within medical images. This holistic understanding enables the model to achieve superior segmentation performance. The efficacy of our model is demonstrated through extensive experimentation on seven medical image segmentation datasets. The seUNet-Trans model outperforms several state-of-the-art segmentation models, achieving impressive mean Dice Coefficient (mDC) and mean Intersection over Union (mIoU) scores. On the CVC-ClinicDB dataset, it achieves scores of 0.945 and 0.895, respectively; on the GlaS dataset, it scores 0.899 and 0.823, respectively; on the ISIC 2018 dataset, it achieves 0.922 and 0.854, respectively; and on the Data Science Bowl dataset, it scores 0.928 and 0.867, respectively. The code is available on seUnet-Trans.
引用
收藏
页码:122139 / 122154
页数:16
相关论文
共 50 条
  • [21] RT-Unet: An advanced network based on residual network and transformer for medical image segmentation
    Li, Bo
    Liu, Sikai
    Wu, Fei
    Li, GuangHui
    Zhong, Meiling
    Guan, Xiaohui
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8565 - 8582
  • [22] AFC-Unet: Attention-fused full-scale CNN-transformer unet for medical image segmentation
    Meng, Wenjie
    Liu, Shujun
    Wang, Huajun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [23] A combined deformable model and medical transformer algorithm for medical image segmentation
    Tang, Zhixian
    Duan, Jintao
    Sun, Yanming
    Zeng, Yanan
    Zhang, Yile
    Yao, Xufeng
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (01) : 129 - 137
  • [24] A combined deformable model and medical transformer algorithm for medical image segmentation
    Zhixian Tang
    Jintao Duan
    Yanming Sun
    Yanan Zeng
    Yile Zhang
    Xufeng Yao
    Medical & Biological Engineering & Computing, 2023, 61 : 129 - 137
  • [25] H2MaT-Unet:Hierarchical hybrid multi-axis transformer based Unet for medical image segmentation
    Ju Z.
    Zhou Z.
    Qi Z.
    Yi C.
    Computers in Biology and Medicine, 2024, 174
  • [26] MISSFormer: An Effective Transformer for 2D Medical Image Segmentation
    Huang, Xiaohong
    Deng, Zhifang
    Li, Dandan
    Yuan, Xueguang
    Fu, Ying
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (05) : 1484 - 1494
  • [27] UCSwin-UNet model for medical image segmentation based on cardiac haemangioma
    Shi, Jian-Ting
    Qu, Gui-Xu
    Li, Zhi-Jun
    IET IMAGE PROCESSING, 2024, 18 (12) : 3302 - 3315
  • [28] ResTrans-Unet: A Residual-Aware Transformer-Based Approach to Medical Image Segmentation
    Ma, Fengying
    Wang, Zhi
    Ji, Peng
    Fu, Chengcai
    Wang, Feng
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (04)
  • [29] Parallel Transformer-CNN Model for Medical Image Segmentation
    Zhou, Mingkun
    Nie, Xueyun
    Liu, Yuhang
    Li, Doudou
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 1048 - 1051
  • [30] A Simple Generic Method for Effective Boundary Extraction in Medical Image Segmentation
    Kim, Minki
    Lee, Byoung-Dai
    IEEE ACCESS, 2021, 9 : 103875 - 103884