seUNet-Trans: A Simple Yet Effective UNet-Transformer Model for Medical Image Segmentation

被引:4
|
作者
Pham, Tan-Hanh [1 ]
Li, Xianqi [2 ]
Nguyen, Kim-Doang [1 ]
机构
[1] Florida Inst Technol, Dept Mech & Aerosp Engn, Melbourne, FL 32901 USA
[2] Florida Inst Technol, Dept Math & Syst Engn, Melbourne, FL 32901 USA
来源
IEEE ACCESS | 2024年 / 12卷
基金
美国农业部;
关键词
Transformers; Image segmentation; Medical diagnostic imaging; Decoding; Computer architecture; Colonoscopy; Biomedical imaging; Deep learning; Polyps; colonoscopy; medical image analysis; deep learning; vision transformers; ATTENTION;
D O I
10.1109/ACCESS.2024.3451304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical image segmentation plays a crucial role in modern clinical practice, enabling accurate diagnosis and personalized treatment plans. Advancements in machine learning, particularly deep learning techniques, have significantly driven this progress. While Convolutional Neural Networks (CNNs) dominate the field, transformer-based models are emerging as powerful alternatives for computer vision tasks. However, most existing CNN-Transformer models underutilize the full potential of Transformers, often relegating them to assistant modules. To address this issue, we propose a novel and efficient UNet-Transformer (seUNet-Trans) model for medical image segmentation. The seUNet-Trans framework leverages a UNet architecture for feature extraction, generating rich representations from input images. These features are then passed through a bridge layer that connects the UNet to a transformer module. To improve efficiency, we employ a novel pixel-wise embedding method that eliminates the need for position embedding vectors. We utilize spatially reduced attention within the transformer to reduce computational complexity. By combining the strengths of UNet's localization capabilities and the transformer's ability to capture long-range dependencies, seUNet-Trans effectively captures both local and global information within medical images. This holistic understanding enables the model to achieve superior segmentation performance. The efficacy of our model is demonstrated through extensive experimentation on seven medical image segmentation datasets. The seUNet-Trans model outperforms several state-of-the-art segmentation models, achieving impressive mean Dice Coefficient (mDC) and mean Intersection over Union (mIoU) scores. On the CVC-ClinicDB dataset, it achieves scores of 0.945 and 0.895, respectively; on the GlaS dataset, it scores 0.899 and 0.823, respectively; on the ISIC 2018 dataset, it achieves 0.922 and 0.854, respectively; and on the Data Science Bowl dataset, it scores 0.928 and 0.867, respectively. The code is available on seUnet-Trans.
引用
收藏
页码:122139 / 122154
页数:16
相关论文
共 50 条
  • [1] A novel full-convolution UNet-transformer for medical image segmentation
    Zhu, Tianyou
    Ding, Derui
    Wang, Feng
    Liang, Wei
    Wang, Bo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [2] LIT-Unet: a lightweight and effective model for medical image segmentation
    Wang, Ru
    Kou, Qiqi
    Dou, Lina
    RADIOLOGICAL PHYSICS AND TECHNOLOGY, 2024, 17 (04) : 878 - 887
  • [3] AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
    Yan, Xiangyi
    Tang, Hao
    Sun, Shanlin
    Ma, Haoyu
    Kong, Deying
    Xie, Xiaohui
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3270 - 3280
  • [4] SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Wang, Ziheng
    Min, Xiongkuo
    Shi, Fangyu
    Jin, Ruinian
    Nawrin, Saida S.
    Yu, Ichen
    Nagatomi, Ryoichi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 517 - 526
  • [5] TransCUNet: UNet cross fused transformer for medical image segmentation
    Jiang, Shen
    Li, Jinjiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [6] CSWin-UNet: Transformer UNet with cross-shaped windows for medical image segmentation
    Liu, Xiao
    Gao, Peng
    Yu, Tao
    Wang, Fei
    Yuan, Ru-Yue
    INFORMATION FUSION, 2025, 113
  • [7] LIGHTSEG: EFFICIENT YET EFFECTIVE MEDICAL IMAGE SEGMENTATION
    Jahan, Most Husne
    Imran, Abdullah Al Zubaer
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [8] DSTUNET: UNET WITH EFFICIENT DENSE SWIN TRANSFORMER PATHWAY FOR MEDICAL IMAGE SEGMENTATION
    Cai, Zhuotong
    Xin, Jingmin
    Shi, Peiwen
    Wu, Jiayi
    Zheng, Nanning
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [9] LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation
    Xu, Guoping
    Zhang, Xuan
    He, Xinwei
    Wu, Xinglong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 42 - 53
  • [10] MR-Trans: MultiResolution Transformer for medical image segmentation
    Zou, Yibo
    Ge, Yan
    Zhao, Linlin
    Li, Wei
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165