Enhancing medical image segmentation with a multi-transformer U-Net

被引:3
|
作者
Dan, Yongping [1 ]
Jin, Weishou [1 ]
Yue, Xuebin [2 ]
Wang, Zhida [1 ]
机构
[1] Zhongyuan Univ Technol, Sch Elect & Informat, Zhengzhou, Henan, Peoples R China
[2] Ritsumeikan Univ, Res Org Sci & Technol, Kusatsu, Japan
来源
PEERJ | 2024年 / 12卷
关键词
CT or X-ray lung images; Medical image segmentation; Multi-transformer; Unet;
D O I
10.7717/peerj.17005
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Various segmentation networks based on Swin Transformer have shown promise in medical segmentation tasks. Nonetheless, challenges such as lower accuracy and slower training convergence have persisted. To tackle these issues, we introduce a novel approach that combines the Swin Transformer and Deformable Transformer to enhance overall model performance. We leverage the Swin Transformer's window attention mechanism to capture local feature information and employ the Deformable Transformer to adjust sampling positions dynamically, accelerating model convergence and aligning it more closely with object shapes and sizes. By amalgamating both Transformer modules and incorporating additional skip connections to minimize information loss, our proposed model excels at rapidly and accurately segmenting CT or X-ray lung images. Experimental results demonstrate the remarkable, showcasing the significant prowess of our model. It surpasses the performance of the standalone Swin Transformer's Swin Unet and converges more rapidly under identical conditions, yielding accuracy improvements of 0.7% (resulting in 88.18%) and 2.7% (resulting in 98.01%) on the COVID-19 CT scan lesion segmentation dataset and Chest X-ray Masks and Labels dataset, respectively. This advancement has the potential to aid medical practitioners in early diagnosis and treatment decision-making.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] MUNet: A Multi-scale U-Net Framework for Medical Image Segmentation
    Zhang, Wentao
    Cheng, Hao
    Gan, Jun
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [22] SWTRU: Star-shaped Window Transformer Reinforced U-Net for medical image segmentation
    Zhang, Jianyi
    Liu, Yong
    Wu, Qihang
    Wang, Yongpan
    Liu, Yuhai
    Xu, Xianchong
    Song, Bo
    Computers in Biology and Medicine, 2022, 150
  • [23] Efficient combined algorithm of Transformer and U-Net for 3D medical image segmentation
    Zhang, Mingyan
    Wang, Aixia
    Yang, Gang
    Li, Jingjiao
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4377 - 4382
  • [24] SWTRU: Star-shaped Window Transformer Reinforced U-Net for medical image segmentation
    Zhang, Jianyi
    Liu, Yong
    Wu, Qihang
    Wang, Yongpan
    Liu, Yuhai
    Xu, Xianchong
    Song, Bo
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [25] Multiscale transunet plus plus : dense hybrid U-Net with transformer for medical image segmentation
    Wang, Bo
    Wang, Fan
    Dong, Pengwei
    Li, Chongyi
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (06) : 1607 - 1614
  • [26] 3D bi-directional transformer U-Net for medical image segmentation
    Fu, Xiyao
    Sun, Zhexian
    Tang, Haoteng
    Zou, Eric M.
    Huang, Heng
    Wang, Yong
    Zhan, Liang
    FRONTIERS IN BIG DATA, 2023, 5
  • [27] Ultrasound image segmentation based on Transformer and U-Net with joint loss
    Cai, Lina
    Li, Qingkai
    Zhang, Junhua
    Zhang, Zhenghua
    Yang, Rui
    Zhang, Lun
    PEERJ COMPUTER SCIENCE, 2023, 9 : 1 - 18
  • [28] Rethinking the unpretentious U-net for medical ultrasound image segmentation
    Chen, Gongping
    Li, Lei
    Zhang, Jianxun
    Dai, Yu
    PATTERN RECOGNITION, 2023, 142
  • [29] Design of Superpiexl U-Net Network for Medical Image Segmentation
    Wang H.
    Liu H.
    Guo Q.
    Deng K.
    Zhang C.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (06): : 1007 - 1017
  • [30] Medical ultrasound image segmentation using Multi-Residual U-Net architecture
    Shereena V. B.
    Raju G.
    Multimedia Tools and Applications, 2024, 83 (9) : 27067 - 27088