CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation

Cited by: 12
Authors
Chen, Yuanbin [1, 2]
Wang, Tao [1, 2]
Tang, Hui [1, 2]
Zhao, Longxuan [1, 2]
Zhang, Xinlin [1, 2]
Tan, Tao [3]
Gao, Qinquan [1, 2]
Du, Min [1, 2]
Tong, Tong [1, 2]
Affiliations
[1] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350116, Peoples R China
[2] Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou 350116, Peoples R China
[3] Macao Polytech Univ, Fac Appl Sci, Macau 999078, Peoples R China
Source
PHYSICS IN MEDICINE AND BIOLOGY | 2023 / Vol. 68 / Issue 17
Funding
National Natural Science Foundation of China;
Keywords
medical image segmentation; convolutional neural network; transformer; SKIN-LESION SEGMENTATION; NET; NETWORK;
DOI
10.1088/1361-6560/acede8
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Medical image segmentation is a crucial and intricate process in medical image processing and analysis. With the advancements in artificial intelligence, deep learning techniques have been widely used in recent years for medical image segmentation. One such technique is the U-Net framework based on the U-shaped convolutional neural networks (CNN) and its variants. However, these methods have limitations in simultaneously capturing both the global and the remote semantic information due to the restricted receptive domain caused by the convolution operation's intrinsic features. Transformers are attention-based models with excellent global modeling capabilities, but their ability to acquire local information is limited. To address this, we propose a network that combines the strengths of both CNN and Transformer, called CoTrFuse. The proposed CoTrFuse network uses EfficientNet and Swin Transformer as dual encoders. The Swin Transformer and CNN Fusion module are combined to fuse the features of both branches before the skip connection structure. We evaluated the proposed network on two datasets: the ISIC-2017 challenge dataset and the COVID-QU-Ex dataset. Our experimental results demonstrate that the proposed CoTrFuse outperforms several state-of-the-art segmentation methods, indicating its superiority in medical image segmentation. The codes are available at https://github.com/BinYCn/CoTrFuse.
Pages: 13
Related papers
50 in total
  • [21] CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Shen, Chunhua
    Xia, Yong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 171 - 180
  • [22] A Novel Domain Adaptation Framework for Medical Image Segmentation
    Gholami, Amir
    Subramanian, Shashank
    Shenoy, Varun
    Himthani, Naveen
    Yue, Xiangyu
    Zhao, Sicheng
    Jin, Peter
    Biros, George
    Keutzer, Kurt
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2018, PT II, 2019, 11384 : 289 - 298
  • [23] Sclera-TransFuse: Fusing Swin Transformer and CNN for Accurate Sclera Segmentation
    Li, Haiqing
    Wang, Caiyong
    Zhao, Guangzhe
    He, Zhaofeng
    Wang, Yunlong
    Sun, Zhenan
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [24] LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation
    Lin, Qiqin
    Yao, Junfeng
    Hong, Qingqi
    Cao, Xianpeng
    Zhou, Rongzhou
    Xie, Weixing
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 223 - 234
  • [25] Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
    Luo, Xiangde
    Hu, Minhao
    Song, Tao
    Wang, Guotai
    Zhang, Shaoting
    INTERNATIONAL CONFERENCE ON MEDICAL IMAGING WITH DEEP LEARNING, VOL 172, 2022, 172 : 820 - 833
  • [26] Hybrid 3D Medical Image Segmentation Using CNN and Frequency Transformer Fusion
    Labbihi, Ismayl
    Meslouhi, Othmane El
    Elassad, Zouhair Elamrani Abou
    Benaddy, Mohamed
    Kardouchi, Mustapha
    Akhloufi, Moulay
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [27] Aggregated Mutual Learning between CNN and Transformer for semi-supervised medical image segmentation
    Xu, Zhenghua
    Wang, Hening
    Yang, Runhe
    Yang, Yuchen
    Liu, Weipeng
    Lukasiewicz, Thomas
    KNOWLEDGE-BASED SYSTEMS, 2025, 311
  • [28] ScribFormer: Transformer Makes CNN Work Better for Scribble-Based Medical Image Segmentation
    Li, Zihan
    Zheng, Yuan
    Shan, Dandan
    Yang, Shuzhou
    Li, Qingde
    Wang, Beizhan
    Zhang, Yuanting
    Hong, Qingqi
    Shen, Dinggang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (06) : 2254 - 2265
  • [29] UCTNet: Uncertainty-guided CNN-Transformer hybrid networks for medical image segmentation
    Guo, Xiayu
    Lin, Xian
    Yang, Xin
    Yu, Li
    Cheng, Kwang-Ting
    Yan, Zengqiang
    PATTERN RECOGNITION, 2024, 152
  • [30] RAMIS: Increasing robustness and accuracy in medical image segmentation with hybrid CNN-transformer synergy
    Gu, Jia
    Tian, Fangzheng
    Oh, Il-Seok
    NEUROCOMPUTING, 2025, 618