HST-MRF: Heterogeneous Swin Transformer With Multi-Receptive Field for Medical Image Segmentation

被引:0
|
作者
Huang, Xiaofei [1 ]
Gong, Hongfang [1 ]
Zhang, Jin [2 ]
机构
[1] Changsha Univ Sci & Technol, Sch Math & Stat, Changsha 410114, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Transformers; Biomedical imaging; Task analysis; Computational modeling; Feature extraction; Visualization; Heterogeneous attention; medical imaging segmentation; multi-receptive field; patch segmentation; NETWORK; CONNECTIONS;
D O I
10.1109/JBHI.2024.3397047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Transformer has been successfully used in medical image segmentation due to its excellent long-range modeling capabilities. However, patch segmentation is necessary when building a Transformer class model. This process ignores the tissue structure features within patch, resulting in the loss of shallow representation information. In this study, we propose a Heterogeneous Swin Transformer with Multi-Receptive Field (HST-MRF) model that fuses patch information from different receptive fields to solve the problem of loss of feature information caused by patch segmentation. The heterogeneous Swin Transformer (HST) is the core module, which achieves the interaction of multi-receptive field patch information through heterogeneous attention and passes it to the next stage for progressive learning, thus complementing the patch structure information. We also designed a two-stage fusion module, multimodal bilinear pooling (MBP), to assist HST in further fusing multi-receptive field information and combining low-level and high-level semantic information for accurate localization of lesion regions. In addition, we developed adaptive patch embedding (APE) and soft channel attention (SCA) modules to retain more valuable information when acquiring patch embedding and filtering channel features, respectively, thereby improving model segmentation quality. We evaluated HST-MRF on multiple datasets for polyp, skin lesion and breast ultrasound segmentation tasks. Experimental results show that our proposed method outperforms state-of-the-art models and can achieve superior performance. Furthermore, we verified the effectiveness of each module and the benefits of multi-receptive field segmentation in reducing the loss of structural information through ablation experiments and qualitative analysis.
引用
收藏
页码:4048 / 4061
页数:14
相关论文
共 50 条
  • [1] ConvMedSegNet: A multi-receptive field depthwise convolutional neural network for medical image segmentation
    Peng Y.
    Yi X.
    Zhang D.
    Zhang L.
    Tian Y.
    Zhou Z.
    Computers in Biology and Medicine, 2024, 176
  • [2] Multi-receptive Field Aggregation Network for single image deraining
    Liang, Songliang
    Meng, Xiaozhe
    Su, Zhuo
    Zhou, Fan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 84
  • [3] Multi-Task Mean Teacher Medical Image Segmentation Based on Swin Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Cheng, Yue
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [4] Swin-TransUper: Swin Transformer-based UperNet for medical image segmentation
    Yin J.
    Chen Y.
    Li C.
    Zheng Z.
    Gu Y.
    Zhou J.
    Multimedia Tools and Applications, 2024, 83 (42) : 89817 - 89836
  • [5] Medical image segmentation by combining feature enhancement Swin Transformer and UperNet
    Lin Zhang
    Xiaochun Yin
    Xuqi Liu
    Zengguang Liu
    Scientific Reports, 15 (1)
  • [6] High-Resolution Swin Transformer for Automatic Medical Image Segmentation
    Wei, Chen
    Ren, Shenghan
    Guo, Kaitai
    Hu, Haihong
    Liang, Jimin
    SENSORS, 2023, 23 (07)
  • [7] Swin Transformer Assisted Prior Attention Network for Medical Image Segmentation
    Liao, Zhihao
    Fan, Neng
    Xu, Kai
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [8] Placental MRI segmentation based on multi-receptive field and mixed attention separation mechanism
    Lee, Cong
    Liao, Zhifang
    Li, Yuanzhe
    Lai, Qingquan
    Guo, Yingying
    Huang, Jing
    Li, Shuting
    Wang, Yi
    Shi, Ruizheng
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 242
  • [9] A MULTI-SCALED RECEPTIVE FIELD LEARNING APPROACH FOR MEDICAL IMAGE SEGMENTATION
    Guo, Pengcheng
    Su, Xiangdong
    Zhang, Haoran
    Wang, Meng
    Bao, Feilong
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1414 - 1418
  • [10] DSTUNET: UNET WITH EFFICIENT DENSE SWIN TRANSFORMER PATHWAY FOR MEDICAL IMAGE SEGMENTATION
    Cai, Zhuotong
    Xin, Jingmin
    Shi, Peiwen
    Wu, Jiayi
    Zheng, Nanning
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,