HST-MRF: Heterogeneous Swin Transformer With Multi-Receptive Field for Medical Image Segmentation

被引:0
|
作者
Huang, Xiaofei [1 ]
Gong, Hongfang [1 ]
Zhang, Jin [2 ]
机构
[1] Changsha Univ Sci & Technol, Sch Math & Stat, Changsha 410114, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Transformers; Biomedical imaging; Task analysis; Computational modeling; Feature extraction; Visualization; Heterogeneous attention; medical imaging segmentation; multi-receptive field; patch segmentation; NETWORK; CONNECTIONS;
D O I
10.1109/JBHI.2024.3397047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Transformer has been successfully used in medical image segmentation due to its excellent long-range modeling capabilities. However, patch segmentation is necessary when building a Transformer class model. This process ignores the tissue structure features within patch, resulting in the loss of shallow representation information. In this study, we propose a Heterogeneous Swin Transformer with Multi-Receptive Field (HST-MRF) model that fuses patch information from different receptive fields to solve the problem of loss of feature information caused by patch segmentation. The heterogeneous Swin Transformer (HST) is the core module, which achieves the interaction of multi-receptive field patch information through heterogeneous attention and passes it to the next stage for progressive learning, thus complementing the patch structure information. We also designed a two-stage fusion module, multimodal bilinear pooling (MBP), to assist HST in further fusing multi-receptive field information and combining low-level and high-level semantic information for accurate localization of lesion regions. In addition, we developed adaptive patch embedding (APE) and soft channel attention (SCA) modules to retain more valuable information when acquiring patch embedding and filtering channel features, respectively, thereby improving model segmentation quality. We evaluated HST-MRF on multiple datasets for polyp, skin lesion and breast ultrasound segmentation tasks. Experimental results show that our proposed method outperforms state-of-the-art models and can achieve superior performance. Furthermore, we verified the effectiveness of each module and the benefits of multi-receptive field segmentation in reducing the loss of structural information through ablation experiments and qualitative analysis.
引用
收藏
页码:4048 / 4061
页数:14
相关论文
共 50 条
  • [41] CTRANS: A MULTI-RESOLUTION CONVOLUTION-TRANSFORMER NETWORK FOR MEDICAL IMAGE SEGMENTATION
    Gong, Zhendi
    French, Andrew P.
    Qiu, Guoping
    Chen, Xin
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [42] ST-Unet: Swin Transformer boosted U-Net with Cross-Layer Feature Enhancement for medical image segmentation
    Zhang, Jing
    Qin, Qiuge
    Ye, Qi
    Ruan, Tong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [43] Multi-task Heterogeneous Framework for Semi-supervised Medical Image Segmentation
    Cao, Jinghan
    Fan, Huijie
    Fu, Shengpeng
    Xu, Ling
    Chen, Xi'ai
    Lin, Sen
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II, 2025, 15202 : 77 - 88
  • [44] Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
    Rahman, Md Mostafijur
    Marculescu, Radu
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1526 - 1544
  • [45] MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion
    Liang, Zhiwei
    Zhao, Kui
    Liang, Gang
    Li, Siyu
    Wu, Yifei
    Zhou, Yiping
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [46] MSMVT: Semi-Supervised Framework with Multi-Scale and Multi-View Transformer for Medical Image Segmentation
    Li, Feixiang
    Jiang, Ailian
    Computer Engineering and Applications, 61 (02): : 273 - 282
  • [47] MS-Former: Multi-Scale Self-Guided Transformer for Medical Image Segmentation
    Karimijafarbigloo, Sanaz
    Azad, Reza
    Kazerouni, Amirhossein
    Merhof, Dorit
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 680 - 694
  • [48] Multi-scale convolutional attention frequency-enhanced transformer network for medical image segmentation
    Yan, Shun
    Yang, Benquan
    Chen, Aihua
    Zhao, Xiaoming
    Zhang, Shiqing
    INFORMATION FUSION, 2025, 119
  • [49] MSGAT: Multi-scale gated axial reverse attention transformer network for medical image segmentation
    Liu, Yanjun
    Yun, Haijiao
    Xia, Yang
    Luan, Jinyang
    Li, Mingjing
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
  • [50] CascadeMedSeg: integrating pyramid vision transformer with multi-scale fusion for precise medical image segmentation
    Li, Junwei
    Sun, Shengfeng
    Li, Shijie
    Xia, Ruixue
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 9067 - 9079