HST-MRF: Heterogeneous Swin Transformer With Multi-Receptive Field for Medical Image Segmentation

Times cited: 0
Authors
Huang, Xiaofei [1 ]
Gong, Hongfang [1 ]
Zhang, Jin [2 ]
Affiliations
[1] Changsha Univ Sci & Technol, Sch Math & Stat, Changsha 410114, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image segmentation; Transformers; Biomedical imaging; Task analysis; Computational modeling; Feature extraction; Visualization; Heterogeneous attention; medical imaging segmentation; multi-receptive field; patch segmentation; NETWORK; CONNECTIONS;
DOI
10.1109/JBHI.2024.3397047
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The Transformer has been successfully used in medical image segmentation due to its excellent long-range modeling capabilities. However, patch segmentation is necessary when building a Transformer-based model. This process ignores the tissue structure features within each patch, resulting in the loss of shallow representation information. In this study, we propose a Heterogeneous Swin Transformer with Multi-Receptive Field (HST-MRF) model that fuses patch information from different receptive fields to address the loss of feature information caused by patch segmentation. The heterogeneous Swin Transformer (HST) is the core module: it achieves the interaction of multi-receptive field patch information through heterogeneous attention and passes it to the next stage for progressive learning, thus complementing the patch structure information. We also designed a two-stage fusion module, multimodal bilinear pooling (MBP), to assist HST in further fusing multi-receptive field information and in combining low-level and high-level semantic information for accurate localization of lesion regions. In addition, we developed adaptive patch embedding (APE) and soft channel attention (SCA) modules to retain more valuable information when acquiring patch embeddings and filtering channel features, respectively, thereby improving segmentation quality. We evaluated HST-MRF on multiple datasets for polyp, skin lesion, and breast ultrasound segmentation tasks. Experimental results show that our proposed method outperforms state-of-the-art models. Furthermore, through ablation experiments and qualitative analysis, we verified the effectiveness of each module and the benefits of multi-receptive field segmentation in reducing the loss of structural information.
Pages: 4048-4061
Number of pages: 14
Related papers
50 items in total
  • [31] SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation
    Wang, Yiqing
    Li, Zihan
    Mei, Jieru
    Wei, Zihao
    Liu, Li
    Wang, Chen
    Sang, Shengtian
    Yuille, Alan L.
    Xie, Cihang
    Zhou, Yuyin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 486 - 496
  • [32] Demystifying the effect of receptive field size in U-Net models for medical image segmentation
    Loos, Vincent
    Pardasani, Rohit
    Awasthi, Navchetan
    JOURNAL OF MEDICAL IMAGING, 2024, 11 (05)
  • [33] STM-UNet: An Efficient U-shaped Architecture Based on Swin Transformer and Multiscale MLP for Medical Image Segmentation
    Shi, Lei
    Gao, Tianyu
    Zhang, Zheng
    Zhang, Junxing
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2003 - 2008
  • [34] Swin Unet3D: a three-dimensional medical image segmentation network combining vision transformer and convolution
    Yimin Cai
    Yuqing Long
    Zhenggong Han
    Mingkun Liu
    Yuchen Zheng
    Wei Yang
    Liming Chen
    BMC Medical Informatics and Decision Making, 23
  • [35] Swin Unet3D: a three-dimensional medical image segmentation network combining vision transformer and convolution
    Cai, Yimin
    Long, Yuqing
    Han, Zhenggong
    Liu, Mingkun
    Zheng, Yuchen
    Yang, Wei
    Chen, Liming
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [36] MFHARFNet: multi-branch feature hybrid and adaptive receptive field network for image segmentation
    Li, Meng
    Yun, Juntong
    Jiang, Du
    Tao, Bo
    Liu, Rong
    Li, Gongfa
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [37] MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
    Du, Siyi
    Bayasi, Nourhan
    Hamarneh, Ghassan
    Garbi, Rafeef
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 448 - 458
  • [38] Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation
    Zhou, Wuyi
    Zeng, Xianhua
    Zhou, Mingkun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [39] Feature ensemble network for medical image segmentation with multi-scale atrous transformer
    Gai, Di
    Geng, Yuhan
    Huang, Xia
    Huang, Zheng
    Xiong, Xin
    Zhou, Ruihua
    Wang, Qi
    IET IMAGE PROCESSING, 2024, 18 (11) : 3082 - 3092
  • [40] LTMSegnet: Lightweight multi-scale medical image segmentation combining Transformer and MLP
    Huang, Xin
    Tang, Hongxiang
    Ding, Yan
    Li, Yuanyuan
    Zhu, Zhiqin
    Yang, Pan
    Computers in Biology and Medicine, 2024, 183