HST-MRF: Heterogeneous Swin Transformer With Multi-Receptive Field for Medical Image Segmentation

Citations: 0

Authors:

Huang, Xiaofei [1]

Gong, Hongfang [1]

Zhang, Jin [2]

Affiliations:

[1] Changsha Univ Sci & Technol, Sch Math & Stat, Changsha 410114, Peoples R China

[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China

Source:

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS | 2024 / Vol. 28 / No. 07

Funding:

National Natural Science Foundation of China;

Keywords:

Image segmentation; Transformers; Biomedical imaging; Task analysis; Computational modeling; Feature extraction; Visualization; Heterogeneous attention; medical imaging segmentation; multi-receptive field; patch segmentation; NETWORK; CONNECTIONS;

DOI:

10.1109/JBHI.2024.3397047

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

The Transformer has been successfully used in medical image segmentation due to its excellent long-range modeling capabilities. However, patch segmentation is necessary when building a Transformer-based model. This process ignores the tissue structure features within each patch, resulting in the loss of shallow representation information. In this study, we propose a Heterogeneous Swin Transformer with Multi-Receptive Field (HST-MRF) model that fuses patch information from different receptive fields to address the loss of feature information caused by patch segmentation. The heterogeneous Swin Transformer (HST) is the core module: it lets patch information from multiple receptive fields interact through heterogeneous attention and passes the result to the next stage for progressive learning, thus complementing the patch structure information. We also designed a two-stage fusion module, multimodal bilinear pooling (MBP), to assist HST in further fusing multi-receptive field information and in combining low-level and high-level semantic information for accurate localization of lesion regions. In addition, we developed adaptive patch embedding (APE) and soft channel attention (SCA) modules to retain more valuable information when acquiring patch embeddings and filtering channel features, respectively, thereby improving segmentation quality. We evaluated HST-MRF on multiple datasets for polyp, skin lesion, and breast ultrasound segmentation tasks. Experimental results show that the proposed method outperforms state-of-the-art models. Furthermore, through ablation experiments and qualitative analysis, we verified the effectiveness of each module and the benefits of multi-receptive field segmentation in reducing the loss of structural information.

Pages: 4048-4061

Page count: 14

50 items in total

[41] CTRANS: A MULTI-RESOLUTION CONVOLUTION-TRANSFORMER NETWORK FOR MEDICAL IMAGE SEGMENTATION
Gong, Zhendi
French, Andrew P.
Qiu, Guoping
Chen, Xin
IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024
[42] ST-Unet: Swin Transformer boosted U-Net with Cross-Layer Feature Enhancement for medical image segmentation
Zhang, Jing
Qin, Qiuge
Ye, Qi
Ruan, Tong
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
[43] Multi-task Heterogeneous Framework for Semi-supervised Medical Image Segmentation
Cao, Jinghan
Fan, Huijie
Fu, Shengpeng
Xu, Ling
Chen, Xi'ai
Lin, Sen
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II, 2025, 15202 : 77 - 88
[44] Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
Rahman, Md Mostafijur
Marculescu, Radu
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1526 - 1544
[45] MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion
Liang, Zhiwei
Zhao, Kui
Liang, Gang
Li, Siyu
Wu, Yifei
Zhou, Yiping
KNOWLEDGE-BASED SYSTEMS, 2023, 280
[46] MSMVT: Semi-Supervised Framework with Multi-Scale and Multi-View Transformer for Medical Image Segmentation
Li, Feixiang
Jiang, Ailian
Computer Engineering and Applications, 61 (02) : 273 - 282
[47] MS-Former: Multi-Scale Self-Guided Transformer for Medical Image Segmentation
Karimijafarbigloo, Sanaz
Azad, Reza
Kazerouni, Amirhossein
Merhof, Dorit
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 680 - 694
[48] Multi-scale convolutional attention frequency-enhanced transformer network for medical image segmentation
Yan, Shun
Yang, Benquan
Chen, Aihua
Zhao, Xiaoming
Zhang, Shiqing
INFORMATION FUSION, 2025, 119
[49] MSGAT: Multi-scale gated axial reverse attention transformer network for medical image segmentation
Liu, Yanjun
Yun, Haijiao
Xia, Yang
Luan, Jinyang
Li, Mingjing
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
[50] CascadeMedSeg: integrating pyramid vision transformer with multi-scale fusion for precise medical image segmentation
Li, Junwei
Sun, Shengfeng
Li, Shijie
Xia, Ruixue
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 9067 - 9079