MCPA: multi-scale cross perceptron attention network for 2D medical image segmentation

被引：0

作者：

Xu, Liang ^{[1
,2
]}

Chen, Mingxiao ^{[3
]}

Cheng, Yi ^{[3
]}

Song, Pengwu ^{[1
,2
]}

Shao, Pengfei ^{[3
]}

Shen, Shuwei ^{[1
,2
]}

Yao, Peng ^{[4
]}

Xu, Ronald X. ^{[1
,2
,3
]}

机构：

[1] Univ Sci & Technol China, Sch Biomed Engn, Div Life Sci & Med, Hefei, Peoples R China

[2] Univ Sci & Technol China, Suzhou Inst Adv Res, Suzhou, Peoples R China

[3] Univ Sci & Technol China, Dept Precis Machinery & Precis Instrument, Hefei, Peoples R China

[4] Univ Sci & Technol China, Sch Microelect, Hefei, Peoples R China

来源：

COMPLEX & INTELLIGENT SYSTEMS | 2025年 / 11卷 / 01期

关键词：

Medical image; Segmentation; Multi-scale; Cross perceptron; Progressive dual-branch structure; VESSEL SEGMENTATION;

D O I：

10.1007/s40747-024-01671-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The UNet architecture, based on convolutional neural networks (CNN), has demonstrated its remarkable performance in medical image analysis. However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome this limitation by effectively capturing global feature correlations. However, the integration of the Transformer modules may result in the loss of local contextual information during the global feature fusion process. In this work, we propose a 2D medical image segmentation model called multi-scale cross perceptron attention network (MCPA). The MCPA consists of three main components: an encoder, a decoder, and a Cross Perceptron. The Cross Perceptron first captures the local correlations using multiple Multi-scale Cross Perceptron modules, facilitating the fusion of features across scales. The resulting multi-scale feature vectors are then spatially unfolded, concatenated, and fed through a Global Perceptron module to model global dependencies. Considering the high computational cost of using 3D neural network models, and the fact that many important clinical data can only be obtained in two dimensions, our MCPA focuses on 2D medical image segmentation. Furthermore, we introduce a progressive dual-branch structure (PDBS) to address the semantic segmentation of the image involving finer tissue structures. This structure gradually shifts the segmentation focus of MCPA network training from large-scale structural features to more sophisticated pixel-level features. We evaluate our proposed MCPA model on several publicly available medical image datasets from different tasks and devices, including the open large-scale dataset of CT (Synapse), MRI (ACDC), and widely used 2D medical imaging datasets captured by fundus camera (DRIVE, CHASE_\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\_$$\end{document}DB1, HRF), and OCTA (ROSE). The experimental results show that our MCPA model achieves state-of-the-art performance.

引用

页数：16

共 50 条

[41] A Multi-Scale Channel Attention Network for Prostate Segmentation
Ding, Meiwen
Lin, Zhiping
Lee, Chau Hung
Tan, Cher Heng
Huang, Weimin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (05) : 1754 - 1758
[42] MSA-Net: Multi-scale feature fusion network with enhanced attention module for 3D medical image segmentation
Wang, Shuo
Wang, Yuanhong
Peng, Yanjun
Chen, Xue
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
[43] Multi-scale Hierarchical Vision Transformer with Cascaded Attention Decoding for Medical Image Segmentation
Rahman, Md Mostafijur
Marculescu, Radu
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1526 - 1544
[44] Enhancing medical image segmentation with MA-UNet: a multi-scale attention framework
Li, Hongzhi
Ren, Zhanghao
Zhu, Guoqing
Liang, Yaoju
Cui, Han
Wang, Chaozeyu
Wang, Jiaxi
VISUAL COMPUTER, 2025,
[45] DSA: Deformable Segmentation Attention for Multi-Scale Fisheye Image Segmentation
Jiang, Junzhe
Xu, Cheng
Liu, Hongzhe
Fu, Ying
Jian, Muwei
ELECTRONICS, 2023, 12 (19)
[46] Feature ensemble network for medical image segmentation with multi-scale atrous transformer
Gai, Di
Geng, Yuhan
Huang, Xia
Huang, Zheng
Xiong, Xin
Zhou, Ruihua
Wang, Qi
IET IMAGE PROCESSING, 2024, 18 (11) : 3082 - 3092
[47] Medical image segmentation network based on multi-scale frequency domain filter
Chen, Yufeng
Zhang, Xiaoqian
Peng, Lifan
He, Youdong
Sun, Feng
Sun, Huaijiang
NEURAL NETWORKS, 2024, 175
[48] Sub-pixel multi-scale fusion network for medical image segmentation
Jing Li
Qiaohong Chen
Xian Fang
Multimedia Tools and Applications, 2024, 83 (41) : 89355 - 89373
[49] SAMP-Net: a medical image segmentation network with split attention and multi-layer perceptron
Ma, Xiaoxuan
Shan, Sihan
Sui, Dong
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2025,
[50] MSA-MaxNet: Multi-Scale Attention Enhanced Multi-Axis Vision Transformer Network for Medical Image Segmentation
Wu, Wei
Huang, Junfeng
Zhang, Mingxuan
Li, Yichen
Yu, Qijia
Zhao, Qi
JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2024, 28 (24)

← 1 2 3 4 5 →