MCPA: multi-scale cross perceptron attention network for 2D medical image segmentation

被引:0
|
作者
Xu, Liang [1 ,2 ]
Chen, Mingxiao [3 ]
Cheng, Yi [3 ]
Song, Pengwu [1 ,2 ]
Shao, Pengfei [3 ]
Shen, Shuwei [1 ,2 ]
Yao, Peng [4 ]
Xu, Ronald X. [1 ,2 ,3 ]
机构
[1] Univ Sci & Technol China, Sch Biomed Engn, Div Life Sci & Med, Hefei, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst Adv Res, Suzhou, Peoples R China
[3] Univ Sci & Technol China, Dept Precis Machinery & Precis Instrument, Hefei, Peoples R China
[4] Univ Sci & Technol China, Sch Microelect, Hefei, Peoples R China
关键词
Medical image; Segmentation; Multi-scale; Cross perceptron; Progressive dual-branch structure; VESSEL SEGMENTATION;
D O I
10.1007/s40747-024-01671-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The UNet architecture, based on convolutional neural networks (CNN), has demonstrated its remarkable performance in medical image analysis. However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome this limitation by effectively capturing global feature correlations. However, the integration of the Transformer modules may result in the loss of local contextual information during the global feature fusion process. In this work, we propose a 2D medical image segmentation model called multi-scale cross perceptron attention network (MCPA). The MCPA consists of three main components: an encoder, a decoder, and a Cross Perceptron. The Cross Perceptron first captures the local correlations using multiple Multi-scale Cross Perceptron modules, facilitating the fusion of features across scales. The resulting multi-scale feature vectors are then spatially unfolded, concatenated, and fed through a Global Perceptron module to model global dependencies. Considering the high computational cost of using 3D neural network models, and the fact that many important clinical data can only be obtained in two dimensions, our MCPA focuses on 2D medical image segmentation. Furthermore, we introduce a progressive dual-branch structure (PDBS) to address the semantic segmentation of the image involving finer tissue structures. This structure gradually shifts the segmentation focus of MCPA network training from large-scale structural features to more sophisticated pixel-level features. We evaluate our proposed MCPA model on several publicly available medical image datasets from different tasks and devices, including the open large-scale dataset of CT (Synapse), MRI (ACDC), and widely used 2D medical imaging datasets captured by fundus camera (DRIVE, CHASE_\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\_$$\end{document}DB1, HRF), and OCTA (ROSE). The experimental results show that our MCPA model achieves state-of-the-art performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Lightweight multi-scale dynamic selection network for medical image segmentation
    Dong, Xue-Mei
    Sun, Yu
    Wang, Lili
    INFORMATION SCIENCES, 2024, 677
  • [32] An improved multi-scale feature extraction network for medical image segmentation
    Guo, Haoyu
    Shi, Liuliu
    Liu, Jinlong
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (12) : 8331 - 8346
  • [33] Multi-Scale Attention Generative Adversarial Network for Medical Image Enhancement
    Zhong, Guojin
    Ding, Weiping
    Chen, Long
    Wang, Yingxu
    Yu, Yu-Feng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (04): : 1113 - 1125
  • [34] Multi-scale contextual semantic enhancement network for 3D medical image segmentation
    Xia, Tingjian
    Huang, Guoheng
    Pun, Chi-Man
    Zhang, Weiwen
    Li, Jiajian
    Ling, Wing-Kuen
    Lin, Chao
    Yang, Qi
    PHYSICS IN MEDICINE AND BIOLOGY, 2022, 67 (22):
  • [35] Automatic lumbar spinal MRI image segmentation with a multi-scale attention network
    Haixing Li
    Haibo Luo
    Wang Huan
    Zelin Shi
    Chongnan Yan
    Lanbo Wang
    Yueming Mu
    Yunpeng Liu
    Neural Computing and Applications, 2021, 33 : 11589 - 11602
  • [36] Automatic lumbar spinal MRI image segmentation with a multi-scale attention network
    Li, Haixing
    Luo, Haibo
    Huan, Wang
    Shi, Zelin
    Yan, Chongnan
    Wang, Lanbo
    Mu, Yueming
    Liu, Yunpeng
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (18): : 11589 - 11602
  • [37] Multi-Scale Attention Network for Image Cropping
    Lian, Tianpei
    Xian, Ke
    Pan, Zhiyu
    Hong, Chaoyi
    Cao, Zhiguo
    Zhong, Weicai
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2640 - 2645
  • [38] Multi-scale attention network for image inpainting
    Qin, Jia
    Bai, Huihui
    Zhao, Yao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 204
  • [39] MRAU-net: Multi-scale residual attention U-shaped network for medical image segmentation
    Shu, Xin
    Li, Xiaotong
    Zhang, Xin
    Shao, Changbin
    Yan, Xi
    Huang, Shucheng
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118
  • [40] Dual-axis Generalized Cross Attention and Shape-aware Network for 2D medical image segmentation
    Zhang, Zengmin
    Peng, Yanjun
    Duan, Xiaomeng
    Hou, Qingfan
    Li, Zhengyu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107