MCPA: multi-scale cross perceptron attention network for 2D medical image segmentation

被引:0
|
作者
Xu, Liang [1 ,2 ]
Chen, Mingxiao [3 ]
Cheng, Yi [3 ]
Song, Pengwu [1 ,2 ]
Shao, Pengfei [3 ]
Shen, Shuwei [1 ,2 ]
Yao, Peng [4 ]
Xu, Ronald X. [1 ,2 ,3 ]
机构
[1] Univ Sci & Technol China, Sch Biomed Engn, Div Life Sci & Med, Hefei, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst Adv Res, Suzhou, Peoples R China
[3] Univ Sci & Technol China, Dept Precis Machinery & Precis Instrument, Hefei, Peoples R China
[4] Univ Sci & Technol China, Sch Microelect, Hefei, Peoples R China
关键词
Medical image; Segmentation; Multi-scale; Cross perceptron; Progressive dual-branch structure; VESSEL SEGMENTATION;
D O I
10.1007/s40747-024-01671-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The UNet architecture, based on convolutional neural networks (CNN), has demonstrated its remarkable performance in medical image analysis. However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome this limitation by effectively capturing global feature correlations. However, the integration of the Transformer modules may result in the loss of local contextual information during the global feature fusion process. In this work, we propose a 2D medical image segmentation model called multi-scale cross perceptron attention network (MCPA). The MCPA consists of three main components: an encoder, a decoder, and a Cross Perceptron. The Cross Perceptron first captures the local correlations using multiple Multi-scale Cross Perceptron modules, facilitating the fusion of features across scales. The resulting multi-scale feature vectors are then spatially unfolded, concatenated, and fed through a Global Perceptron module to model global dependencies. Considering the high computational cost of using 3D neural network models, and the fact that many important clinical data can only be obtained in two dimensions, our MCPA focuses on 2D medical image segmentation. Furthermore, we introduce a progressive dual-branch structure (PDBS) to address the semantic segmentation of the image involving finer tissue structures. This structure gradually shifts the segmentation focus of MCPA network training from large-scale structural features to more sophisticated pixel-level features. We evaluate our proposed MCPA model on several publicly available medical image datasets from different tasks and devices, including the open large-scale dataset of CT (Synapse), MRI (ACDC), and widely used 2D medical imaging datasets captured by fundus camera (DRIVE, CHASE_\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\_$$\end{document}DB1, HRF), and OCTA (ROSE). The experimental results show that our MCPA model achieves state-of-the-art performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] MSU-Net: Multi-Scale U-Net for 2D Medical Image Segmentation
    Su, Run
    Zhang, Deyun
    Liu, Jinhuai
    Cheng, Chuandong
    FRONTIERS IN GENETICS, 2021, 12
  • [22] Attention based multi-scale nested network for biomedical image segmentation
    Cheng, Dapeng
    Deng, Jia
    Xiao, Jinjie
    Yanyan, Mao
    Kang, Jialong
    Gai, Jiale
    Zhang, Baosheng
    Zhao, Feng
    HELIYON, 2024, 10 (14)
  • [23] Medical image segmentation method combining multi-scale and multi-head attention
    Wang W.-L.
    Wang T.-J.
    Chen J.-C.
    You W.-B.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (09): : 1796 - 1805
  • [24] MC-Net: multi-scale context-attention network for medical CT image segmentation
    Haiying Xia
    Mingjun Ma
    Haisheng Li
    Shuxiang Song
    Applied Intelligence, 2022, 52 : 1508 - 1519
  • [25] MC-Net: multi-scale context-attention network for medical CT image segmentation
    Xia, Haiying
    Ma, Mingjun
    Li, Haisheng
    Song, Shuxiang
    APPLIED INTELLIGENCE, 2022, 52 (02) : 1508 - 1519
  • [26] PMFSNet: Polarized multi-scale feature self-attention network for lightweight medical image segmentation
    Zhong, Jiahui
    Tian, Wenhong
    Xie, Yuanlun
    Liu, Zhijia
    Ou, Jie
    Tian, Taoran
    Zhang, Lei
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 261
  • [27] MedSegNet: A Lightweight Convolutional Network Combining Dual Self-Attention and Multi-Scale Attention for Medical Image Segmentation
    Bharati, Subrato
    Ahmad, M. Omair
    Swamy, M. N. S.
    2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024, : 965 - 969
  • [28] Multi-scale feature pyramid fusion network for medical image segmentation
    Bing Zhang
    Yang Wang
    Caifu Ding
    Ziqing Deng
    Linwei Li
    Zesheng Qin
    Zhao Ding
    Lifeng Bian
    Chen Yang
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 353 - 365
  • [29] HMDA: A Hybrid Model With Multi-Scale Deformable Attention for Medical Image Segmentation
    Wu, Mengmeng
    Liu, Tiantian
    Dai, Xin
    Ye, Chuyang
    Wu, Jinglong
    Funahashi, Shintaro
    Yan, Tianyi
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (02) : 1243 - 1255
  • [30] Multi-scale feature pyramid fusion network for medical image segmentation
    Zhang, Bing
    Wang, Yang
    Ding, Caifu
    Deng, Ziqing
    Li, Linwei
    Qin, Zesheng
    Ding, Zhao
    Bian, Lifeng
    Yang, Chen
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (02) : 353 - 365