MP-FocalUNet: Multiscale parallel focal self-attention U-Net for medical image segmentation

被引:0
|
作者
Wang, Chuan [1 ]
Jiang, Mingfeng [1 ]
Li, Yang [1 ]
Wei, Bo [1 ]
Li, Yongming [2 ]
Wang, Pin [2 ]
Yang, Guang [3 ,4 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Chongqing Univ, Coll Commun Engn, Chongqing, Peoples R China
[3] Royal Brompton Hosp, Cardiovasc Res Ctr, London SW3 6NP, England
[4] Imperial Coll London, Natl Heart & Lung Inst, London SW7 2AZ, England
关键词
Focal self-attention mechanism; Medical image segmentation; Multiscale; Deep learning;
D O I
10.1016/j.cmpb.2024.108562
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: Medical image segmentation has been significantly improved in recent years with the progress of Convolutional Neural Networks (CNNs). Due to the inherent limitations of convolutional operations, CNNs perform poorly in learning the correlation information between global and long-range features. To solve this problem, some existing solutions rely on building deep encoders and down-sampling operations, but such methods are prone to produce redundant network structures and lose local details. Therefore, medical image segmentation tasks require better solutions to improve the modeling of the global context, while maintaining a strong grasp of the low-level details. Methods: We propose a novel multiscale parallel branch architecture (MP-FocalUNet). On the encoder side of MPFocalUNet, dual-scale sub-networks are used to extract information of different scales. A cross-scale "Feature Fusion" (FF) module was proposed to explore the potential of dual branch networks and fully utilize feature representations at different scales. On the decoder side, combined with the traditional CNN in parallel, focal selfattention is used for long-distance modeling, which can effectively capture the global dependencies and underlying spatial details in a shallower way. Results: Our proposed method is evaluated on both abdominal organ segmentation datasets and automatic cardiac diagnosis challenge datasets. Our method consistently outperforms several state-of-the-art segmentation methods with an average Dice score of 82.45% (2.68% higher than HC-Net) and 91.44% (0.35% higher than HC-Net) on the abdominal organ datasets and the automatic cardiac diagnosis challenge datasets, respectively. Conclusions: Our MP-FocalUNet is a novel encoder-decoder based multiscale parallel branch Transformer network, which solves the problem of insufficient long-distance modeling in CNNs and fuses image information at different scales. Extensive experiments on abdominal and cardiac medical image segmentation tasks show that our MP-FocalUNet outperforms other state-of-the-art methods. In the future, our work will focus on designing more lightweight Transformer-based models and better learning pixel-level intrinsic structural features generated by patch division in visual Transformers.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] CFU-Net: A Coarse-Fine U-Net With Multilevel Attention for Medical Image Segmentation
    Yin, Haitao
    Shao, Yudong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [32] Study on Echocardiographic Image Segmentation Based on Attention U-Net
    Wang, Kai
    Zhang, Jiwei
    Hachiya, Hirotaka
    Wu, Haiyuan
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 1091 - 1096
  • [33] Multiscale transunet plus plus : dense hybrid U-Net with transformer for medical image segmentation
    Wang, Bo
    Wang, Fan
    Dong, Pengwei
    Li, Chongyi
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (06) : 1607 - 1614
  • [34] RSU-Net: U-net based on residual and self-attention mechanism in the segmentation of cardiac magnetic resonance images
    Li, Yuan-Zhe
    Wang, Yi
    Huang, Yin-Hui
    Xiang, Ping
    Liu, Wen-Xi
    Lai, Qing-Quan
    Gao, Yi-Yuan
    Xu, Mao-Sheng
    Guo, Yi-Fan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 231
  • [35] OAU-net: Outlined Attention U-net for biomedical image segmentation
    Song, Haojie
    Wang, Yuefei
    Zeng, Shijie
    Guo, Xiaoyan
    Li, Zheheng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [36] Focusing the View: Enhancing U-Net with Convolutional Block Attention for Superior Medical Image Segmentation
    Nhu-Tai Do
    Dat Nguyen Khanh
    Tram-Tran Nguyen-Quynh
    Quoc-Huy Nguyen
    INTELLIGENCE OF THINGS: TECHNOLOGIES AND APPLICATIONS, ICIT 2024, VOL 2, 2025, 230 : 156 - 165
  • [37] Rethinking the unpretentious U-net for medical ultrasound image segmentation
    Chen, Gongping
    Li, Lei
    Zhang, Jianxun
    Dai, Yu
    PATTERN RECOGNITION, 2023, 142
  • [38] Design of Superpiexl U-Net Network for Medical Image Segmentation
    Wang H.
    Liu H.
    Guo Q.
    Deng K.
    Zhang C.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (06): : 1007 - 1017
  • [39] Comparative Analysis of U-Net with Transfer Learning and Attention Mechanism for Enhanced Medical Image Segmentation
    El Abassi, Fouzia
    Darouichi, Aziz
    Ouaarab, Aziz
    DIGITAL TECHNOLOGIES AND APPLICATIONS, ICDTA 2024, VOL 2, 2024, 1099 : 551 - 560
  • [40] Medical Ultrasound Image Segmentation Using U-Net Architecture
    Shereena, V. B.
    Raju, G.
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 361 - 372