CAT-Unet: An enhanced U-Net architecture with coordinate attention and skip-neighborhood attention transformer for medical image segmentation

被引:8
|
作者
Ding, Zhiquan [1 ,2 ]
Zhang, Yuejin [1 ,2 ]
Zhu, Chenxin [3 ]
Zhang, Guolong [1 ,2 ]
Li, Xiong [4 ]
Jiang, Nan [1 ]
Que, Yue [1 ]
Peng, Yuanyuan [5 ]
Guan, Xiaohui [6 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] East China Jiaotong Univ, Inst Computat & Biomech, Nanchang 330013, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Math & Phys, Suzhou 215028, Peoples R China
[4] East China Jiaotong Univ, Sch Software, Nanchang 330013, Peoples R China
[5] East China Jiaotong Univ, Sch Elect & Automat Engn, Nanchang 330013, Peoples R China
[6] Nanchang Univ, Natl Engn Res Ctr Bioengn Drugs & Technol, Nanchang, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical image segmentation; Neighborhood attention; Depthwise separable convolutions; Coordinate attention; PLUS PLUS; NETWORK;
D O I
10.1016/j.ins.2024.120578
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of deep learning, the U -Net network, based on a U-shaped architecture and skip connections, has found widespread application in various medical image segmentation tasks. However, the receptive field of the standard convolution operation is limited, because it is difficult to achieve global and long-distance semantic information interaction. Inspired by the advantages of ConvNext and Neighborhood Attention (NA), we propose CAT-Unet in this study to address the aforementioned challenges. We effectively reduce the number of parameters by utilizing large kernels and depthwise separable convolutions. Meanwhile, we introduce a Coordinate Attention (CA) module, which enables the model to learn more comprehensive and contextual information from surrounding regions. Furthermore, we introduce Skip -NAT (Neighborhood Attention Transformer) as the main algorithmic framework, replacing U-Net's original skipconnection layers, to lessen the impact of shallow features on network efficiency. Experimental results show that CAT-Unet achieves better segmentation results. On the ISIC2018 dataset, the best results for Dice (Dice Coefficient), IoU (Intersection over Union), and HD (Hausdorff Distance) are 90.26%, 83.58%, and 4.259, respectively. For the PH2 dataset, the best Dice, IoU, and HD results are 96.49%, 91.81%, and 3.971, respectively. Finally, on the DSB2018 dataset, the best Dice, IoU, and HD results are 94.58%, 88.78%, and 3.749, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation
    Petit, Olivier
    Thome, Nicolas
    Rambour, Clement
    Themyr, Loic
    Collins, Toby
    Soler, Luc
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 267 - 276
  • [2] GA-UNet: A Lightweight Ghost and Attention U-Net for Medical Image Segmentation
    Pang, Bo
    Chen, Lianghong
    Tao, Qingchuan
    Wang, Enhui
    Yu, Yanmei
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (04): : 1874 - 1888
  • [3] Residual-Attention UNet plus plus : A Nested Residual-Attention U-Net for Medical Image Segmentation
    Li, Zan
    Zhang, Hong
    Li, Zhengzhen
    Ren, Zuyue
    APPLIED SCIENCES-BASEL, 2022, 12 (14):
  • [4] MDA-Unet: A Multi-Scale Dilated Attention U-Net for Medical Image Segmentation
    Amer, Alyaa
    Lambrou, Tryphon
    Ye, Xujiong
    APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [5] Half-UNet: A Simplified U-Net Architecture for Medical Image Segmentation
    Lu, Haoran
    She, Yifei
    Tie, Jun
    Xu, Shengzhou
    FRONTIERS IN NEUROINFORMATICS, 2022, 16
  • [6] UNet plus plus : A Nested U-Net Architecture for Medical Image Segmentation
    Zhou, Zongwei
    Siddiquee, Md Mahfuzur Rahman
    Tajbakhsh, Nima
    Liang, Jianming
    DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, DLMIA 2018, 2018, 11045 : 3 - 11
  • [7] Not Another Dual Attention UNet Transformer (NNDA-UNETR): a plug-and-play parallel dual attention block in U-Net with enhanced residual blocks for medical image segmentation
    Cao, Lei
    Zhang, Qikai
    Fan, Chunjiang
    Cao, Yongnian
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (12) : 9169 - 9192
  • [8] ATTENTION UNET plus plus : A NESTED ATTENTION-AWARE U-NET FOR LIVER CT IMAGE SEGMENTATION
    Li, Chen
    Tan, Yusong
    Chen, Wei
    Luo, Xin
    Gao, Yuanming
    Jia, Xiaogang
    Wang, Zhiying
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 345 - 349
  • [9] Hybrid dilation and attention residual U-Net for medical image segmentation
    Wang, Zekun
    Zou, Yanni
    Liu, Peter X.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 134
  • [10] Hybrid Swin Deformable Attention U-Net for Medical Image Segmentation
    Wang, Lichao
    Huang, Jiahao
    Xing, Xiaodan
    Yang, Guang
    2023 19TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, SIPAIM, 2023,