CAT-Unet: An enhanced U-Net architecture with coordinate attention and skip-neighborhood attention transformer for medical image segmentation

Cited by: 8
Authors
Ding, Zhiquan [1,2]
Zhang, Yuejin [1,2]
Zhu, Chenxin [3]
Zhang, Guolong [1,2]
Li, Xiong [4]
Jiang, Nan [1]
Que, Yue [1]
Peng, Yuanyuan [5]
Guan, Xiaohui [6]
Affiliations
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] East China Jiaotong Univ, Inst Computat & Biomech, Nanchang 330013, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Sch Math & Phys, Suzhou 215028, Peoples R China
[4] East China Jiaotong Univ, Sch Software, Nanchang 330013, Peoples R China
[5] East China Jiaotong Univ, Sch Elect & Automat Engn, Nanchang 330013, Peoples R China
[6] Nanchang Univ, Natl Engn Res Ctr Bioengn Drugs & Technol, Nanchang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Medical image segmentation; Neighborhood attention; Depthwise separable convolutions; Coordinate attention; PLUS PLUS; NETWORK;
DOI
10.1016/j.ins.2024.120578
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the rise of deep learning, the U-Net network, based on a U-shaped architecture and skip connections, has found widespread application in various medical image segmentation tasks. However, the receptive field of the standard convolution operation is limited, which makes it difficult to achieve global and long-distance semantic information interaction. Inspired by the advantages of ConvNeXt and Neighborhood Attention (NA), we propose CAT-Unet in this study to address this limitation. We reduce the number of parameters by using large kernels and depthwise separable convolutions. Meanwhile, we introduce a Coordinate Attention (CA) module, which enables the model to learn more comprehensive contextual information from surrounding regions. Furthermore, we introduce Skip-NAT (Neighborhood Attention Transformer) as the main algorithmic framework, replacing U-Net's original skip-connection layers, to lessen the impact of shallow features on network efficiency. Experimental results show that CAT-Unet achieves better segmentation results. On the ISIC2018 dataset, the best results for Dice (Dice Coefficient), IoU (Intersection over Union), and HD (Hausdorff Distance) are 90.26%, 83.58%, and 4.259, respectively. On the PH2 dataset, the best Dice, IoU, and HD results are 96.49%, 91.81%, and 3.971, respectively. Finally, on the DSB2018 dataset, the best Dice, IoU, and HD results are 94.58%, 88.78%, and 3.749, respectively.
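Two ideas in the abstract can be illustrated with a small sketch: why depthwise separable convolutions need fewer weights than standard convolutions, and how the Dice and IoU scores are computed for binary masks. This is not the authors' implementation. The function names, the 64-channel width, and the 7x7 kernel size are illustrative assumptions, not values taken from the paper.

```python
def conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias ignored)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Weight count of a depthwise k x k convolution (one filter per
    input channel) followed by a 1 x 1 pointwise convolution."""
    return c_in * k * k + c_in * c_out


def dice_and_iou(pred, target):
    """Dice coefficient and IoU for two binary masks.

    Both masks are flat sequences of 0/1 integers of equal length.
    Two empty masks are treated as a perfect match (an assumption;
    other conventions exist).
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(p & t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    union = total - intersection
    if total == 0:
        return 1.0, 1.0
    return 2 * intersection / total, intersection / union


if __name__ == "__main__":
    # Hypothetical layer: 64 -> 64 channels with a 7 x 7 kernel.
    print(conv_params(64, 64, 7))                 # 200704
    print(depthwise_separable_params(64, 64, 7))  # 7232
    # Toy 4-pixel masks that overlap in one pixel.
    print(dice_and_iou([1, 1, 0, 0], [1, 0, 1, 0]))
```

In this hypothetical layer the separable form uses 7232 weights instead of 200704, which is why large kernels become affordable. The Dice and IoU values reported in the abstract are percentages of the same quantities computed here as fractions.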
Pages: 15