CAT-Unet: An enhanced U-Net architecture with coordinate attention and skip-neighborhood attention transformer for medical image segmentation

被引：8

作者：

Ding, Zhiquan ^{[1
,2
]}

Zhang, Yuejin ^{[1
,2
]}

Zhu, Chenxin ^{[3
]}

Zhang, Guolong ^{[1
,2
]}

Li, Xiong ^{[4
]}

Jiang, Nan ^{[1
]}

Que, Yue ^{[1
]}

Peng, Yuanyuan ^{[5
]}

Guan, Xiaohui ^{[6
]}

机构：

[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China

[2] East China Jiaotong Univ, Inst Computat & Biomech, Nanchang 330013, Peoples R China

[3] Xian Jiaotong Liverpool Univ, Sch Math & Phys, Suzhou 215028, Peoples R China

[4] East China Jiaotong Univ, Sch Software, Nanchang 330013, Peoples R China

[5] East China Jiaotong Univ, Sch Elect & Automat Engn, Nanchang 330013, Peoples R China

[6] Nanchang Univ, Natl Engn Res Ctr Bioengn Drugs & Technol, Nanchang, Peoples R China

来源：

INFORMATION SCIENCES | 2024年 / 670卷

基金：

中国国家自然科学基金;

关键词：

Medical image segmentation; Neighborhood attention; Depthwise separable convolutions; Coordinate attention; PLUS PLUS; NETWORK;

D O I：

10.1016/j.ins.2024.120578

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the rise of deep learning, the U -Net network, based on a U-shaped architecture and skip connections, has found widespread application in various medical image segmentation tasks. However, the receptive field of the standard convolution operation is limited, because it is difficult to achieve global and long-distance semantic information interaction. Inspired by the advantages of ConvNext and Neighborhood Attention (NA), we propose CAT-Unet in this study to address the aforementioned challenges. We effectively reduce the number of parameters by utilizing large kernels and depthwise separable convolutions. Meanwhile, we introduce a Coordinate Attention (CA) module, which enables the model to learn more comprehensive and contextual information from surrounding regions. Furthermore, we introduce Skip -NAT (Neighborhood Attention Transformer) as the main algorithmic framework, replacing U-Net's original skipconnection layers, to lessen the impact of shallow features on network efficiency. Experimental results show that CAT-Unet achieves better segmentation results. On the ISIC2018 dataset, the best results for Dice (Dice Coefficient), IoU (Intersection over Union), and HD (Hausdorff Distance) are 90.26%, 83.58%, and 4.259, respectively. For the PH2 dataset, the best Dice, IoU, and HD results are 96.49%, 91.81%, and 3.971, respectively. Finally, on the DSB2018 dataset, the best Dice, IoU, and HD results are 94.58%, 88.78%, and 3.749, respectively.

引用

页数：15

共 50 条

[41] MemAU-Net: Memory-Enhanced Attention U-Net for Medical Image Forgery Localization
Wang, Nan
Yi, Liping
Wang, Gang
Liu, Xiaoguang
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[42] MDAN-UNet: Multi-Scale and Dual Attention Enhanced Nested U-Net Architecture for Segmentation of Optical Coherence Tomography Images
Liu, Wen
Sun, Yankui
Ji, Qingge
ALGORITHMS, 2020, 13 (03)
[43] Enhancing medical image segmentation with a multi-transformer U-Net
Dan, Yongping
Jin, Weishou
Yue, Xuebin
Wang, Zhida
PEERJ, 2024, 12
[44] LFU-Net: A Lightweight U-Net with Full Skip Connections for Medical Image Segmentation
Deng, Yunjiao
Wang, Hui
Hou, Yulei
Liang, Shunpan
Zeng, Daxing
CURRENT MEDICAL IMAGING, 2023, 19 (04) : 347 - 360
[45] DEA-UNet: a dense-edge-attention UNet architecture for medical image segmentation
Zeng, Zhenhuan
Fan, Chaodong
Xiao, Leyi
Qu, Xilong
JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
[46] U-Net with Coordinate Attention and VGGNet: A Grape Image Segmentation Algorithm Based on Fusion Pyramid Pooling and the Dual-Attention Mechanism
Yi, Xiaomei
Zhou, Yue
Wu, Peng
Wang, Guoying
Mo, Lufeng
Chola, Musenge
Fu, Xinyun
Qian, Pengxiang
AGRONOMY-BASEL, 2024, 14 (05):
[47] DC-UNet: Rethinking the U-Net architecture with dual channel efficient CNN for medical image segmentation
Lou, Ange
Guan, Shuyue
Loew, Murray
Progress in Biomedical Optics and Imaging - Proceedings of SPIE, 2021, 11596
[48] NVTrans-UNet: Neighborhood vision transformer based U-Net for multi-modal cardiac MR image segmentation
Li, Bingjie
Yang, Tiejun
Zhao, Xiang
JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2023, 24 (03):
[49] DCSAU-Net: A deeper and more compact split-attention U-Net for medical image segmentation
Xu, Qing
Ma, Zhicheng
He, Na
Duan, Wenting
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 154
[50] AttResDU-Net: Medical Image Segmentation Using Attention-based Residual Double U-Net
Khan, Akib Mohammed
Ashrafee, Alif
Khan, Fahim Shahriar
Hasan, Md. Bakhtiar
Kabir, Md. Hasanul
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,

← 1 2 3 4 5 →