Focus U-Net: A novel dual attention-gated CNN for polyp segmentation during colonoscopy

被引:89
|
作者
Yeung, Michael [1 ,2 ]
Sala, Evis [1 ,3 ]
Schonlieb, Carola-Bibiane [4 ]
Rundo, Leonardo [1 ,3 ]
机构
[1] Univ Cambridge, Dept Radiol, Box 218,Cambridge Biomed Campus, Cambridge CB2 0QQ, England
[2] Univ Cambridge, Sch Clin Med, Cambridge CB2 0SP, England
[3] Univ Cambridge, Canc Res UK Cambridge Ctr, Cambridge CB2 0RE, England
[4] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge CB3 0WA, England
基金
英国科学技术设施理事会; 英国惠康基金; 英国工程与自然科学研究理事会; 欧盟地平线“2020”;
关键词
Polyp segmentation; Colorectal cancer; Colonoscopy; Computer-aided diagnosis; Focus U-Net; Attention mechanisms; Loss function; COLORECTAL-CANCER; MISS RATE; NETWORKS; RISK;
D O I
10.1016/j.compbiomed.2021.104815
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Colonoscopy remains the gold-standard screening for colorectal cancer. However, significant miss rates for polyps have been reported, particularly when there are multiple small adenomas. This presents an opportunity to leverage computer-aided systems to support clinicians and reduce the number of polyps missed. Method: In this work we introduce the Focus U-Net, a novel dual attention-gated deep neural network, which combines efficient spatial and channel-based attention into a single Focus Gate module to encourage selective learning of polyp features. The Focus U-Net incorporates several further architectural modifications, including the addition of short-range skip connections and deep supervision. Furthermore, we introduce the Hybrid Focal loss, a new compound loss function based on the Focal loss and Focal Tversky loss, designed to handle classimbalanced image segmentation. For our experiments, we selected five public datasets containing images of polyps obtained during optical colonoscopy: CVC-ClinicDB, Kvasir-SEG, CVC-ColonDB, ETIS-Larib PolypDB and EndoScene test set. We first perform a series of ablation studies and then evaluate the Focus U-Net on the CVCClinicDB and Kvasir-SEG datasets separately, and on a combined dataset of all five public datasets. To evaluate model performance, we use the Dice similarity coefficient (DSC) and Intersection over Union (IoU) metrics. Results: Our model achieves state-of-the-art results for both CVC-ClinicDB and Kvasir-SEG, with a mean DSC of 0.941 and 0.910, respectively. When evaluated on a combination of five public polyp datasets, our model similarly achieves state-of-the-art results with a mean DSC of 0.878 and mean IoU of 0.809, a 14% and 15% improvement over the previous state-of-the-art results of 0.768 and 0.702, respectively. Conclusions: This study shows the potential for deep learning to provide fast and accurate polyp segmentation results for use during colonoscopy. The Focus U-Net may be adapted for future use in newer non-invasive colorectal cancer screening and more broadly to other biomedical image segmentation tasks similarly involving class imbalance and requiring efficiency.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] U-Net with Attention Mechanism for Retinal Vessel Segmentation
    Si, Ze
    Fu, Dongmei
    Li, Jiahao
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 668 - 677
  • [32] Synergistic attention U-Net for sublingual vein segmentation
    Tingxiao Yang
    Yuichiro Yoshimura
    Akira Morita
    Takao Namiki
    Toshiya Nakaguchi
    Artificial Life and Robotics, 2019, 24 : 550 - 559
  • [33] Synergistic attention U-Net for sublingual vein segmentation
    Yang, Tingxiao
    Yoshimura, Yuichiro
    Morita, Akira
    Namiki, Takao
    Nakaguchi, Toshiya
    ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (04) : 550 - 559
  • [34] Epileptic Seizure Detection in EEG via Fusion of Multi-View Attention-Gated U-Net Deep Neural Networks
    Chatzichristos, C.
    Dan, J.
    Narayanan, A. Mundanad
    Seeuws, N.
    Vandecasteele, K.
    De Vos, M.
    Bertrand, A.
    Van Huffel, S.
    2020 IEEE SIGNAL PROCESSING IN MEDICINE AND BIOLOGY SYMPOSIUM, 2020,
  • [35] A Dense-Gated U-Net for Brain Lesion Segmentation
    Ji, Zhongyi
    Han, Xiao
    Lin, Tong
    Wang, Wenmin
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 104 - 107
  • [36] DSU-Net: Dual-Stage U-Net based on CNN and Transformer for skin lesion segmentation
    Zhong, Longwei
    Li, Tiansong
    Cui, Meng
    Cui, Shaoguo
    Wang, Hongkui
    Yu, Li
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [37] AttU-NET: Attention U-Net for Brain Tumor Segmentation
    Wang, Sihan
    Li, Lei
    Zhuang, Xiahai
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT II, 2022, 12963 : 302 - 311
  • [38] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Kumar T. Rajamani
    Priya Rani
    Hanna Siebert
    Rajkumar ElagiriRamalingam
    Mattias P. Heinrich
    Signal, Image and Video Processing, 2023, 17 : 981 - 989
  • [39] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Rajamani, Kumar T.
    Rani, Priya
    Siebert, Hanna
    ElagiriRamalingam, Rajkumar
    Heinrich, Mattias P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 981 - 989
  • [40] Dual Encoding U-Net for Retinal Vessel Segmentation
    Wang, Bo
    Qiu, Shuang
    He, Huiguang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT I, 2019, 11764 : 84 - 92