Multi-scale nested UNet with transformer for colorectal polyp segmentation

Cited by: 5
Authors
Wang, Zenan [1 ]
Liu, Zhen [1 ]
Yu, Jianfeng [1 ]
Gao, Yingxin [1 ]
Liu, Ming [2 ]
Affiliations
[1] Capital Med Univ, Beijing Chaoyang Hosp, Dept Gastroenterol, Clin Med Coll 3, Beijing, Peoples R China
[2] Hunan Key Lab Nonferrous Resources & Geol Hazard E, Changsha, Peoples R China
Source
Keywords
colorectal polyp; deep learning; polyp segmentation; transformer; miss rate; colonoscopy
DOI
10.1002/acm2.14351
Chinese Library Classification (CLC) number
R8 [Special Medicine]; R445 [Diagnostic Imaging];
Discipline classification code
1002 ; 100207 ; 1009 ;
Abstract
Background: Polyp detection and localization are essential tasks in colonoscopy. Convolutional neural networks based on U-shaped architectures have achieved remarkable segmentation performance on biomedical images, but their lack of long-range dependency modeling limits their receptive fields.

Purpose: Our goal was to develop and test a novel architecture for polyp segmentation that combines local feature learning with long-range dependency modeling.

Methods: A novel architecture that integrates a transformer into a multi-scale nested UNet structure was developed for polyp segmentation. The proposed network takes advantage of both CNN and transformer to extract distinct feature information. The transformer layer is embedded between the encoder and decoder of a U-shaped network to learn explicit global context and long-range semantic information. To address the challenge of varying polyp sizes, an MSFF unit was proposed to fuse features at multiple resolutions.

Results: Four public datasets and one in-house dataset were used to train and test the model. An ablation study was also conducted to verify each component of the model. On the Kvasir-SEG and CVC-ClinicDB datasets, the proposed model achieved mean Dice scores of 0.942 and 0.950, respectively, higher than those of the other methods. To assess generalization, we performed two cross-dataset validations, in which the proposed model achieved the highest mean Dice score. The results demonstrate that the proposed network has powerful learning and generalization capability, significantly improving segmentation accuracy and outperforming state-of-the-art methods.

Conclusions: The proposed model produced more accurate polyp segmentation than current methods on four public datasets and one in-house dataset. Its ability to segment polyps of different sizes shows its potential for clinical application.
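The abstract reports results as mean Dice scores without defining the metric. As a reading aid, the following is a minimal sketch of the standard Dice coefficient, 2|A∩B| / (|A| + |B|), for binary masks, averaged per image. The function names `dice_score` and `mean_dice`, the nested-list mask representation, and the convention of returning 1.0 when both masks are empty are assumptions made for illustration; the paper's actual evaluation protocol (thresholding, averaging scheme) is not stated in this record.

```python
def dice_score(pred, target):
    """Dice coefficient between two binary masks (nested lists of 0/1).

    Hypothetical helper for illustration; not taken from the paper.
    """
    flat_pred = [p for row in pred for p in row]
    flat_target = [t for row in target for t in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must contain the same number of pixels")

    # Pixels that are foreground in both masks.
    intersection = sum(p * t for p, t in zip(flat_pred, flat_target))
    total = sum(flat_pred) + sum(flat_target)

    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


def mean_dice(pairs):
    """Average the per-image Dice score over (prediction, ground truth) pairs."""
    scores = [dice_score(pred, target) for pred, target in pairs]
    return sum(scores) / len(scores)


# Toy example: 3 predicted pixels, 3 true pixels, 2 of them overlapping.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
print(round(dice_score(pred, target), 4))  # 2*2 / (3+3) = 0.6667
```

A score of 1.0 means the predicted and reference masks coincide exactly, so values such as 0.942 and 0.950 indicate high overlap on average.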
Pages: 10