Multi-scale nested UNet with transformer for colorectal polyp segmentation

被引:5
|
作者
Wang, Zenan [1 ]
Liu, Zhen [1 ]
Yu, Jianfeng [1 ]
Gao, Yingxin [1 ]
Liu, Ming [2 ]
机构
[1] Capital Med Univ, Beijing Chaoyang Hosp, Dept Gastroenterol, Clin Med Coll 3, Beijing, Peoples R China
[2] Hunan Key Lab Nonferrous Resources & Geol Hazard E, Changsha, Peoples R China
来源
关键词
colorectal polyp; deep learning; polyp segmentation; transformer; MISS RATE; COLONOSCOPY;
D O I
10.1002/acm2.14351
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
BackgroundPolyp detection and localization are essential tasks for colonoscopy. U-shape network based convolutional neural networks have achieved remarkable segmentation performance for biomedical images, but lack of long-range dependencies modeling limits their receptive fields.PurposeOur goal was to develop and test a novel architecture for polyp segmentation, which takes advantage of learning local information with long-range dependencies modeling.MethodsA novel architecture combining with multi-scale nested UNet structure integrated transformer for polyp segmentation was developed. The proposed network takes advantage of both CNN and transformer to extract distinct feature information. The transformer layer is embedded between the encoder and decoder of a U-shape net to learn explicit global context and long-range semantic information. To address the challenging of variant polyp sizes, a MSFF unit was proposed to fuse features with multiple resolution.ResultsFour public datasets and one in-house dataset were used to train and test the model performance. Ablation study was also conducted to verify each component of the model. For dataset Kvasir-SEG and CVC-ClinicDB, the proposed model achieved mean dice score of 0.942 and 0.950 respectively, which were more accurate than the other methods. To show the generalization of different methods, we processed two cross dataset validations, the proposed model achieved the highest mean dice score. The results demonstrate that the proposed network has powerful learning and generalization capability, significantly improving segmentation accuracy and outperforming state-of-the-art methods.ConclusionsThe proposed model produced more accurate polyp segmentation than current methods on four different public and one in-house datasets. Its capability of polyps segmentation in different sizes shows the potential clinical application
引用
收藏
页数:10
相关论文
共 50 条
  • [1] META-Unet: Multi-Scale Efficient Transformer Attention Unet for Fast and High-Accuracy Polyp Segmentation
    Wu, Huisi
    Zhao, Zebin
    Wang, Zhaoze
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) : 4117 - 4128
  • [2] LTUNet: A Lightweight Transformer-Based UNet with Multi-scale Mechanism for Skin Lesion Segmentation
    Guo, Huike
    Zhang, Han
    Li, Minghe
    Quan, Xiongwen
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 147 - 158
  • [3] MS-UNet: Swin Transformer U-Net with Multi-scale Nested Decoder for Medical Image Segmentation with Small Training Data
    Chen, Haoyuan
    Han, Yufei
    Li, Yanyi
    Xu, Pin
    Li, Kuan
    Yin, Jianping
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 472 - 483
  • [4] RMS-UNet: Residual multi-scale UNet for liver and lesion segmentation
    Khan, Rayyan Azam
    Luo, Yigang
    Wu, Fang-Xiang
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 124
  • [5] MS UNet: Multi-scale 3D UNet for Brain Tumor Segmentation
    Ahmad, Parvez
    Qamar, Saqib
    Shen, Linlin
    Rizvi, Syed Qasim Afser
    Ali, Aamir
    Chetty, Girija
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT II, 2022, 12963 : 30 - 41
  • [6] MTPA_Unet: Multi-Scale Transformer-Position Attention Retinal Vessel Segmentation Network Joint Transformer and CNN
    Jiang, Yun
    Liang, Jing
    Cheng, Tongtong
    Lin, Xin
    Zhang, Yuan
    Dong, Jinkun
    SENSORS, 2022, 22 (12)
  • [7] MS-UNet: Multi-Scale Nested UNet for Medical Image Segmentation with Few Training Data Based on an ELoss and Adaptive Denoising Method
    Chen, Haoyuan
    Han, Yufei
    Yao, Linwei
    Wu, Xin
    Li, Kuan
    Yin, Jianping
    MATHEMATICS, 2024, 12 (19)
  • [8] Automatic Polyp Segmentation via Multi-scale Subtraction Network
    Zhao, Xiaoqi
    Zhang, Lihe
    Lu, Huchuan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 120 - 130
  • [9] Attention based multi-scale parallel network for polyp segmentation
    Song, Pengfei
    Li, Jinjiang
    Fan, Hui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [10] CrossFormer: Multi-scale cross-attention for polyp segmentation
    Chen, Lifang
    Ge, Hongze
    Li, Jiawei
    IET IMAGE PROCESSING, 2023, 17 (12) : 3441 - 3452