Polyp-LVT: Polyp segmentation with lightweight vision transformers

被引:6
|
作者
Lin, Long [1 ]
Lv, Guangzu [1 ]
Wang, Bin [2 ]
Xu, Cunlu [1 ]
Liu, Jun [3 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou, Gansu, Peoples R China
[2] Nanjing Univ Finance & Econ, Sch Informat Engn, Nanjing, Peoples R China
[3] Ulster Univ, Sch Comp, Belfast, North Ireland
关键词
Polyp segmentation; Lightweight vision transformer; Pooling layer; Colorectal cancer;
D O I
10.1016/j.knosys.2024.112181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic segmentation of polyps in endoscopic images is crucial for early diagnosis and surgical planning of colorectal cancer. However, polyps closely resemble surrounding mucosal tissue in both texture and indistinct borders and vary in size, appearance, and location which possess great challenge to polyp segmentation. Although some recent attempts have been made to apply Vision Transformer (ViT) to polyp segmentation and achieved promising performance, their application in clinical scenarios is still limited by high computational complexity, large model size, redundant dependencies, and significant training costs. To address these limitations, we propose a novel ViT-based approach named Polyp-LVT, strategically replacing the attention layer in the encoder with a global max pooling layer, which significantly reduces the model's parameter count and computational cost while keeping the performance undegraded. Furthermore, we introduce a network block, named Inter-block Feature Fusion Module (IFFM), into the decoder, aiming to offer a streamlined yet highly efficient feature extraction. We conduct extensive experiments on three public polyp image benchmarks to evaluate our method. The experimental results show that compared with the baseline models, our PolypLVT network achieves a nearly 44% reduction in model parameters while gaining comparable segmentation performance.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Polyp2Seg: Improved Polyp Segmentation with Vision Transformer
    Mandujano-Cornejo, Vittorino
    Montoya-Zegarra, Javier A.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 519 - 534
  • [2] Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
    Nanni, Loris
    Fantozzi, Carlo
    Loreggia, Andrea
    Lumini, Alessandra
    SENSORS, 2023, 23 (10)
  • [3] PVT-MA: pyramid vision transformers with multi-attention fusion mechanism for polyp segmentation
    Shang, Xiao
    Wu, Siqi
    Liu, Yuhao
    Zhao, Zhenfeng
    Wang, Shenwen
    APPLIED INTELLIGENCE, 2025, 55 (01)
  • [4] Attention combined pyramid vision transformer for polyp segmentation
    Liu, Xiaogang
    Song, Shuang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [5] EPSegNet: Lightweight Semantic Recalibration and Assembly for Efficient Polyp Segmentation
    Wu, Huisi
    Zhao, Zebin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
  • [6] Polyp-Mamba: Polyp Segmentation with Visual Mamba
    Xu, Zhongxing
    Tang, Feilong
    Chen, Zhe
    Zhou, Zheng
    Wu, Weishan
    Yang, Yuyao
    Liang, Yu
    Jiang, Jiyu
    Cai, Xuyue
    Su, Jionglong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 510 - 521
  • [7] Polyp-SAM: Transfer SAM for Polyp Segmentation
    Li, Yuheng
    Hu, Mingzhe
    Yang, Xiaofeng
    COMPUTER-AIDED DIAGNOSIS, MEDICAL IMAGING 2024, 2024, 12927
  • [8] Semantic Polyp Generation for Improving Polyp Segmentation Performance
    Song, Hun
    Shin, Younghak
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2024, 44 (02) : 280 - 292
  • [9] Meta-Polyp: a baseline for efficient Polyp segmentation
    Trinh, Quoc-Huy
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 742 - 747
  • [10] Polyp segmentation network based on lightweight model and reverse attention mechanisms
    Long, Jianwu
    Yang, Chengxin
    Song, Xinlei
    Zeng, Ziqin
    Ren, Yan
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (03)