Polyp-LVT: Polyp segmentation with lightweight vision transformers

Citations: 6

Authors:

Lin, Long [1]

Lv, Guangzu [1]

Wang, Bin [2]

Xu, Cunlu [1]

Liu, Jun [3]

Affiliations:

[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou, Gansu, Peoples R China

[2] Nanjing Univ Finance & Econ, Sch Informat Engn, Nanjing, Peoples R China

[3] Ulster Univ, Sch Comp, Belfast, North Ireland

Source:

KNOWLEDGE-BASED SYSTEMS | 2024 / Vol. 300

Keywords:

Polyp segmentation; Lightweight vision transformer; Pooling layer; Colorectal cancer

DOI:

10.1016/j.knosys.2024.112181

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Automatic segmentation of polyps in endoscopic images is crucial for early diagnosis and surgical planning of colorectal cancer. However, polyps closely resemble the surrounding mucosal tissue in texture, have indistinct borders, and vary in size, appearance, and location, all of which pose a great challenge to polyp segmentation. Although recent attempts to apply the Vision Transformer (ViT) to polyp segmentation have achieved promising performance, their use in clinical scenarios is still limited by high computational complexity, large model size, redundant dependencies, and significant training costs. To address these limitations, we propose a novel ViT-based approach named Polyp-LVT, which replaces the attention layer in the encoder with a global max pooling layer. This significantly reduces the model's parameter count and computational cost without degrading performance. Furthermore, we introduce a network block, named the Inter-block Feature Fusion Module (IFFM), into the decoder, aiming to offer streamlined yet highly efficient feature extraction. We conduct extensive experiments on three public polyp image benchmarks to evaluate our method. The experimental results show that, compared with the baseline models, our Polyp-LVT network achieves a nearly 44% reduction in model parameters while attaining comparable segmentation performance.
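
The abstract's central idea, swapping a learned attention layer for a parameter-free pooling operation, can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not say how the pooled features are combined with the tokens, so the broadcast-add in `pool_mix` is an assumption, and all function names are hypothetical. The parameter count is the textbook figure for a standard self-attention layer with biases and is not meant to reproduce the 44% figure reported above.

```python
def attention_param_count(dim):
    """Parameters of one standard self-attention layer: the Q, K, V and
    output projections, each a dim x dim weight matrix plus a dim bias."""
    return 4 * (dim * dim + dim)


def global_max_pool(tokens):
    """Per-channel maximum over all tokens. This has no learnable weights.

    tokens: list of N token vectors, each a list of D floats.
    """
    return [max(channel) for channel in zip(*tokens)]


def pool_mix(tokens):
    """Hypothetical token mixer (an assumption, not taken from the paper):
    add the globally pooled vector to every token."""
    pooled = global_max_pool(tokens)
    return [[t + p for t, p in zip(token, pooled)] for token in tokens]


if __name__ == "__main__":
    tokens = [[1.0, 5.0], [3.0, 2.0]]
    print(global_max_pool(tokens))
    print(pool_mix(tokens))
    print(attention_param_count(64))
```

Because the pooling step carries no weights, every encoder block that uses it in place of attention drops the four projection matrices, which is the mechanism behind the kind of parameter saving the abstract describes.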


Pages: 10

