Polyp-LVT: Polyp segmentation with lightweight vision transformers

Cited by: 6

Authors:

Lin, Long [1]

Lv, Guangzu [1]

Wang, Bin [2]

Xu, Cunlu [1]

Liu, Jun [3]

Affiliations:

[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou, Gansu, Peoples R China

[2] Nanjing Univ Finance & Econ, Sch Informat Engn, Nanjing, Peoples R China

[3] Ulster Univ, Sch Comp, Belfast, North Ireland

Source:

KNOWLEDGE-BASED SYSTEMS | 2024 / Vol. 300

Keywords:

Polyp segmentation; Lightweight vision transformer; Pooling layer; Colorectal cancer;

DOI:

10.1016/j.knosys.2024.112181

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Automatic segmentation of polyps in endoscopic images is crucial for early diagnosis and surgical planning of colorectal cancer. However, polyps closely resemble the surrounding mucosal tissue in texture, have indistinct borders, and vary in size, appearance, and location, all of which pose great challenges to polyp segmentation. Although recent attempts to apply the Vision Transformer (ViT) to polyp segmentation have achieved promising performance, their application in clinical scenarios is still limited by high computational complexity, large model size, redundant dependencies, and significant training costs. To address these limitations, we propose a novel ViT-based approach named Polyp-LVT, which replaces the attention layer in the encoder with a global max pooling layer, significantly reducing the model's parameter count and computational cost without degrading performance. Furthermore, we introduce a network block, named Inter-block Feature Fusion Module (IFFM), into the decoder, aiming to offer streamlined yet highly efficient feature extraction. We conduct extensive experiments on three public polyp image benchmarks to evaluate our method. The experimental results show that, compared with the baseline models, our Polyp-LVT network achieves a nearly 44% reduction in model parameters while attaining comparable segmentation performance.
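The parameter saving described in the abstract can be made concrete with a small sketch. The code below is not the authors' implementation. It assumes a standard multi-head self-attention layer with four dim x dim projections (query, key, value, output), and it uses a plain per-channel maximum over tokens as a stand-in for a global max pooling layer. The function names `attention_param_count` and `global_max_pool` are hypothetical and chosen only for illustration.

```python
from typing import List


def attention_param_count(dim: int, bias: bool = True) -> int:
    """Learnable parameters in a standard self-attention layer.

    Assumption: four dim x dim linear projections (query, key, value,
    output), each with an optional bias vector of length dim.
    """
    per_projection = dim * dim + (dim if bias else 0)
    return 4 * per_projection


def global_max_pool(tokens: List[List[float]]) -> List[float]:
    """Per-channel maximum over all tokens.

    Illustrative stand-in for a global max pooling layer; it has no
    learnable parameters at all.
    """
    if not tokens:
        raise ValueError("expected at least one token")
    # zip(*tokens) regroups the values channel by channel.
    return [max(channel) for channel in zip(*tokens)]


if __name__ == "__main__":
    # Three tokens with two channels each (made-up values).
    tokens = [[0.1, 0.9], [0.4, 0.2], [0.3, 0.5]]
    print(global_max_pool(tokens))
    print(attention_param_count(64))
```

Under these assumptions, every attention layer swapped for pooling removes roughly 4 x dim^2 weights from the encoder. The overall reduction reported in the abstract depends on the rest of the network, which this sketch does not model.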

Pages: 10