Polyp-LVT: Polyp segmentation with lightweight vision transformers

被引:6
|
作者
Lin, Long [1 ]
Lv, Guangzu [1 ]
Wang, Bin [2 ]
Xu, Cunlu [1 ]
Liu, Jun [3 ]
机构
[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou, Gansu, Peoples R China
[2] Nanjing Univ Finance & Econ, Sch Informat Engn, Nanjing, Peoples R China
[3] Ulster Univ, Sch Comp, Belfast, North Ireland
关键词
Polyp segmentation; Lightweight vision transformer; Pooling layer; Colorectal cancer;
D O I
10.1016/j.knosys.2024.112181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic segmentation of polyps in endoscopic images is crucial for early diagnosis and surgical planning of colorectal cancer. However, polyps closely resemble surrounding mucosal tissue in both texture and indistinct borders and vary in size, appearance, and location which possess great challenge to polyp segmentation. Although some recent attempts have been made to apply Vision Transformer (ViT) to polyp segmentation and achieved promising performance, their application in clinical scenarios is still limited by high computational complexity, large model size, redundant dependencies, and significant training costs. To address these limitations, we propose a novel ViT-based approach named Polyp-LVT, strategically replacing the attention layer in the encoder with a global max pooling layer, which significantly reduces the model's parameter count and computational cost while keeping the performance undegraded. Furthermore, we introduce a network block, named Inter-block Feature Fusion Module (IFFM), into the decoder, aiming to offer a streamlined yet highly efficient feature extraction. We conduct extensive experiments on three public polyp image benchmarks to evaluate our method. The experimental results show that compared with the baseline models, our PolypLVT network achieves a nearly 44% reduction in model parameters while gaining comparable segmentation performance.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] RANet: A receptive aggregation network for polyp segmentation
    Ma, Dehua
    Zhu, Xiaoliang
    Li, Yanxiang
    Meng, Wenzhe
    Xu, Siping
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [42] Revisiting Feature Propagation and Aggregation in Polyp Segmentation
    Su, Yanzhou
    Shen, Yiqing
    Ye, Jin
    He, Junjun
    Cheng, Jian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT V, 2023, 14224 : 632 - 641
  • [43] Video Polyp Segmentation: A Deep Learning Perspective
    Ge-Peng Ji
    Guobao Xiao
    Yu-Cheng Chou
    Deng-Ping Fan
    Kai Zhao
    Geng Chen
    Luc Van Gool
    Machine Intelligence Research, 2022, 19 : 531 - 549
  • [44] ColonNeXt: Fully Convolutional Attention for Polyp Segmentation
    Nguyen, Dinh Cong
    Nguyen, Hoang Long
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024,
  • [45] MMNet: A Mixing Module Network for Polyp Segmentation
    Ghimire, Raman
    Lee, Sang-Woong
    SENSORS, 2023, 23 (16)
  • [46] HSNet: A hybrid semantic network for polyp segmentation
    Zhang, Wenchao
    Fu, Chong
    Zheng, Yu
    Zhang, Fangyuan
    Zhao, Yanli
    Sham, Chiu-Wing
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [47] Estimating Predictive Uncertainty in Gastrointestinal Polyp Segmentation
    Jacobsen, Felicia Ly
    Hicks, Steven A.
    Halvorsen, Pal
    Riegler, Michael A.
    2022 IEEE 35TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2022, : 44 - 49
  • [48] SPPNet: Strip Pooling Architecture for Polyp Segmentation
    Lan Rong
    Sun Yuhao
    Guo Di
    Zhao Feng
    Yu Haiyan
    Zhang Lu
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 214 - 219
  • [49] Video Polyp Segmentation: A Deep Learning Perspective
    Ji, Ge-Peng
    Xiao, Guobao
    Chou, Yu-Cheng
    Fan, Deng-Ping
    Zhao, Kai
    Chen, Geng
    Van Gool, Luc
    MACHINE INTELLIGENCE RESEARCH, 2022, 19 (06) : 531 - 549
  • [50] Video Polyp Segmentation:A Deep Learning Perspective
    Ge-Peng Ji
    Guobao Xiao
    Yu-Cheng Chou
    Deng-Ping Fan
    Kai Zhao
    Geng Chen
    Luc Van Gool
    Machine Intelligence Research, 2022, 19 (06) : 531 - 549