PVT2DNet: Polyp segmentation with vision transformer and dual decoder refinement strategy

被引:0
|
作者
Hu, Yibiao [1 ]
Jin, Yan [1 ]
Jiang, Zhiwei [1 ]
Zheng, Qiufu [1 ]
机构
[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Zhejiang, Peoples R China
关键词
Polyp image segmentation; Context enhancement; Dual decoder refinement;
D O I
10.1016/j.jvcir.2024.104304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Colorectal carcinoma is a prevalent malignancy worldwide. Accurate polyp segmentation, along with endoscopic resection, can significantly reduce its incidence and mortality. Most polyp segmentation neural networks are CNN-based and single decoder strategy architectures, which learn limited robust representations. In this paper, we propose a novel network with the vision transformer and dual decoder refinement strategy called PVT2DNet to overcome some limitations of current networks and achieve more precise automated polyp segmentation. The PVT2DNet adopts a pyramid vision transformer encoder and enhances the multi-level features with the contextenhanced module (CEM). Moreover, instead of directly feeding features into a single decoder, we introduce a dual partial cascaded decoder refinement strategy to excavate more informative polyp cues. Extensive experimentations on five widely adopted datasets demonstrate the proposed network outperforms other state-of-the-art on most metrics.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Polyp2Seg: Improved Polyp Segmentation with Vision Transformer
    Mandujano-Cornejo, Vittorino
    Montoya-Zegarra, Javier A.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2022, 2022, 13413 : 519 - 534
  • [2] Rethinking encoder-decoder architecture using vision transformer for colorectal polyp and surgical instruments segmentation
    Iqbal, Ahmed
    Ahmed, Zohair
    Usman, Muhammad
    Malik, Isra
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [3] Attention combined pyramid vision transformer for polyp segmentation
    Liu, Xiaogang
    Song, Shuang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [4] PolySegNet: improving polyp segmentation through swin transformer and vision transformer fusion
    Lijin, P.
    Ullah, Mohib
    Vats, Anuja
    Cheikh, Faouzi Alaya
    Kumar, G. Santhosh
    Nair, Madhu S.
    BIOMEDICAL ENGINEERING LETTERS, 2024, 14 (06) : 1421 - 1431
  • [5] Dual Encoder Decoder Shifted Window-Based Transformer Network for Polyp Segmentation with Self-Learning Approach
    P. L.
    Ullah M.
    Vats A.
    Cheikh F.A.
    Kumar G. S.
    Nair M.S.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (07): : 1 - 14
  • [6] Probabilistic Modeling Ensemble Vision Transformer Improves Complex Polyp Segmentation
    Ling, Tianyi
    Wu, Chengyi
    Yu, Huan
    Cai, Tian
    Wang, Da
    Zhou, Yincong
    Chen, Ming
    Ding, Kefeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VII, 2023, 14226 : 572 - 581
  • [7] Colorectal Polyp Segmentation Combining Pyramid Vision Transformer and Axial Attention
    Zhou, Xue
    Bai, Zhengyao
    Lu, Qianjie
    Fan, Shenglan
    Computer Engineering and Applications, 2023, 59 (11) : 222 - 230
  • [8] Polyp Segmentation Using a Hybrid Vision Transformer and a Hybrid Loss Function
    Goceri, Evgin
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (02): : 851 - 863
  • [9] Improved dual-aggregation polyp segmentation network combining a pyramid vision transformer with a fully convolutional network
    Li, Feng
    Huang, Zetao
    Zhou, Lu
    Chen, Yuyang
    Tang, Shiqing
    Ding, Pengchao
    Peng, Haixia
    Chu, Yimin
    BIOMEDICAL OPTICS EXPRESS, 2024, 15 (04): : 2590 - 2621
  • [10] DUSFormer: Dual-Swin Transformer V2 Aggregate Network for Polyp Segmentation
    Xia, Zhangrun
    Chen, Jingliang
    Lu, Chengzhun
    IEEE ACCESS, 2024, 12 : 8822 - 8832