PVT2DNet: Polyp segmentation with vision transformer and dual decoder refinement strategy

Cited by: 0

Authors:

Hu, Yibiao [1]

Jin, Yan [1]

Jiang, Zhiwei [1]

Zheng, Qiufu [1]

Affiliations:

[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Zhejiang, Peoples R China

Source:

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2024 / Volume 104

Keywords:

Polyp image segmentation; Context enhancement; Dual decoder refinement;

DOI:

10.1016/j.jvcir.2024.104304

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Colorectal carcinoma is a prevalent malignancy worldwide. Accurate polyp segmentation, along with endoscopic resection, can significantly reduce its incidence and mortality. Most polyp segmentation networks are CNN-based and use a single-decoder architecture, which limits the robustness of the representations they learn. In this paper, we propose PVT2DNet, a novel network that combines a vision transformer with a dual decoder refinement strategy, to overcome some limitations of current networks and achieve more precise automated polyp segmentation. PVT2DNet adopts a pyramid vision transformer encoder and enhances the multi-level features with a context-enhanced module (CEM). Moreover, instead of feeding features directly into a single decoder, we introduce a dual partial cascaded decoder refinement strategy to extract more informative polyp cues. Extensive experiments on five widely adopted datasets demonstrate that the proposed network outperforms other state-of-the-art methods on most metrics.
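The abstract reports results "on most metrics" without naming them. As a minimal sketch, assuming the Dice coefficient and intersection-over-union (IoU), two overlap measures commonly reported for polyp segmentation, the comparison between a predicted binary mask and a ground-truth mask could be computed as follows. The function names and the toy masks are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple

# A binary mask as rows of 0/1 integers (assumed representation for this sketch).
Mask = Sequence[Sequence[int]]


def _counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Return (intersection, predicted positives, ground-truth positives)."""
    inter = pred_sum = target_sum = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += p & t
            pred_sum += p
            target_sum += t
    return inter, pred_sum, target_sum


def dice_score(pred: Mask, target: Mask, eps: float = 1e-8) -> float:
    """Dice = 2|P ∩ G| / (|P| + |G|); eps avoids division by zero on empty masks."""
    inter, pred_sum, target_sum = _counts(pred, target)
    return (2 * inter + eps) / (pred_sum + target_sum + eps)


def iou_score(pred: Mask, target: Mask, eps: float = 1e-8) -> float:
    """IoU = |P ∩ G| / |P ∪ G|, with the union computed as |P| + |G| - |P ∩ G|."""
    inter, pred_sum, target_sum = _counts(pred, target)
    return (inter + eps) / (pred_sum + target_sum - inter + eps)


# Toy example: 2 overlapping pixels, 3 predicted, 3 in the ground truth.
pred_mask = [[1, 1, 0], [0, 1, 0]]
true_mask = [[1, 0, 0], [0, 1, 1]]
dice = dice_score(pred_mask, true_mask)  # approximately 2/3
iou = iou_score(pred_mask, true_mask)    # approximately 1/2
```

Both scores range from 0 (no overlap) to 1 (perfect overlap), and with this smoothing term two empty masks score 1.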

Pages: 11

