PSTNet: Enhanced Polyp Segmentation With Multi-Scale Alignment and Frequency Domain Integration

被引:3
|
作者
Xu, Wenhao [1 ]
Xu, Rongtao [2 ,3 ]
Wang, Changwei [4 ,5 ,6 ]
Li, Xiuli [7 ]
Xu, Shibiao [1 ]
Guo, Li [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing, Peoples R China
[3] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Qilu Univ Technol, Shandong Acad Sci, Key Lab Comp Power Network & Informat Secur, Minist Educ,Shandong Comp Sci Ctr,Natl Supercomp C, Jinan 250013, Peoples R China
[5] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan 250013, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100876, Peoples R China
[7] Deepwise Healthcare, AI Lab, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Transformers; Accuracy; Frequency-domain analysis; Location awareness; Colonoscopy; Polyp segmentation; shunted transformer; multi-scale fusion; VALIDATION;
D O I
10.1109/JBHI.2024.3421550
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate segmentation of colorectal polyps in colonoscopy images is crucial for effective diagnosis and management of colorectal cancer (CRC). However, current deep learning-based methods primarily rely on fusing RGB information across multiple scales, leading to limitations in accurately identifying polyps due to restricted RGB domain information and challenges in feature misalignment during multi-scale aggregation. To address these limitations, we propose the Polyp Segmentation Network with Shunted Transformer (PSTNet), a novel approach that integrates both RGB and frequency domain cues present in the images. PSTNet comprises three key modules: the Frequency Characterization Attention Module (FCAM) for extracting frequency cues and capturing polyp characteristics, the Feature Supplementary Alignment Module (FSAM) for aligning semantic information and reducing misalignment noise, and the Cross Perception localization Module (CPM) for synergizing frequency cues with high-level semantics to achieve efficient polyp segmentation. Extensive experiments on challenging datasets demonstrate PSTNet's significant improvement in polyp segmentation accuracy across various metrics, consistently outperforming state-of-the-art methods. The integration of frequency domain cues and the novel architectural design of PSTNet contribute to advancing computer-assisted polyp segmentation, facilitating more accurate diagnosis and management of CRC.
引用
收藏
页码:6042 / 6053
页数:12
相关论文
共 50 条
  • [11] DeepNeXt: a lightweight polyp segmentation algorithm based on multi-scale attention
    Wang, Chuantao
    Wang, Saishuo
    Shao, Shuo
    Zhai, Jiliang
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (12) : 8551 - 8567
  • [12] Polyp Image Segmentation Based on Multi-Scale Edge Perception and Enhancement
    Yang, Ruijun
    Chen, Liye
    Cheng, Yan
    Computer Engineering and Applications, 2025, 61 (01) : 272 - 281
  • [13] Advancing Rice Grain Impurity Segmentation with an Enhanced SegFormer and Multi-Scale Feature Integration
    Qiu, Xiulin
    Yao, Hongzhi
    Liu, Qinghua
    Liu, Hongrui
    Zhang, Haozhi
    Zhao, Mengdi
    ENTROPY, 2025, 27 (01)
  • [14] MFEFNet: Multi-scale feature enhancement and Fusion Network for polyp segmentation
    Xia, Yang
    Yun, Haijiao
    Liu, Yanjun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [15] MSEANet: Multi-Scale Selective Edge Aware Network for Polyp Segmentation
    Liu, Botao
    Shi, Changqi
    Zhao, Ming
    ALGORITHMS, 2025, 18 (01)
  • [16] MFBGR: Multi-scale feature boundary graph reasoning network for polyp segmentation
    Liu, Fangjin
    Hua, Zhen
    Li, Jinjiang
    Fan, Linwei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [17] ECTransNet: An Automatic Polyp Segmentation Network Based on Multi-scale Edge Complementary
    Weikang Liu
    Zhigang Li
    Chunyang Li
    Hongyan Gao
    Journal of Digital Imaging, 2023, 36 : 2427 - 2440
  • [18] ECTransNet: An Automatic Polyp Segmentation Network Based on Multi-scale Edge Complementary
    Liu, Weikang
    Li, Zhigang
    Li, Chunyang
    Gao, Hongyan
    JOURNAL OF DIGITAL IMAGING, 2023, 36 (06) : 2427 - 2440
  • [19] Multi-scale information sharing and selection network with boundary attention for polyp segmentation
    Kang, Xiaolu
    Ma, Zhuoqi
    Liu, Kang
    Li, Yunan
    Miao, Qiguang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [20] Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention
    Nam, Ju-Hyeon
    Lee, Sang-Chul
    Syazwanyl, Nur Suriza
    Kim, Su Jung
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 11480 - 11491