PSTNet: Enhanced Polyp Segmentation With Multi-Scale Alignment and Frequency Domain Integration

被引:3
|
作者
Xu, Wenhao [1 ]
Xu, Rongtao [2 ,3 ]
Wang, Changwei [4 ,5 ,6 ]
Li, Xiuli [7 ]
Xu, Shibiao [1 ]
Guo, Li [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing, Peoples R China
[3] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Qilu Univ Technol, Shandong Acad Sci, Key Lab Comp Power Network & Informat Secur, Minist Educ,Shandong Comp Sci Ctr,Natl Supercomp C, Jinan 250013, Peoples R China
[5] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan 250013, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100876, Peoples R China
[7] Deepwise Healthcare, AI Lab, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Transformers; Accuracy; Frequency-domain analysis; Location awareness; Colonoscopy; Polyp segmentation; shunted transformer; multi-scale fusion; VALIDATION;
D O I
10.1109/JBHI.2024.3421550
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate segmentation of colorectal polyps in colonoscopy images is crucial for effective diagnosis and management of colorectal cancer (CRC). However, current deep learning-based methods primarily rely on fusing RGB information across multiple scales, leading to limitations in accurately identifying polyps due to restricted RGB domain information and challenges in feature misalignment during multi-scale aggregation. To address these limitations, we propose the Polyp Segmentation Network with Shunted Transformer (PSTNet), a novel approach that integrates both RGB and frequency domain cues present in the images. PSTNet comprises three key modules: the Frequency Characterization Attention Module (FCAM) for extracting frequency cues and capturing polyp characteristics, the Feature Supplementary Alignment Module (FSAM) for aligning semantic information and reducing misalignment noise, and the Cross Perception localization Module (CPM) for synergizing frequency cues with high-level semantics to achieve efficient polyp segmentation. Extensive experiments on challenging datasets demonstrate PSTNet's significant improvement in polyp segmentation accuracy across various metrics, consistently outperforming state-of-the-art methods. The integration of frequency domain cues and the novel architectural design of PSTNet contribute to advancing computer-assisted polyp segmentation, facilitating more accurate diagnosis and management of CRC.
引用
收藏
页码:6042 / 6053
页数:12
相关论文
共 50 条
  • [21] Multi-scale frequency domain learning for texture classification
    Zang, Liguang
    Li, Yuancheng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (02) : 947 - 958
  • [22] Multi-scale convolutional attention frequency-enhanced transformer network for medical image segmentation
    Yan, Shun
    Yang, Benquan
    Chen, Aihua
    Zhao, Xiaoming
    Zhang, Shiqing
    INFORMATION FUSION, 2025, 119
  • [23] PASSIVE MULTI-SCALE ALIGNMENT
    Kazmer, David O.
    Johnston, Stephen P.
    Moriarty, Mary E.
    Santeufemio, Christopher
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, 2011, VOL 11, 2012, : 747 - 754
  • [24] Multi-scale cross-domain alignment for person image generation
    Ma, Liyuan
    Gao, Tingwei
    Shen, Haibin
    Huang, Kejie
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (02) : 374 - 387
  • [25] CoAM-Net: coordinate asymmetric multi-scale fusion strategy for polyp segmentation
    Wang, Yuanyuan
    Yuan, Weimin
    Bai, Xiangzhi
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30626 - 30641
  • [26] CoAM-Net: coordinate asymmetric multi-scale fusion strategy for polyp segmentation
    Yuanyuan Wang
    Weimin Yuan
    Xiangzhi Bai
    Applied Intelligence, 2023, 53 : 30626 - 30641
  • [27] Multi-scale polyp segmentation network employing cascaded strategy to fuse boundary features
    Yi, Jianbing
    Wan, Jianhui
    Cao, Feng
    Li, Jun
    Chen, Xin
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (18): : 2846 - 2860
  • [28] Dataset-level color augmentation and multi-scale exploration methods for polyp segmentation
    Chen, Haipeng
    Ju, Honghong
    Qin, Jun
    Song, Jincai
    Lyu, Yingda
    Liu, Xianzhu
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
  • [29] Multi-scale watershed segmentation
    Olsen, OF
    GAUSSIAN SCALE-SPACE THEORY, 1997, 8 : 191 - 200
  • [30] Multi-scale contour segmentation
    Mokhtarian, F
    SCALE-SPACE THEORY IN COMPUTER VISION, 1997, 1252 : 296 - 307