PCT: Pyramid convolutional transformer for parotid gland tumor segmentation in ultrasound images

被引:7
|
作者
Zhang, Gang [1 ,2 ]
Zheng, Chenhong [3 ]
He, Jianfeng [1 ,2 ]
Yi, Sanli [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Key Lab Comp Technol Applicat Yunnan Prov, Kunming 650500, Yunnan, Peoples R China
[3] Kunming Med Univ, Dept Ultrasound, Affiliated Hosp 3, Kunming 650118, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformer; Attention mechanism; Parotid gland tumor; Medical image segmentation; Dense pixel prediction; Convolutional neural network; ARCHITECTURE;
D O I
10.1016/j.bspc.2022.104498
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Preoperative segmentation of parotid gland tumor regions using deep learning is of great significance for treatment decisions. However, there are still two major limitations: to the best of our knowledge, no networks are designed specifically for parotid gland tumor segmentation tasks; and neither convolutional neural network (CNN) nor Transformer can extract both global and local feature solely. To address these issues, we first propose a Pyramid Convolutional Transformer (PCT) architecture based on the shrinking pyramid framework and Fusion Attention Transformer CNN (FTC) block for parotid gland tumors segmentation. In this architecture, the shrinking pyramid framework can effectively capture parotid gland tumor image features with dense pixel by integrating multi-scale dependencies of images. And the FTC block is constructed to address complex and variable contour characteristics of parotid gland tumors, which combines Transformer with CNN for preferable extracting global and local features of images by dual branch structure. The experimental results suggest that proposed PCT achieved intersection-over-union (IoU) of 0.8434 and Dice similarity coefficient (Dice) of 0.9151 on parotid gland tumor segmentation (PGTSeg) dataset, and attained new state-of-the-art performance on multiple challenging benchmarks with IoU of 0.8521 on MoNuSeg and IoU of 0.9080 on ISIC 2018. Meanwhile, common backbones equipped with FTC block outperformed the baseline model. The code and models will be available at: https://github.com/Twoverz/PCT-Pyramid-Convolutional-Transformer.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Cascaded Fully Convolutional DenseNet for Automatic Kidney Segmentation in Ultrasound Images
    Wu, Zhiwei
    Hai, Jinjin
    Zhang, Lijie
    Chen, Jian
    Cheng, Genyang
    Yan, Bin
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 384 - 388
  • [22] Decoupled pyramid correlation network for liver tumor segmentation from CT images
    Zhang, Yao
    Yang, Jiawei
    Liu, Yang
    Tian, Jiang
    Wang, Siyun
    Zhong, Cheng
    Shi, Zhongchao
    Zhang, Yang
    He, Zhiqiang
    MEDICAL PHYSICS, 2022, 49 (11) : 7207 - 7221
  • [23] DFP-ResUNet:Convolutional Neural Network with a Dilated Convolutional Feature Pyramid for Multimodal Brain Tumor Segmentation
    Wang, Jingjing
    Gao, Jun
    Ren, Jinwen
    Luan, Zhenye
    Yu, Zishu
    Zhao, Yanhua
    Zhao, Yuefeng
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208
  • [24] An amalgamation of vision transformer with convolutional neural network for automatic lung tumor segmentation
    Tyagi, Shweta
    Kushnure, Devidas T.
    Talbar, Sanjay N.
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2023, 108
  • [25] CLASSIFICATION TASK ASSISTED SEGMENTATION NETWORK FOR BREAST TUMOR SEGMENTATION IN ULTRASOUND IMAGES
    Zhang, Kunkun
    Wang, Bin
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3294 - 3298
  • [26] An advanced transformer framework for liver tumor segmentation using MRI images
    Sivanagaraju, P.
    Ramana, S. Venkata
    Reddy, P. V. G. D. Prasad
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107
  • [27] Transformer and convolutional based dual branch network for retinal vessel segmentation in OCTA images
    Liu, Xiaoming
    Zhang, Di
    Yao, Junping
    Tang, Jinshan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 83
  • [28] Brain Tumor Segmentation Using Convolutional Neural Networks in MRI Images
    Thaha, M. Mohammed
    Kumar, K. Pradeep Mohan
    Murugan, B. S.
    Dhanasekeran, S.
    Vijayakarthick, P.
    Selvi, A. Senthil
    JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (09)
  • [29] Brain Tumor Segmentation Using Convolutional Neural Networks in MRI Images
    M. Mohammed Thaha
    K. Pradeep Mohan Kumar
    B. S. Murugan
    S. Dhanasekeran
    P. Vijayakarthick
    A. Senthil Selvi
    Journal of Medical Systems, 2019, 43
  • [30] Brain Tumor Segmentation Using Convolutional Neural Networks in MRI Images
    Pereira, Sergio
    Pinto, Adriano
    Alves, Victor
    Silva, Carlos A.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) : 1240 - 1251