PCT: Pyramid convolutional transformer for parotid gland tumor segmentation in ultrasound images

被引:7
|
作者
Zhang, Gang [1 ,2 ]
Zheng, Chenhong [3 ]
He, Jianfeng [1 ,2 ]
Yi, Sanli [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Key Lab Comp Technol Applicat Yunnan Prov, Kunming 650500, Yunnan, Peoples R China
[3] Kunming Med Univ, Dept Ultrasound, Affiliated Hosp 3, Kunming 650118, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformer; Attention mechanism; Parotid gland tumor; Medical image segmentation; Dense pixel prediction; Convolutional neural network; ARCHITECTURE;
D O I
10.1016/j.bspc.2022.104498
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Preoperative segmentation of parotid gland tumor regions using deep learning is of great significance for treatment decisions. However, there are still two major limitations: to the best of our knowledge, no networks are designed specifically for parotid gland tumor segmentation tasks; and neither convolutional neural network (CNN) nor Transformer can extract both global and local feature solely. To address these issues, we first propose a Pyramid Convolutional Transformer (PCT) architecture based on the shrinking pyramid framework and Fusion Attention Transformer CNN (FTC) block for parotid gland tumors segmentation. In this architecture, the shrinking pyramid framework can effectively capture parotid gland tumor image features with dense pixel by integrating multi-scale dependencies of images. And the FTC block is constructed to address complex and variable contour characteristics of parotid gland tumors, which combines Transformer with CNN for preferable extracting global and local features of images by dual branch structure. The experimental results suggest that proposed PCT achieved intersection-over-union (IoU) of 0.8434 and Dice similarity coefficient (Dice) of 0.9151 on parotid gland tumor segmentation (PGTSeg) dataset, and attained new state-of-the-art performance on multiple challenging benchmarks with IoU of 0.8521 on MoNuSeg and IoU of 0.9080 on ISIC 2018. Meanwhile, common backbones equipped with FTC block outperformed the baseline model. The code and models will be available at: https://github.com/Twoverz/PCT-Pyramid-Convolutional-Transformer.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] BREAST ULTRASOUND IMAGES GLAND SEGMENTATION
    Braz, Rui
    Pinheiro, Antonio M. G.
    Moutinho, J.
    Freire, Mario M.
    Pereira, Manuela
    2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,
  • [2] Radiological images of keratocystoma: a rare tumor of the parotid gland
    Hirata, Kenji
    Fujima, Noriyuki
    Mizumachi, Takatsugu
    Bandarchi, Bizhan
    Roesler, John M.
    ACTA RADIOLOGICA OPEN, 2014, 3 (08):
  • [3] Segmentation of ultrasound images for tumor surgery
    Medina, L. R. Gutierrez
    Cosio, F. Arambula
    Lasri, E. Hazan
    MEDICAL PHYSICS, 2006, 854 : 188 - +
  • [4] Feature Pyramid Nonlocal Network With Transform Modal Ensemble Learning for Breast Tumor Segmentation in Ultrasound Images
    Tang, Peng
    Yang, Xintong
    Nan, Yang
    Xiang, Shao
    Liang, Qiaokang
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2021, 68 (12) : 3549 - 3559
  • [5] Feature Pyramid Nonlocal Network with Transform Modal Ensemble Learning for Breast Tumor Segmentation in Ultrasound Images
    Tang, Peng
    Yang, Xintong
    Nan, Yang
    Xiang, Shao
    Liang, Qiaokang
    Liang, Qiaokang (qiaokang@hnu.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (68): : 3549 - 3559
  • [6] CSwin-PNet: A CNN-Swin Transformer combined pyramid network for breast lesion segmentation in ultrasound images
    Yang, Haonan
    Yang, Dapeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [7] Automatic Parotid Gland Segmentation in MVCT Using Deep Convolutional Neural Networks
    Zhang J.
    Yingming S.U.N.
    Liao H.
    Jian Z.H.U.
    Zhang Y.
    ACM Transactions on Computing for Healthcare, 2022, 3 (02):
  • [8] FCTformer: Fusing Convolutional Operations and Transformer for 3D Rectal Tumor Segmentation in MR Images
    Sang, Zhenguo
    Li, Chengkang
    Xu, Ye
    Wang, Yuanyuan
    Zheng, Hongtu
    Guo, Yi
    IEEE ACCESS, 2024, 12 : 4812 - 4824
  • [9] PYRAMID TRANSFORMER DRIVEN MULTIBRANCH FUSION FOR POLYP SEGMENTATION IN COLONOSCOPIC VIDEO IMAGES
    Wang, Ao
    Wu, Ming
    Qi, Hao
    Shi, Hong
    Chen, Jianhua
    Chen, Yinran
    Luo, Xiongbiao
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2350 - 2354