Hybrid transformer UNet for thyroid segmentation from ultrasound scans

被引:18
|
作者
Chi, Jianning [1 ,2 ,3 ]
Li, Zelan [1 ]
Sun, Zhiyi [1 ]
Yu, Xiaosheng [1 ]
Wang, Huan [1 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110167, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110167, Peoples R China
[3] 3-11 Wenhua Rd, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
Thyroid gland segmentation; Ultrasound image processing; Deep learning; Attention mechanism; Transformer; IMAGE SEGMENTATION; NETWORK;
D O I
10.1016/j.compbiomed.2022.106453
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep learning based medical image segmentation methods have been widely used for thyroid gland segmen-tation from ultrasound images, which is of great importance for the diagnosis of thyroid disease since it can provide various valuable sonography features. However, existing thyroid gland segmentation models suffer from: (1) low-level features that are significant in depicting thyroid boundaries are gradually lost during the feature encoding process, (2) contextual features reflecting the changes of difference between thyroid and other anatomies in the ultrasound diagnosis process are either omitted by 2D convolutions or weakly represented by 3D convolutions due to high redundancy. In this work, we propose a novel hybrid transformer UNet (H-TUNet) to segment thyroid glands in ultrasound sequences, which consists of two parts: (1) a 2D Transformer UNet is proposed by utilizing a designed multi-scale cross-attention transformer (MSCAT) module on every skipped connection of the UNet, so that the low-level features from different encoding layers are integrated and refined according to the high-level features in the decoding scheme, leading to better representation of differences between anatomies in one ultrasound frame; (2) a 3D Transformer UNet is proposed by applying a 3D self-attention transformer (SAT) module to the very bottom layer of 3D UNet, so that the contextual features representing visual differences between regions and consistencies within regions could be strengthened from successive frames in the video. The learning process of the H-TUNet is formulated as a unified end-to-end network, so the intra-frame feature extraction and inter-frame feature aggregation can be learned and optimized jointly. The proposed method was evaluated on Thyroid Segmentation in Ultrasonography Dataset (TSUD) and TG3k Dataset. Experimental results have demonstrated that our method outperformed other state-of-the-art methods with respect to the certain benchmarks for thyroid gland segmentation.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] MCV-UNet: a modified convolution & transformer hybrid encoder-decoder network with multi-scale information fusion for ultrasound image semantic segmentation
    Xu, Zihong
    Wang, Ziyang
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [32] HCTNet: A hybrid CNN-transformer network for breast ultrasound image segmentation
    He, Qiqi
    Yang, Qiuju
    Xie, Minghao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [33] A Hybrid Multilayer Filtering Approach for Thyroid Nodule Segmentation on Ultrasound Images
    Ardakani, Ali Abbasian
    Bitarafan-Rajabi, Ahmad
    Mohammadzadeh, Ali
    Mohammadi, Afshin
    Riazi, Reza
    Abolghasemi, Jamileh
    Jafari, Amir Homayoun
    Shiran, Mohammad Bagher
    JOURNAL OF ULTRASOUND IN MEDICINE, 2019, 38 (03) : 629 - 640
  • [34] SK-Unet plus plus : An improved Unet plus plus network with adaptive receptive fields for automatic segmentation of ultrasound thyroid nodule images
    Dai, Hong
    Xie, Wufei
    Xia, E.
    MEDICAL PHYSICS, 2024, 51 (03) : 1798 - 1811
  • [35] ADID-UNET-a segmentation model for COVID-19 infection from lung CT scans
    Raj, Alex Noel Joseph
    Zhu, Haipeng
    Khan, Asiya
    Zhuang, Zhemin
    Yang, Zengbiao
    Mahesh, Vijayalakshmi G. V.
    Karthik, Ganesan
    PEERJ COMPUTER SCIENCE, 2021,
  • [36] Multilevel UNet for Pancreas Segmentation from Non-Contrast CT Scans through Domain Adaptation
    Sriram, Sai Aditya
    Paul, Angshuman
    Zhu, Yingying
    Sandfort, Veit
    Pickhardt, Perry J.
    Summers, Ronald M.
    MEDICAL IMAGING 2020: COMPUTER-AIDED DIAGNOSIS, 2020, 11314
  • [37] ADID-UNET—a segmentation model for COVID-19 infection from lung CT scans
    Raj A.N.J.
    Zhu H.
    Khan A.
    Zhuang Z.
    Yang Z.
    Mahesh G.V.V.
    Karthik G.
    PeerJ Computer Science, 2021, 7 : 1 - 34
  • [38] PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
    Chen, Danny
    Yang, Wenzhong
    Wang, Liejun
    Tan, Sixiang
    Lin, Jiangzhaung
    Bu, Wenxiu
    PLOS ONE, 2022, 17 (01):
  • [39] Combining Swin Transformer With UNet for Remote Sensing Image Semantic Segmentation
    Fan, Lili
    Zhou, Yu
    Liu, Hongmei
    Li, Yunjie
    Cao, Dongpu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 11
  • [40] PDAtt-Unet: Pyramid Dual-Decoder Attention Unet for Covid-19 infection segmentation from CT-scans
    Bougourzi, Fares
    Distante, Cosimo
    Dornaika, Fadi
    Taleb-Ahmed, Abdelmalik
    MEDICAL IMAGE ANALYSIS, 2023, 86