Hybrid transformer UNet for thyroid segmentation from ultrasound scans

被引:18
|
作者
Chi, Jianning [1 ,2 ,3 ]
Li, Zelan [1 ]
Sun, Zhiyi [1 ]
Yu, Xiaosheng [1 ]
Wang, Huan [1 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110167, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110167, Peoples R China
[3] 3-11 Wenhua Rd, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
Thyroid gland segmentation; Ultrasound image processing; Deep learning; Attention mechanism; Transformer; IMAGE SEGMENTATION; NETWORK;
D O I
10.1016/j.compbiomed.2022.106453
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep learning based medical image segmentation methods have been widely used for thyroid gland segmen-tation from ultrasound images, which is of great importance for the diagnosis of thyroid disease since it can provide various valuable sonography features. However, existing thyroid gland segmentation models suffer from: (1) low-level features that are significant in depicting thyroid boundaries are gradually lost during the feature encoding process, (2) contextual features reflecting the changes of difference between thyroid and other anatomies in the ultrasound diagnosis process are either omitted by 2D convolutions or weakly represented by 3D convolutions due to high redundancy. In this work, we propose a novel hybrid transformer UNet (H-TUNet) to segment thyroid glands in ultrasound sequences, which consists of two parts: (1) a 2D Transformer UNet is proposed by utilizing a designed multi-scale cross-attention transformer (MSCAT) module on every skipped connection of the UNet, so that the low-level features from different encoding layers are integrated and refined according to the high-level features in the decoding scheme, leading to better representation of differences between anatomies in one ultrasound frame; (2) a 3D Transformer UNet is proposed by applying a 3D self-attention transformer (SAT) module to the very bottom layer of 3D UNet, so that the contextual features representing visual differences between regions and consistencies within regions could be strengthened from successive frames in the video. The learning process of the H-TUNet is formulated as a unified end-to-end network, so the intra-frame feature extraction and inter-frame feature aggregation can be learned and optimized jointly. The proposed method was evaluated on Thyroid Segmentation in Ultrasonography Dataset (TSUD) and TG3k Dataset. Experimental results have demonstrated that our method outperformed other state-of-the-art methods with respect to the certain benchmarks for thyroid gland segmentation.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A hybrid enhanced attention transformer network for medical ultrasound image segmentation
    Jiang, Tao
    Xing, Wenyu
    Yu, Ming
    Ta, Dean
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [22] SMESwin Unet: Merging CNN and Transformer for Medical Image Segmentation
    Wang, Ziheng
    Min, Xiongkuo
    Shi, Fangyu
    Jin, Ruinian
    Nawrin, Saida S.
    Yu, Ichen
    Nagatomi, Ryoichi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT V, 2022, 13435 : 517 - 526
  • [23] TransCUNet: UNet cross fused transformer for medical image segmentation
    Jiang, Shen
    Li, Jinjiang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [24] CSWin-UNet: Transformer UNet with cross-shaped windows for medical image segmentation
    Liu, Xiao
    Gao, Peng
    Yu, Tao
    Wang, Fei
    Yuan, Ru-Yue
    INFORMATION FUSION, 2025, 113
  • [25] Federated learning via multi-attention guided UNet for thyroid nodule segmentation of ultrasound images
    Xiang, Zhuo
    Tian, Xiaoyu
    Liu, Yiyao
    Chen, Minsi
    Zhao, Cheng
    Tang, Li-Na
    Xue, En-Sheng
    Zhou, Qi
    Shen, Bin
    Li, Fang
    Chen, Qin
    Xue, Hong-Yuan
    Tang, Qing
    Li, Ying-Jia
    Liang, Lei
    Wang, Bin
    Li, Quan-Shui
    Wu, Chang-Jun
    Ren, Tian-Tian
    Wu, Jin-Yu
    Wang, Tianfu
    Liu, Wen-Ying
    Yan, Kun
    Liu, Bo-Ji
    Sun, Li-Ping
    Zhao, Chong-Ke
    Xu, Hui-Xiong
    Lei, Baiying
    NEURAL NETWORKS, 2025, 181
  • [26] SGUNET: SEMANTIC GUIDED UNET FOR THYROID NODULE SEGMENTATION
    Pan, Huitong
    Zhou, Quan
    Latecki, Longin Jan
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 630 - 634
  • [27] TDD-UNet:Transformer with double decoder UNet for COVID-19 lesions segmentation
    Huang, Xuping
    Chen, Junxi
    Chen, Mingzhi
    Chen, Lingna
    Wan, Yaping
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [28] CTA-UNet: CNN-transformer architecture UNet for dental CBCT images segmentation
    Chen, Zeyu
    Chen, Senyang
    Hu, Fengjun
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (17):
  • [29] COMPARISON OF RADIOISOTOPE AND ULTRASOUND SCANS OF THYROID
    TAYLOR, KJW
    MCCREADY, VR
    PROCEEDINGS OF THE ROYAL SOCIETY OF MEDICINE-LONDON, 1975, 68 (06): : 381 - 384
  • [30] MCV-UNet: a modified convolution & transformer hybrid encoder-decoder network with multi-scale information fusion for ultrasound image semantic segmentation
    Xu, Zihong
    Wang, Ziyang
    PEERJ COMPUTER SCIENCE, 2024, 10