Hybrid transformer UNet for thyroid segmentation from ultrasound scans

Cited by: 18
Authors
Chi, Jianning [1, 2, 3]
Li, Zelan [1]
Sun, Zhiyi [1]
Yu, Xiaosheng [1]
Wang, Huan [1]
Affiliations
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110167, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110167, Peoples R China
[3] 3-11 Wenhua Rd, Shenyang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Thyroid gland segmentation; Ultrasound image processing; Deep learning; Attention mechanism; Transformer; IMAGE SEGMENTATION; NETWORK;
DOI
10.1016/j.compbiomed.2022.106453
Chinese Library Classification
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Deep learning based medical image segmentation methods have been widely used for thyroid gland segmentation from ultrasound images. Such segmentation is of great importance for the diagnosis of thyroid disease because it provides various valuable sonography features. However, existing thyroid gland segmentation models suffer from two problems: (1) low-level features that are significant in depicting thyroid boundaries are gradually lost during the feature encoding process, and (2) contextual features reflecting how the differences between the thyroid and other anatomies change during the ultrasound diagnosis process are either omitted by 2D convolutions or weakly represented by 3D convolutions due to high redundancy. In this work, we propose a novel hybrid transformer UNet (H-TUNet) to segment thyroid glands in ultrasound sequences, which consists of two parts: (1) a 2D Transformer UNet that applies a designed multi-scale cross-attention transformer (MSCAT) module on every skip connection of the UNet, so that the low-level features from different encoding layers are integrated and refined according to the high-level features in the decoding scheme, leading to a better representation of the differences between anatomies in one ultrasound frame; (2) a 3D Transformer UNet that applies a 3D self-attention transformer (SAT) module to the bottom layer of a 3D UNet, so that the contextual features representing visual differences between regions and consistencies within regions can be strengthened from successive frames in the video. The learning process of the H-TUNet is formulated as a unified end-to-end network, so the intra-frame feature extraction and inter-frame feature aggregation can be learned and optimized jointly. The proposed method was evaluated on the Thyroid Segmentation in Ultrasonography Dataset (TSUD) and the TG3k dataset. Experimental results demonstrate that our method outperforms other state-of-the-art methods on certain benchmarks for thyroid gland segmentation.
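The two attention mechanisms named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' MSCAT or SAT implementation: it assumes that decoder tokens act as queries over encoder tokens gathered from several scales, it omits the learned query/key/value projections, multiple heads, and normalization a real transformer block would have, and every name, shape, and channel count below is hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def flatten_tokens(feature_map):
    """Turn a (C, ...) feature map into a (num_positions, C) token matrix."""
    channels = feature_map.shape[0]
    return feature_map.reshape(channels, -1).T


def attention(queries, context):
    """Scaled dot-product attention with identity projections (an assumption).

    queries: (n_q, d), context: (n_c, d).
    Returns the attended features (n_q, d) and the weights (n_q, n_c).
    """
    d = queries.shape[-1]
    scores = queries @ context.T / np.sqrt(d)
    weights = softmax(scores, axis=-1)
    return weights @ context, weights


rng = np.random.default_rng(0)
channels = 8  # hypothetical shared channel count

# Cross-attention on a skip connection: decoder tokens query encoder tokens
# pooled from three (hypothetical) encoder scales.
encoder_maps = [rng.standard_normal((channels, s, s)) for s in (16, 8, 4)]
decoder_map = rng.standard_normal((channels, 8, 8))
context = np.concatenate([flatten_tokens(m) for m in encoder_maps], axis=0)
queries = flatten_tokens(decoder_map)
refined, cross_weights = attention(queries, context)
refined_map = refined.T.reshape(channels, 8, 8)  # back to (C, H, W)

# Self-attention at a 3D bottleneck: tokens from all frames attend to each other.
bottleneck = rng.standard_normal((channels, 4, 2, 2))  # (C, T, H, W)
video_tokens = flatten_tokens(bottleneck)
aggregated, self_weights = attention(video_tokens, video_tokens)
```

In this sketch each decoder position receives a weighted mixture of encoder features from all three scales, and each bottleneck position receives a mixture of features from every frame. That is the general idea behind refining skip features with decoder guidance and aggregating context across frames, under the assumptions stated above.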
Pages: 12
Related papers
50 in total
  • [1] BPAT-UNet: Boundary preserving assembled transformer UNet for ultrasound thyroid nodule segmentation
    Bi, Hui
    Cai, Chengjie
    Sun, Jiawei
    Jiang, Yibo
    Lu, Gang
    Shu, Huazhong
    Ni, Xinye
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 238
  • [2] Advancing Brain Tumor Segmentation in MRI Scans: Hybrid Attention-Residual UNET with Transformer Blocks
    Xavier, P. Sobha
    Sathish, P. K.
    Raju, G.
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2024, 20 (06) : 103 - 115
  • [3] MGB-Unet: An Improved Multiscale Unet with Bottleneck Transformer for Myositis Segmentation from Ultrasound Images
    Hussein, Allaa
    Youssef, Sherin
    Ahmed, Magdy A.
    Ghatwary, Noha
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025, 38 (01) : 217 - 228
  • [4] HUT: Hybrid UNet transformer for brain lesion and tumour segmentation
    Soh, Wei Kwek
    Yuen, Hing Yee
    Rajapakse, Jagath C.
    HELIYON, 2023, 9 (12)
  • [5] MWG-UNet++: Hybrid Transformer U-Net Model for Brain Tumor Segmentation in MRI Scans
    Lyu, Yu
    Tian, Xiaolin
    BIOENGINEERING-BASEL, 2025, 12 (02)
  • [6] HYBRID ATTENTIVE UNET FOR SEGMENTATION OF LOWER LEG MUSCLES AND BONES FROM MRI SCANS FOR MUSCULOSKELETAL RESEARCH
    Zhu, Jiayi
    Bolsterlee, Bart
    Chow, Brian V. Y.
    Song, Yang
    Meijering, Erik
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022
  • [7] Hybrid UNet transformer architecture for ischemic stoke segmentation with MRI and CT datasets
    Soh, Wei Kwek
    Rajapakse, Jagath C.
    FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [8] Hybrid Shunted Transformer embedding UNet for remote sensing image semantic segmentation
    Zhou H.
    Xiao X.
    Li H.
    Liu X.
    Liang P.
    Neural Computing and Applications, 2024, 36 (25) : 15705 - 15720
  • [9] RESIDUAL SWIN TRANSFORMER UNET WITH CONSISTENCY REGULARIZATION FOR AUTOMATIC BREAST ULTRASOUND TUMOR SEGMENTATION
    Zhuang, Xianwei
    Zhu, Xiner
    Hu, Haoji
    Yao, Jincao
    Li, Wei
    Yang, Chen
    Wang, Liping
    Feng, Na
    Xu, Dong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3071 - 3075
  • [10] InvUNET: Involuted UNET for Breast Tumor Segmentation from Ultrasound
    Chavan, Trupti
    Prajapati, Kalpesh
    Rao, Kameshwar J., V
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2022, 2022, 13263 : 283 - 290