Hybrid transformer UNet for thyroid segmentation from ultrasound scans

被引:18
|
作者
Chi, Jianning [1 ,2 ,3 ]
Li, Zelan [1 ]
Sun, Zhiyi [1 ]
Yu, Xiaosheng [1 ]
Wang, Huan [1 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110167, Peoples R China
[2] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110167, Peoples R China
[3] 3-11 Wenhua Rd, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
Thyroid gland segmentation; Ultrasound image processing; Deep learning; Attention mechanism; Transformer; IMAGE SEGMENTATION; NETWORK;
D O I
10.1016/j.compbiomed.2022.106453
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Deep learning based medical image segmentation methods have been widely used for thyroid gland segmen-tation from ultrasound images, which is of great importance for the diagnosis of thyroid disease since it can provide various valuable sonography features. However, existing thyroid gland segmentation models suffer from: (1) low-level features that are significant in depicting thyroid boundaries are gradually lost during the feature encoding process, (2) contextual features reflecting the changes of difference between thyroid and other anatomies in the ultrasound diagnosis process are either omitted by 2D convolutions or weakly represented by 3D convolutions due to high redundancy. In this work, we propose a novel hybrid transformer UNet (H-TUNet) to segment thyroid glands in ultrasound sequences, which consists of two parts: (1) a 2D Transformer UNet is proposed by utilizing a designed multi-scale cross-attention transformer (MSCAT) module on every skipped connection of the UNet, so that the low-level features from different encoding layers are integrated and refined according to the high-level features in the decoding scheme, leading to better representation of differences between anatomies in one ultrasound frame; (2) a 3D Transformer UNet is proposed by applying a 3D self-attention transformer (SAT) module to the very bottom layer of 3D UNet, so that the contextual features representing visual differences between regions and consistencies within regions could be strengthened from successive frames in the video. The learning process of the H-TUNet is formulated as a unified end-to-end network, so the intra-frame feature extraction and inter-frame feature aggregation can be learned and optimized jointly. The proposed method was evaluated on Thyroid Segmentation in Ultrasonography Dataset (TSUD) and TG3k Dataset. Experimental results have demonstrated that our method outperformed other state-of-the-art methods with respect to the certain benchmarks for thyroid gland segmentation.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation
    He, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Zhang, Di
    Yao, Rui
    Xue, Yong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [42] Attention Swin Transformer UNet for Landslide Segmentation in Remotely Sensed Images
    Liu, Bingxue
    Wang, Wei
    Wu, Yuming
    Gao, Xing
    REMOTE SENSING, 2024, 16 (23)
  • [43] New hybrid segmentation algorithm: UNet-GOA
    Yousefi, Tohid
    Aktas, Ozlem
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [44] Multi-scale nested UNet with transformer for colorectal polyp segmentation
    Wang, Zenan
    Liu, Zhen
    Yu, Jianfeng
    Gao, Yingxin
    Liu, Ming
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2024, 25 (06):
  • [45] SG-UNet: Hybrid self-guided transformer and U-Net fusion for CT image segmentation
    Lv, Chunjie
    Li, Biyuan
    Sun, Gaowei
    Wang, Xiuwei
    Cai, Pengfei
    Yan, Jun
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 108
  • [46] MLRT-UNet: An Efficient Multi-Level Relation Transformer Based U-Net for Thyroid Nodule Segmentation
    Haribabu, Kaku
    Prasath, R.
    Joe, I. R. Praveen
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2025,
  • [47] ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation
    Feng, Xiaomeng
    Wang, Taiping
    Yang, Xiaohang
    Zhang, Minfei
    Guo, Wanpeng
    Wang, Weina
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (01) : 128 - 144
  • [48] TransFSM: Fetal Anatomy Segmentation and Biometric Measurement in Ultrasound Images Using a Hybrid Transformer
    Zhao, Lei
    Tan, Guanghua
    Pu, Bin
    Wu, Qianghui
    Ren, Hongliang
    Li, Kenli
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (01) : 285 - 296
  • [49] HAU-Net: Hybrid CNN-transformer for breast ultrasound image segmentation
    Zhang, Huaikun
    Lian, Jing
    Yi, Zetong
    Wu, Ruichao
    Lu, Xiangyu
    Ma, Pei
    Ma, Yide
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [50] Hippocampus Segmentation with Transformer-UNet Fusion and Squeeze-And-Excitation Block
    Zhao, Yaping
    Jiang, Yizhang
    Qian, Zhenyu
    Nie, Lei
    Xia, Kaijian
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 420 - 430