Automatic segmentation of echocardiographic images using a shifted windows vision transformer architecture

Authors
Nemri, Souha [1]
Duong, Luc [1]
Affiliations
[1] Ecole Technol Super, Software & IT Engn Dept, Intervent Imaging Lab LIVE, 1100 Notre Dame St West, Montreal, PQ H3C 1K3, Canada
Source
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
echocardiography; semantic segmentation; left ventricle; transformers; U-Net;
DOI
10.1088/2057-1976/ad7594
Chinese Library Classification (CLC) number
R8 [Special Medicine]; R445 [Diagnostic Imaging];
Discipline classification code
1002 ; 100207 ; 1009 ;
Abstract
Echocardiography is one of the most commonly used imaging modalities for the diagnosis of congenital heart disease. Echocardiographic image analysis is crucial to obtaining accurate cardiac anatomy information. Semantic segmentation models can precisely delimit the borders of the left ventricle and allow accurate, automatic identification of the region of interest, which can be extremely useful for cardiologists. In the field of computer vision, convolutional neural network (CNN) architectures remain dominant. Existing CNN approaches have proved highly efficient for the segmentation of various medical images over the past decade. However, these solutions usually struggle to capture long-range dependencies, especially in images with objects of different scales and complex structures. In this study, we present an efficient method for semantic segmentation of echocardiographic images that overcomes these challenges by leveraging the self-attention mechanism of the Transformer architecture. The proposed solution extracts long-range dependencies and efficiently processes objects at different scales, improving performance in a variety of tasks. We introduce Shifted Windows Transformer models (Swin Transformers), which encode both the content of anatomical structures and the relationships between them. Our solution combines the Swin Transformer and U-Net architectures, producing a U-shaped variant. The proposed method is trained and validated on the EchoNet-Dynamic dataset. The results show an accuracy of 0.97, a Dice coefficient of 0.87, and an Intersection over Union (IoU) of 0.78. Swin Transformer models are promising for the semantic segmentation of echocardiographic images and may help cardiologists automatically analyze and measure complex echocardiographic images.
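The three metrics reported in the abstract (pixel accuracy, Dice coefficient, and IoU) can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function name `segmentation_metrics`, the nested-list mask format, and the toy masks are all assumptions made for illustration, and the convention of scoring two empty masks as perfect overlap is likewise an assumption.

```python
def segmentation_metrics(pred, truth):
    """Return (accuracy, dice, iou) for two binary masks.

    Both masks are nested lists of 0/1 values with identical, non-empty
    shapes (a hypothetical format chosen to keep the sketch dependency-free).
    """
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1      # foreground predicted and present
            elif p:
                fp += 1      # foreground predicted but absent
            elif t:
                fn += 1      # foreground missed
            else:
                tn += 1      # background correctly predicted
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    overlap_denominator = tp + fp + fn
    if overlap_denominator == 0:
        # Assumed convention: two empty masks overlap perfectly.
        return accuracy, 1.0, 1.0
    dice = 2 * tp / (2 * tp + fp + fn)
    iou = tp / overlap_denominator
    return accuracy, dice, iou


# Toy 4x4 masks (illustrative only, not taken from EchoNet-Dynamic).
pred = [[0, 1, 1, 0],
        [0, 1, 1, 0],
        [0, 0, 1, 0],
        [0, 0, 0, 0]]
truth = [[0, 1, 1, 0],
         [0, 1, 1, 0],
         [0, 1, 1, 0],
         [0, 0, 0, 0]]

accuracy, dice, iou = segmentation_metrics(pred, truth)
print(f"accuracy={accuracy:.4f} dice={dice:.4f} iou={iou:.4f}")
```

For a single pair of masks the two overlap scores are tied by Dice = 2·IoU / (1 + IoU), so an IoU of 0.78 corresponds to a Dice of about 0.88. Scores averaged over many images need not satisfy this identity exactly, which is compatible with the reported 0.87.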
Pages: 10
Related papers
50 in total
  • [1] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [2] SwinSight: a hierarchical vision transformer using shifted windows to leverage aerial image classification
    Pradhan P.K.
    Das A.
    Kumar A.
    Baruah U.
    Sen B.
    Ghosal P.
    Multimedia Tools and Applications, 2024, 83 (39) : 86457 - 86478
  • [3] Automatic Medical Image Segmentation with Vision Transformer
    Zhang, Jie
    Li, Fan
    Zhang, Xin
    Wang, Huaijun
    Hei, Xinhong
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [4] Vision Transformer With Hybrid Shifted Windows for Gastrointestinal Endoscopy Image Classification
    Wang, Wei
    Yang, Xin
    Tang, Jinhui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4452 - 4461
  • [5] Automatic Segmentation and Evaluation of Mitral Regurgitation Using Doppler Echocardiographic Images
    Liu, Guorong
    Wang, Yulong
    Cheng, Hanlin
    Shi, Zhongqing
    Qi, Zhanru
    Yao, Jing
    Luo, Shouhua
    Chen, Gong
    BIOENGINEERING-BASEL, 2024, 11 (11):
  • [6] Automatic segmentation of the heart muscle from echocardiographic images
    Sirbu, L
    Thijssen, J
    Florea, C
    Buzuloiu, V
    deKorte, C
    ISSCS 2005: International Symposium on Signals, Circuits and Systems, Vols 1 and 2, Proceedings, 2005, : 39 - 42
  • [7] Automatic segmentation of the left ventricle in echocardiographic images using convolutional neural networks
    Kim, Taeouk
    Hedayat, Mohammadali
    Vaitkus, Veronica V.
    Belohlavek, Marek
    Krishnamurthy, Vinayak
    Borazjani, Iman
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2021, 11 (05) : 1763 - 1781
  • [8] Detection of Floating Algae Blooms on Water Bodies Using PlanetScope Images and Shifted Windows Transformer Model
    Ahn, Jihye
    Kim, Kwangjin
    Kim, Yeji
    Kim, Hyunok
    Lee, Yangwon
    REMOTE SENSING, 2024, 16 (20)
  • [9] PSVT: Pyramid Shifted Window based Vision Transformer for cardiac image segmentation
    Zhang, Xingyu
    Liu, Jiacheng
    Xian, Xiaoli
    Chen, Bo
    Li, Dong
    Yang, Fei
    Zhang, Lei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 102
  • [10] A vision transformer architecture for the automated segmentation of retinal lesions in spectral domain optical coherence tomography images
    Daniel Philippi
    Kai Rothaus
    Mauro Castelli
    Scientific Reports, 13