Automatic segmentation of echocardiographic images using a shifted windows vision transformer architecture

被引:0
|
作者
Nemri, Souha [1 ]
Duong, Luc [1 ]
机构
[1] Ecole Technol Super, Software & IT Engn Dept, Intervent Imaging Lab LIVE, 1100 Notre Dame St West, Montreal, PQ H3C 1K3, Canada
来源
基金
加拿大自然科学与工程研究理事会;
关键词
echocardiography; semantic segmentation; left ventricle; transformers; U-Net;
D O I
10.1088/2057-1976/ad7594
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Echocardiography is one the most commonly used imaging modalities for the diagnosis of congenital heart disease. Echocardiographic image analysis is crucial to obtaining accurate cardiac anatomy information. Semantic segmentation models can be used to precisely delimit the borders of the left ventricle, and allow an accurate and automatic identification of the region of interest, which can be extremely useful for cardiologists. In the field of computer vision, convolutional neural network (CNN) architectures remain dominant. Existing CNN approaches have proved highly efficient for the segmentation of various medical images over the past decade. However, these solutions usually struggle to capture long-range dependencies, especially when it comes to images with objects of different scales and complex structures. In this study, we present an efficient method for semantic segmentation of echocardiographic images that overcomes these challenges by leveraging the self-attention mechanism of the Transformer architecture. The proposed solution extracts long-range dependencies and efficiently processes objects at different scales, improving performance in a variety of tasks. We introduce Shifted Windows Transformer models (Swin Transformers), which encode both the content of anatomical structures and the relationship between them. Our solution combines the Swin Transformer and U-Net architectures, producing a U-shaped variant. The validation of the proposed method is performed with the EchoNet-Dynamic dataset used to train our model. The results show an accuracy of 0.97, a Dice coefficient of 0.87, and an Intersection over union (IoU) of 0.78. Swin Transformer models are promising for semantically segmenting echocardiographic images and may help assist cardiologists in automatically analyzing and measuring complex echocardiographic images.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] An Improved Deep Network Model to Isolate Lung Nodules from Histopathological Images Using an Orchestrated and Shifted Window Vision Transformer
    Sabitha, Ponnan
    Canessane, Ramalingam Aroul
    Minu, Manickarasi Sivathanu Pillai
    Gowri, Vinayagamoorthy
    Vigil, Maria Soosai Antony
    TRAITEMENT DU SIGNAL, 2024, 41 (04) : 2081 - 2091
  • [32] Hippocampus substructure segmentation using morphological vision transformer learning
    Lei, Yang
    Ding, Yifu
    Qiu, Richard L. J.
    Wang, Tonghe
    Roper, Justin
    Fu, Yabo
    Shu, Hui-Kuo
    Mao, Hui
    Yang, Xiaofeng
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (23):
  • [33] Privacy-Preserving Semantic Segmentation Using Vision Transformer
    Kiya, Hitoshi
    Nagamori, Teru
    Imaizumi, Shoko
    Shiota, Sayaka
    JOURNAL OF IMAGING, 2022, 8 (09)
  • [34] Image segmentation using Vision Transformer for tunnel defect assessment
    Qin, Shaojie
    Qi, Taiyue
    Deng, Tang
    Huang, Xiaodong
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (21) : 3243 - 3268
  • [35] Captioning Remote Sensing Images Using Transformer Architecture
    Nanal, Wrucha
    Hajiarbabi, Mohammadreza
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 413 - 418
  • [36] MULTI-SCALE SWIN TRANSFORMER ENABLED AUTOMATIC DETECTION AND SEGMENTATION OF LUNG METASTASES USING CT IMAGES
    Masood, Anum
    Naseem, Usman
    Razzak, Imran
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [37] EfficientUNetViT: Efficient Breast Tumor Segmentation Utilizing UNet Architecture and Pretrained Vision Transformer
    Anari, Shokofeh
    de Oliveira, Gabriel Gomes
    Ranjbarzadeh, Ramin
    Alves, Angela Maria
    Vaz, Gabriel Caumo
    Bendechache, Malika
    BIOENGINEERING-BASEL, 2024, 11 (09):
  • [38] Improving Citrus Fruit Classification with X-ray Images Using Features Enhanced Vision Transformer Architecture
    Raza, Syed Mudassir
    Raza, Awais
    Babeker, Mohamed Ibrahim Abdallh
    Zia-Ul Haq, Zia-Ul
    Islam, Muhammad Adnan
    Li, Shanjun
    FOOD ANALYTICAL METHODS, 2024, 17 (11) : 1523 - 1539
  • [39] Hypothalamus fully automatic segmentation from MR images using an U-Net based architecture
    Rodrigues, Livia
    Rezende, Thiago
    Zanesco, Ariane
    Hernandez, Ana Luisa
    Franca, Marcondes
    Rittner, Leticia
    15TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2020, 11330
  • [40] Automatic Polyp Segmentation in Colonoscopy Images Using a Modified Deep Convolutional Encoder-Decoder Architecture
    Eu, Chin Yii
    Tang, Tong Boon
    Lin, Cheng-Hung
    Lee, Lok Hua
    Lu, Cheng-Kai
    SENSORS, 2021, 21 (16)