Deep Learning-Based COVID-19 Pneumonia Classification Using Chest CT Images: Model Generalizability

被引:12
|
作者
Nguyen, Dan [1 ,2 ]
Kay, Fernando [3 ]
Tan, Jun [2 ]
Yan, Yulong [2 ]
Ng, Yee Seng [3 ]
Iyengar, Puneeth [2 ]
Peshock, Ron [3 ]
Jiang, Steve [1 ,2 ]
机构
[1] Univ Texas Southwestern Med Ctr Dallas, Med Artificial Intelligence & Automat MAIA Lab, Dallas, TX 75390 USA
[2] Univ Texas Southwestern Med Ctr Dallas, Dept Radiat Oncol, Dallas, TX 75390 USA
[3] Univ Texas Southwestern Med Ctr Dallas, Dept Radiol, Dallas, TX USA
来源
关键词
deep learning; generalizability; convolutional neural network; classification; computed tomography; COVID-19; SARS-CoV-2; DIAGNOSIS; FEATURES;
D O I
10.3389/frai.2021.694875
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since the outbreak of the COVID-19 pandemic, worldwide research efforts have focused on using artificial intelligence (AI) technologies on various medical data of COVID-19-positive patients in order to identify or classify various aspects of the disease, with promising reported results. However, concerns have been raised over their generalizability, given the heterogeneous factors in training datasets. This study aims to examine the severity of this problem by evaluating deep learning (DL) classification models trained to identify COVID-19-positive patients on 3D computed tomography (CT) datasets from different countries. We collected one dataset at UT Southwestern (UTSW) and three external datasets from different countries: CC-CCII Dataset (China), COVID-CTset (Iran), and MosMedData (Russia). We divided the data into two classes: COVID-19-positive and COVID-19- negative patients. We trained nine identical DL-based classification models by using combinations of datasets with a 72% train, 8% validation, and 20% test data split. Themodels trained on a single dataset achieved accuracy/area under the receiver operating characteristic curve (AUC) values of 0.87/0.826 (UTSW), 0.97/0.988 (CC-CCCI), and 0.86/0.873 (COVID-CTset) when evaluated on their own dataset. The models trained on multiple datasets and evaluated on a test set from one of the datasets used for training performed better. However, the performance dropped close to an AUC of 0.5 (random guess) for all models when evaluated on a different dataset outside of its training datasets. Including MosMedData, which only contained positive labels, into the training datasets did not necessarily help the performance of other datasets. Multiple factors likely contributed to these results, such as patient demographics and differences in image acquisition or reconstruction, causing a data shift among different study cohorts.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Deep Learning-based COVID-19 Pneumonia Classification Using Chest CT Images: Model Generalizability
    Nguyen, D.
    Kay, F.
    Tan, J.
    Yan, Y.
    Ng, Y.
    Iyengar, P.
    Peshock, R.
    Jiang, S.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [2] A Hybrid Deep Transfer Learning Model With Kernel Metric for COVID-19 Pneumonia Classification Using Chest CT Images
    Li, Jianyuan
    Luo, Xiong
    Ma, Huimin
    Zhao, Wenbing
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (04) : 2506 - 2517
  • [3] Classification of COVID-19 Chest CT Images Based on Ensemble Deep Learning
    Li, Xiaoshuo
    Tan, Wenjun
    Liu, Pan
    Zhou, Qinghua
    Yang, Jinzhu
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021 (2021)
  • [4] COVID-19 classification based on a deep learning and machine learning fusion technique using chest CT images
    Salama, Gerges M.
    Mohamed, Asmaa
    Abd-Ellah, Mahmoud Khaled
    NEURAL COMPUTING & APPLICATIONS, 2023, 36 (10): : 5347 - 5365
  • [5] COVID-19 classification based on a deep learning and machine learning fusion technique using chest CT images
    Gerges M. Salama
    Asmaa Mohamed
    Mahmoud Khaled Abd-Ellah
    Neural Computing and Applications, 2024, 36 : 5347 - 5365
  • [6] Deep Ensemble Model for COVID-19 Diagnosis and Classification Using Chest CT Images
    Ragab, Mahmoud
    Eljaaly, Khalid
    Alhakamy, Nabil A.
    Alhadrami, Hani A.
    Bahaddad, Adel A.
    Abo-Dahab, Sayed M.
    Khalil, Eied M.
    BIOLOGY-BASEL, 2022, 11 (01):
  • [7] Deep learning-based lesion subtyping and prediction of clinical outcomes in COVID-19 pneumonia using chest CT
    David Bermejo-Peláez
    Raúl San José Estépar
    María Fernández-Velilla
    Carmelo Palacios Miras
    Guillermo Gallardo Madueño
    Mariana Benegas
    Carolina Gotera Rivera
    Sandra Cuerpo
    Miguel Luengo-Oroz
    Jacobo Sellarés
    Marcelo Sánchez
    Gorka Bastarrika
    German Peces Barba
    Luis M. Seijo
    María J. Ledesma-Carbayo
    Scientific Reports, 12
  • [8] Deep learning-based lesion subtyping and prediction of clinical outcomes in COVID-19 pneumonia using chest CT
    Bermejo-Pelaez, David
    San Jose Estepar, Raul
    Fernandez-Velilla, Maria
    Palacios Miras, Carmelo
    Gallardo Madueno, Guillermo
    Benegas, Mariana
    Gotera Rivera, Carolina
    Cuerpo, Sandra
    Luengo-Oroz, Miguel
    Sellares, Jacobo
    Sanchez, Marcelo
    Bastarrika, Gorka
    Peces Barba, German
    Seijo, Luis M.
    Ledesma-Carbayo, Maria J.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [9] METHODOLOGY FOR IMPROVING DEEP LEARNING-BASED CLASSIFICATION FOR CT SCAN COVID-19 IMAGES
    Vijayalakshmi, D.
    Elangovan, Poonguzhali
    Nath, Malaya Kumar
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2024, 36 (03):
  • [10] Deep Ensemble Learning-Based Models for Diagnosis of COVID-19 from Chest CT Images
    Mouhafid, Mohamed
    Salah, Mokhtar
    Yue, Chi
    Xia, Kewen
    HEALTHCARE, 2022, 10 (01)