Interpretable Machine Learning for COVID-19: An Empirical Study on Severity Prediction Task

被引:2
|
作者
Wu H. [1 ]
Ruan W. [1 ]
Wang J. [2 ]
Zheng D. [2 ]
Liu B. [3 ]
Geng Y. [4 ]
Chai X. [4 ]
Chen J. [5 ]
Li K. [5 ]
Li S. [6 ]
Helal S. [7 ]
机构
[1] University of Exeter, Exeter
[2] Coventry University, Coventry
[3] 910 Hospital of Pla, Department of Gastroenterology, Beijing
[4] Hy Medical Technology, Scientific Research Department Beijing, Beijing
[5] Hospital of Sun Yat-sen University, Department of Radiology, Zhuhai
[6] Hospital of Sun Yat-sen University, Department of Radiology, Guangdong Provincial Key Laboratory of Biomedical Imaging, Zhuhai
[7] University of Florida, Gainesville, 32611, FL
来源
关键词
Artificial intelligence in health; artificial intelligence in medicine; interpretable machine learning;
D O I
10.1109/TAI.2021.3092698
中图分类号
学科分类号
摘要
The black-box nature of machine learning models hinders the deployment of some high-accuracy medical diagnosis algorithms. It is risky to put one's life in the hands of models that medical researchers do not fully understand or trust. However, through model interpretation, black-box models can promptly reveal significant biomarkers that medical practitioners may have overlooked due to the surge of infected patients in the COVID-19 pandemic. This research leverages a database of 92 patients with confirmed SARS-CoV-2 laboratory tests between 18th January 2020 and 5th March 2020, in Zhuhai, China, to identify biomarkers indicative of infection severity prediction. Through the interpretation of four machine learning models, decision tree, random forests, gradient boosted trees, and neural networks using permutation feature importance, partial dependence plot, individual conditional expectation, accumulated local effects, local interpretable model-agnostic explanations, and Shapley additive explanation, we identify an increase in N-terminal pro-brain natriuretic peptide, C-reaction protein, and lactic dehydrogenase, a decrease in lymphocyte is associated with severe infection and an increased risk of death, which is consistent with recent medical research on COVID-19 and other research using dedicated models. We further validate our methods on a large open dataset with 5644 confirmed patients from the Hospital Israelita Albert Einstein, at São Paulo, Brazil from Kaggle, and unveil leukocytes, eosinophils, and platelets as three indicative biomarkers for COVID-19. © 2020 IEEE.
引用
收藏
页码:764 / 777
页数:13
相关论文
共 50 条
  • [41] COVID-19 Outbreak Prediction by Using Machine Learning Algorithms
    Sher, Tahir
    Rehman, Abdul
    Kim, Dongsun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1561 - 1574
  • [42] Analysis and Prediction of COVID-19 in Xinjiang based on Machine Learning
    Liu, Yunxiang
    Xiao, Yan
    2020 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, COMPUTER TECHNOLOGY AND TRANSPORTATION (ISCTT 2020), 2020, : 382 - 385
  • [43] A Novel Machine Learning based Model for COVID-19 Prediction
    Mazen, Tamer Sh
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 523 - 531
  • [44] A Novel Machine Learning based Model for COVID-19 Prediction
    Sh. Mazen T.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (11): : 523 - 531
  • [45] Prediction of Online Psychological Help-Seeking Behavior During the COVID-19 Pandemic: An Interpretable Machine Learning Method
    Liu, Hui
    Zhang, Lin
    Wang, Weijun
    Huang, Yinghui
    Li, Shen
    Ren, Zhihong
    Zhou, Zongkui
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [46] An interpretable mortality prediction model for COVID-19 patients
    Yan, Li
    Zhang, Hai-Tao
    Goncalves, Jorge
    Xiao, Yang
    Wang, Maolin
    Guo, Yuqi
    Sun, Chuan
    Tang, Xiuchuan
    Jing, Liang
    Zhang, Mingyang
    Huang, Xiang
    Xiao, Ying
    Cao, Haosen
    Chen, Yanyan
    Ren, Tongxin
    Wang, Fang
    Xiao, Yaru
    Huang, Sufang
    Tan, Xi
    Huang, Niannian
    Jiao, Bo
    Cheng, Cheng
    Zhang, Yong
    Luo, Ailin
    Mombaerts, Laurent
    Jin, Junyang
    Cao, Zhiguo
    Li, Shusheng
    Xu, Hui
    Yuan, Ye
    NATURE MACHINE INTELLIGENCE, 2020, 2 (05) : 283 - +
  • [47] Clinical prognosis evaluation of COVID-19 patients: An interpretable hybrid machine learning approach
    Kocadagli, Ozan
    Baygul, Arzu
    Gokmen, Neslihan
    Incir, Said
    Aktan, Cagdas
    CURRENT RESEARCH IN TRANSLATIONAL MEDICINE, 2022, 70 (01)
  • [48] A large-scale machine learning study of sociodemographic factors contributing to COVID-19 severity
    Tumbas, Marko
    Markovic, Sofija
    Salom, Igor
    Djordjevic, Marko
    FRONTIERS IN BIG DATA, 2023, 6
  • [49] Interpretable deep learning for the prediction of ICU admission likelihood and mortality of COVID-19 patients
    Nazir, Amril
    Ampadu, Hyacinth Kwadwo
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [50] Machine Learning for Student QoE Prediction in Mobile Learning During COVID-19
    Korchani, Besma
    Sethom, Kaouthar
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 3, 2022, 451 : 14 - 22