Predicting early-stage coronary artery disease using machine learning and routine clinical biomarkers improved by augmented virtual data

被引:2
|
作者
Koloi, Angela [1 ,2 ]
Loukas, Vasileios S. [1 ]
Hourican, Cillian [4 ]
Sakellarios, Antonis, I [1 ,5 ]
Quax, Rick [4 ]
Mishra, Pashupati P. [6 ,7 ,8 ]
Lehtimaeki, Terho [6 ,7 ,8 ]
Raitakari, Olli T. [9 ,10 ,11 ]
Papaloukas, Costas [2 ]
Bosch, Jos A. [3 ]
Maerz, Winfried [12 ,13 ,14 ]
Fotiadis, Dimitrios, I [1 ,15 ]
机构
[1] Univ Ioannina, Dept Mat Sci & Engn, Unit Med Technol & Intelligent Informat Syst, Ioannina, Greece
[2] Univ Ioannina, Dept Biol Applicat & Technol, Ioannina, Greece
[3] Univ Amsterdam, Dept Clin Psychol, Amsterdam, Netherlands
[4] Univ Amsterdam, Inst Informat, Comp Sci Lab, Amsterdam, Netherlands
[5] Univ Patras, Dept Mech Engn & Aeronaut, Biomed Engn, Patras, Greece
[6] Tampere Univ, Fac Med & Hlth Technol, Dept Clin Chem, Tampere, Finland
[7] Tampere Univ, Fac Med & Hlth Technol, Finnish Cardiovasc Res Ctr Tampere, Tampere, Finland
[8] Fimlab Labs, Dept Clin Chem, Tampere, Finland
[9] Univ Turku, Res Ctr Appl & Prevent Cardiovasc Med, Turku, Finland
[10] Turku Univ Hosp, Dept Clin Physiol & Nucl Med, Turku, Finland
[11] Univ Turku, Turku Univ Hosp, Ctr Populat Hlth Res, Turku, Finland
[12] Heidelberg Univ, Dept Internal Med 5, Mannheim, Germany
[13] Med Univ Graz, Clin Inst Med & Chem Lab Diag, Graz, Austria
[14] SYNLAB Holding Deutschland GmbH, Augsburg, Germany
[15] FORTH IMBB, Dept Biomed Res, GR-45110 Ioannina, Greece
来源
关键词
Coronary artery disease; Machine learning; Classification algorithms; Data Augmentation; CARDIOVASCULAR RISK; HEART; FUTURE; SCORE;
D O I
10.1093/ehjdh/ztae049
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aims Coronary artery disease (CAD) is a highly prevalent disease with modifiable risk factors. In patients with suspected obstructive CAD, evaluating the pre-test probability model is crucial for diagnosis, although its accuracy remains controversial. Machine learning (ML) predictive models can help clinicians detect CAD early and improve outcomes. This study aimed to identify early-stage CAD using ML in conjunction with a panel of clinical and laboratory tests.Methods and results The study sample included 3316 patients enrolled in the Ludwigshafen Risk and Cardiovascular Health (LURIC) study. A comprehensive array of attributes was considered, and an ML pipeline was developed. Subsequently, we utilized five approaches to generating high-quality virtual patient data to improve the performance of the artificial intelligence models. An extension study was carried out using data from the Young Finns Study (YFS) to assess the results' generalizability. Upon applying virtual augmented data, accuracy increased by approximately 5%, from 0.75 to -0.79 for random forests (RFs), and from 0.76 to -0.80 for Gradient Boosting (GB). Sensitivity showed a significant boost for RFs, rising by about 9.4% (0.81-0.89), while GB exhibited a 4.8% increase (0.83-0.87). Specificity showed a significant boost for RFs, rising by similar to 24% (from 0.55 to 0.70), while GB exhibited a 37% increase (from 0.51 to 0.74). The extension analysis aligned with the initial study.Conclusion Accurate predictions of angiographic CAD can be obtained using a set of routine laboratory markers, age, sex, and smoking status, holding the potential to limit the need for invasive diagnostic techniques. The extension analysis in the YFS demonstrated the potential of these findings in a younger population, and it confirmed applicability to atherosclerotic vascular disease.Lay summary Using virtual population generation techniques, this study improved the accuracy of a machine learning model designed to identify early-stage CAD using standard laboratory tests.
引用
收藏
页码:542 / 550
页数:9
相关论文
共 50 条
  • [1] Early-Stage Detection of Ovarian Cancer Based on Clinical Data Using Machine Learning Approaches
    Ahamad, Md Martuza
    Aktar, Sakifa
    Uddin, Md Jamal
    Rahman, Tasnia
    Alyami, Salem A.
    Al-Ashhab, Samer
    Akhdar, Hanan Fawaz
    Azad, A. K. M.
    Moni, Mohammad Ali
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (08):
  • [2] HOSTILITY AND EARLY-STAGE CORONARY-ARTERY DISEASE (CAD)
    BAREFOOT, JC
    PATTERSON, J
    HANEY, T
    WILLIAMS, RB
    PSYCHOSOMATIC MEDICINE, 1991, 53 (02): : 234 - 234
  • [3] Early-Stage Alzheimer's Disease Prediction Using Machine Learning Models
    Kavitha, C.
    Mani, Vinodhini
    Srividhya, S. R.
    Khalaf, Osamah Ibrahim
    Tavera Romero, Carlos Andres
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [4] Predicting early risk of chronic kidney disease in cats using routine clinical laboratory tests and machine learning
    Bradley, Richard
    Tagkopoulos, Ilias
    Kim, Minseung
    Kokkinos, Yiannis
    Panagiotakos, Theodoros
    Kennedy, James
    De Meyer, Geert
    Watson, Phillip
    Elliott, Jonathan
    JOURNAL OF VETERINARY INTERNAL MEDICINE, 2019, 33 (06) : 2644 - 2656
  • [5] Predicting Prognosis of Early-Stage Mycosis Fungoides with Utilization of Machine Learning
    Mendi, Banu Ismail
    Sanli, Hatice
    Insel, Mert Akin
    Aydemir, Beliz Bayindir
    Atak, Mehmet Fatih
    LIFE-BASEL, 2024, 14 (11):
  • [6] CT Radiomic Features and Clinical Biomarkers for Predicting Coronary Artery Disease
    Carmelo Militello
    Francesco Prinzi
    Giulia Sollami
    Leonardo Rundo
    Ludovico La Grutta
    Salvatore Vitabile
    Cognitive Computation, 2023, 15 : 238 - 253
  • [7] Predicting coronary artery calcification using machine learning algorithms
    sun, Y. V.
    Bielak, L. F.
    Pevser, P. A.
    Turner, S. T.
    Sheed, P. F., II
    Boerwinkle, E.
    Kardial, S. L. R.
    GENETIC EPIDEMIOLOGY, 2007, 31 (05) : 499 - 499
  • [8] CT Radiomic Features and Clinical Biomarkers for Predicting Coronary Artery Disease
    Militello, Carmelo
    Prinzi, Francesco
    Sollami, Giulia
    Rundo, Leonardo
    La Grutta, Ludovico
    Vitabile, Salvatore
    COGNITIVE COMPUTATION, 2023, 15 (01) : 238 - 253
  • [9] Early-Stage Lung Cancer Detection Using Machine Learning
    Sreedevi, J.
    Bai, M. Rama
    Sujini, G. Naga
    Mahesh, Muthyala
    Satyanarayana, B.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2021, 14 (05): : 306 - 313
  • [10] Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data
    Gould, Michael K.
    Huang, Brian Z.
    Tammemagi, Martin C.
    Kinar, Yaron
    Shiff, Ron
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2021, 204 (04) : 445 - 453