Predicting early-stage coronary artery disease using machine learning and routine clinical biomarkers improved by augmented virtual data

被引:2
|
作者
Koloi, Angela [1 ,2 ]
Loukas, Vasileios S. [1 ]
Hourican, Cillian [4 ]
Sakellarios, Antonis, I [1 ,5 ]
Quax, Rick [4 ]
Mishra, Pashupati P. [6 ,7 ,8 ]
Lehtimaeki, Terho [6 ,7 ,8 ]
Raitakari, Olli T. [9 ,10 ,11 ]
Papaloukas, Costas [2 ]
Bosch, Jos A. [3 ]
Maerz, Winfried [12 ,13 ,14 ]
Fotiadis, Dimitrios, I [1 ,15 ]
机构
[1] Univ Ioannina, Dept Mat Sci & Engn, Unit Med Technol & Intelligent Informat Syst, Ioannina, Greece
[2] Univ Ioannina, Dept Biol Applicat & Technol, Ioannina, Greece
[3] Univ Amsterdam, Dept Clin Psychol, Amsterdam, Netherlands
[4] Univ Amsterdam, Inst Informat, Comp Sci Lab, Amsterdam, Netherlands
[5] Univ Patras, Dept Mech Engn & Aeronaut, Biomed Engn, Patras, Greece
[6] Tampere Univ, Fac Med & Hlth Technol, Dept Clin Chem, Tampere, Finland
[7] Tampere Univ, Fac Med & Hlth Technol, Finnish Cardiovasc Res Ctr Tampere, Tampere, Finland
[8] Fimlab Labs, Dept Clin Chem, Tampere, Finland
[9] Univ Turku, Res Ctr Appl & Prevent Cardiovasc Med, Turku, Finland
[10] Turku Univ Hosp, Dept Clin Physiol & Nucl Med, Turku, Finland
[11] Univ Turku, Turku Univ Hosp, Ctr Populat Hlth Res, Turku, Finland
[12] Heidelberg Univ, Dept Internal Med 5, Mannheim, Germany
[13] Med Univ Graz, Clin Inst Med & Chem Lab Diag, Graz, Austria
[14] SYNLAB Holding Deutschland GmbH, Augsburg, Germany
[15] FORTH IMBB, Dept Biomed Res, GR-45110 Ioannina, Greece
来源
关键词
Coronary artery disease; Machine learning; Classification algorithms; Data Augmentation; CARDIOVASCULAR RISK; HEART; FUTURE; SCORE;
D O I
10.1093/ehjdh/ztae049
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aims Coronary artery disease (CAD) is a highly prevalent disease with modifiable risk factors. In patients with suspected obstructive CAD, evaluating the pre-test probability model is crucial for diagnosis, although its accuracy remains controversial. Machine learning (ML) predictive models can help clinicians detect CAD early and improve outcomes. This study aimed to identify early-stage CAD using ML in conjunction with a panel of clinical and laboratory tests.Methods and results The study sample included 3316 patients enrolled in the Ludwigshafen Risk and Cardiovascular Health (LURIC) study. A comprehensive array of attributes was considered, and an ML pipeline was developed. Subsequently, we utilized five approaches to generating high-quality virtual patient data to improve the performance of the artificial intelligence models. An extension study was carried out using data from the Young Finns Study (YFS) to assess the results' generalizability. Upon applying virtual augmented data, accuracy increased by approximately 5%, from 0.75 to -0.79 for random forests (RFs), and from 0.76 to -0.80 for Gradient Boosting (GB). Sensitivity showed a significant boost for RFs, rising by about 9.4% (0.81-0.89), while GB exhibited a 4.8% increase (0.83-0.87). Specificity showed a significant boost for RFs, rising by similar to 24% (from 0.55 to 0.70), while GB exhibited a 37% increase (from 0.51 to 0.74). The extension analysis aligned with the initial study.Conclusion Accurate predictions of angiographic CAD can be obtained using a set of routine laboratory markers, age, sex, and smoking status, holding the potential to limit the need for invasive diagnostic techniques. The extension analysis in the YFS demonstrated the potential of these findings in a younger population, and it confirmed applicability to atherosclerotic vascular disease.Lay summary Using virtual population generation techniques, this study improved the accuracy of a machine learning model designed to identify early-stage CAD using standard laboratory tests.
引用
收藏
页码:542 / 550
页数:9
相关论文
共 50 条
  • [41] Early-Stage Apple Leaf Disease Prediction Using Deep Learning
    Gawade, Amit
    Deolekar, Subodh
    Patil, Vaishali
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2021, 14 (05): : 40 - 43
  • [42] Identification of biomarkers associated with phagocytosis regulatory factors in coronary artery disease using machine learning and network analysis
    Jia, Runan
    Li, Zhiya
    Du, Yingying
    Liu, Huixian
    Liang, Ruirui
    MAMMALIAN GENOME, 2025,
  • [43] Machine learning based model to diagnose obstructive coronary artery disease using calcium scoring, PET imaging, and clinical data
    van Dalen, J. A.
    Koenders, S. S.
    Metselaar, R. J.
    Vendel, B. N.
    Slotman, D. J.
    Mouden, M.
    Slump, C. H.
    van Dijk, J. D.
    JOURNAL OF NUCLEAR CARDIOLOGY, 2023, 30 (04) : 1504 - 1513
  • [44] Machine learning based model to diagnose obstructive coronary artery disease using calcium scoring, PET imaging, and clinical data
    J. A. van Dalen
    S. S. Koenders
    R. J. Metselaar
    B. N. Vendel
    D. J. Slotman
    M. Mouden
    C. H. Slump
    J. D. van Dijk
    Journal of Nuclear Cardiology, 2023, 30 : 1504 - 1513
  • [45] Predicting coronary heart disease in Chinese diabetics using machine learning
    Ma, Cai-Yi
    Luo, Ya-Mei
    Zhang, Tian-Yu
    Hao, Yu-Duo
    Xie, Xue-Qin
    Liu, Xiao-Wei
    Ren, Xiao-Lei
    He, Xiao-Lin
    Han, Yu-Mei
    Deng, Ke-Jun
    Yan, Dan
    Yang, Hui
    Tang, Hua
    Lin, Hao
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [46] Early-stage lung cancer detection via circulating microbial DNA biomarkers and machine learning classification
    Sepich-Poore, Gregory D.
    Fraraccio, Serena
    Wandro, Stephen
    Knight, Rob
    Miller-Montgomery, Sandrine
    Adams, Eddie
    CANCER RESEARCH, 2022, 82 (12)
  • [47] MACHINE LEARNING FOR PREDICTING ANGIOGRAPHIC PRESENCE OF ATHEROSCLEROTIC CORONARY ARTERY DISEASE IN YOUNG ADULTS
    Saleem, Maryam
    Yanamala, Naveena
    Zeb, Irfan
    Patel, Brijesh
    Patel, Heenaben
    Challa, Abhiram
    Sengupta, Partho
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2021, 77 (18) : 65 - 65
  • [48] A novel machine learning based technique for classification of early-stage Alzheimer’s disease using brain images
    Ruhul Amin Hazarika
    Debdatta Kandar
    Arnab Kumar Maji
    Multimedia Tools and Applications, 2024, 83 : 24277 - 24299
  • [49] A novel machine learning based technique for classification of early-stage Alzheimer's disease using brain images
    Hazarika, Ruhul Amin
    Kandar, Debdatta
    Maji, Arnab Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 24277 - 24299
  • [50] Evaluating Critical Values in Predicting Cardiomyopathy With Coronary Artery Disease Using Features Decided by Machine Learning Algorithms
    Vijayaraghavan, Sairamvinay
    English, Carter W.
    Flack, Jeffrey
    Mohapatra, Prasant
    Srivatsa, Uma N.
    CIRCULATION, 2021, 144