Predicting early-stage coronary artery disease using machine learning and routine clinical biomarkers improved by augmented virtual data

Cited by: 2
Authors
Koloi, Angela [1, 2]
Loukas, Vasileios S. [1]
Hourican, Cillian [4]
Sakellarios, Antonis I. [1, 5]
Quax, Rick [4]
Mishra, Pashupati P. [6, 7, 8]
Lehtimaeki, Terho [6, 7, 8]
Raitakari, Olli T. [9, 10, 11]
Papaloukas, Costas [2]
Bosch, Jos A. [3]
Maerz, Winfried [12, 13, 14]
Fotiadis, Dimitrios I. [1, 15]
Affiliations
[1] Univ Ioannina, Dept Mat Sci & Engn, Unit Med Technol & Intelligent Informat Syst, Ioannina, Greece
[2] Univ Ioannina, Dept Biol Applicat & Technol, Ioannina, Greece
[3] Univ Amsterdam, Dept Clin Psychol, Amsterdam, Netherlands
[4] Univ Amsterdam, Inst Informat, Comp Sci Lab, Amsterdam, Netherlands
[5] Univ Patras, Dept Mech Engn & Aeronaut, Biomed Engn, Patras, Greece
[6] Tampere Univ, Fac Med & Hlth Technol, Dept Clin Chem, Tampere, Finland
[7] Tampere Univ, Fac Med & Hlth Technol, Finnish Cardiovasc Res Ctr Tampere, Tampere, Finland
[8] Fimlab Labs, Dept Clin Chem, Tampere, Finland
[9] Univ Turku, Res Ctr Appl & Prevent Cardiovasc Med, Turku, Finland
[10] Turku Univ Hosp, Dept Clin Physiol & Nucl Med, Turku, Finland
[11] Univ Turku, Turku Univ Hosp, Ctr Populat Hlth Res, Turku, Finland
[12] Heidelberg Univ, Dept Internal Med 5, Mannheim, Germany
[13] Med Univ Graz, Clin Inst Med & Chem Lab Diag, Graz, Austria
[14] SYNLAB Holding Deutschland GmbH, Augsburg, Germany
[15] FORTH IMBB, Dept Biomed Res, GR-45110 Ioannina, Greece
Keywords
Coronary artery disease; Machine learning; Classification algorithms; Data augmentation; Cardiovascular risk; Heart; Future; Score
DOI
10.1093/ehjdh/ztae049
Chinese Library Classification (CLC) number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Aims: Coronary artery disease (CAD) is a highly prevalent disease with modifiable risk factors. In patients with suspected obstructive CAD, evaluating the pre-test probability model is crucial for diagnosis, although its accuracy remains controversial. Machine learning (ML) predictive models can help clinicians detect CAD early and improve outcomes. This study aimed to identify early-stage CAD using ML in conjunction with a panel of clinical and laboratory tests.
Methods and results: The study sample included 3316 patients enrolled in the Ludwigshafen Risk and Cardiovascular Health (LURIC) study. A comprehensive array of attributes was considered, and an ML pipeline was developed. Subsequently, we used five approaches to generating high-quality virtual patient data to improve the performance of the artificial intelligence models. An extension study was carried out using data from the Young Finns Study (YFS) to assess the generalizability of the results. After virtual augmented data were applied, accuracy increased by approximately 5%, from 0.75 to approximately 0.79 for random forests (RFs) and from 0.76 to approximately 0.80 for gradient boosting (GB). Sensitivity showed a significant boost for RFs, rising by about 9.4% (from 0.81 to 0.89), while GB exhibited a 4.8% increase (from 0.83 to 0.87). Specificity also showed a significant boost for RFs, rising by approximately 24% (from 0.55 to 0.70), while GB exhibited a 37% increase (from 0.51 to 0.74). The extension analysis aligned with the initial study.
Conclusion: Accurate predictions of angiographic CAD can be obtained using a set of routine laboratory markers, age, sex, and smoking status, holding the potential to limit the need for invasive diagnostic techniques. The extension analysis in the YFS demonstrated the potential of these findings in a younger population, and it confirmed applicability to atherosclerotic vascular disease.
Lay summary: Using virtual population generation techniques, this study improved the accuracy of a machine learning model designed to identify early-stage CAD using standard laboratory tests.
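As a reading aid for the metrics quoted in the abstract, the following minimal sketch shows how accuracy, sensitivity, and specificity are conventionally derived from confusion-matrix counts, and how a relative change between two values can be computed. It is not the authors' pipeline: the function names and the confusion-matrix counts are hypothetical, and only the accuracy values 0.75 and 0.79 are taken from the abstract.

```python
def confusion_metrics(tp, fn, tn, fp):
    """Return accuracy, sensitivity and specificity from confusion-matrix counts."""
    total = tp + fn + tn + fp
    return {
        "accuracy": (tp + tn) / total,      # share of all cases classified correctly
        "sensitivity": tp / (tp + fn),      # share of diseased cases detected
        "specificity": tn / (tn + fp),      # share of disease-free cases cleared
    }


def relative_change_pct(before, after):
    """Relative change from `before` to `after`, expressed in percent."""
    return 100.0 * (after - before) / before


# Hypothetical counts, chosen only to illustrate the definitions.
metrics = confusion_metrics(tp=80, fn=20, tn=30, fp=20)
print(metrics)

# Accuracy values reported in the abstract for random forests
# (before and after adding virtual augmented data).
rf_accuracy_gain = relative_change_pct(0.75, 0.79)
print(f"RF accuracy gain: {rf_accuracy_gain:.1f}%")
```

With the rounded accuracy values above, the relative gain is consistent with the "approximately 5%" stated in the abstract. Percentages recomputed from rounded figures need not match reported ones exactly, since the reported values may have been derived from unrounded results.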
Pages: 542-550
Number of pages: 9