Predicting early-stage coronary artery disease using machine learning and routine clinical biomarkers improved by augmented virtual data

被引:2
|
作者
Koloi, Angela [1 ,2 ]
Loukas, Vasileios S. [1 ]
Hourican, Cillian [4 ]
Sakellarios, Antonis, I [1 ,5 ]
Quax, Rick [4 ]
Mishra, Pashupati P. [6 ,7 ,8 ]
Lehtimaeki, Terho [6 ,7 ,8 ]
Raitakari, Olli T. [9 ,10 ,11 ]
Papaloukas, Costas [2 ]
Bosch, Jos A. [3 ]
Maerz, Winfried [12 ,13 ,14 ]
Fotiadis, Dimitrios, I [1 ,15 ]
机构
[1] Univ Ioannina, Dept Mat Sci & Engn, Unit Med Technol & Intelligent Informat Syst, Ioannina, Greece
[2] Univ Ioannina, Dept Biol Applicat & Technol, Ioannina, Greece
[3] Univ Amsterdam, Dept Clin Psychol, Amsterdam, Netherlands
[4] Univ Amsterdam, Inst Informat, Comp Sci Lab, Amsterdam, Netherlands
[5] Univ Patras, Dept Mech Engn & Aeronaut, Biomed Engn, Patras, Greece
[6] Tampere Univ, Fac Med & Hlth Technol, Dept Clin Chem, Tampere, Finland
[7] Tampere Univ, Fac Med & Hlth Technol, Finnish Cardiovasc Res Ctr Tampere, Tampere, Finland
[8] Fimlab Labs, Dept Clin Chem, Tampere, Finland
[9] Univ Turku, Res Ctr Appl & Prevent Cardiovasc Med, Turku, Finland
[10] Turku Univ Hosp, Dept Clin Physiol & Nucl Med, Turku, Finland
[11] Univ Turku, Turku Univ Hosp, Ctr Populat Hlth Res, Turku, Finland
[12] Heidelberg Univ, Dept Internal Med 5, Mannheim, Germany
[13] Med Univ Graz, Clin Inst Med & Chem Lab Diag, Graz, Austria
[14] SYNLAB Holding Deutschland GmbH, Augsburg, Germany
[15] FORTH IMBB, Dept Biomed Res, GR-45110 Ioannina, Greece
来源
关键词
Coronary artery disease; Machine learning; Classification algorithms; Data Augmentation; CARDIOVASCULAR RISK; HEART; FUTURE; SCORE;
D O I
10.1093/ehjdh/ztae049
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aims Coronary artery disease (CAD) is a highly prevalent disease with modifiable risk factors. In patients with suspected obstructive CAD, evaluating the pre-test probability model is crucial for diagnosis, although its accuracy remains controversial. Machine learning (ML) predictive models can help clinicians detect CAD early and improve outcomes. This study aimed to identify early-stage CAD using ML in conjunction with a panel of clinical and laboratory tests.Methods and results The study sample included 3316 patients enrolled in the Ludwigshafen Risk and Cardiovascular Health (LURIC) study. A comprehensive array of attributes was considered, and an ML pipeline was developed. Subsequently, we utilized five approaches to generating high-quality virtual patient data to improve the performance of the artificial intelligence models. An extension study was carried out using data from the Young Finns Study (YFS) to assess the results' generalizability. Upon applying virtual augmented data, accuracy increased by approximately 5%, from 0.75 to -0.79 for random forests (RFs), and from 0.76 to -0.80 for Gradient Boosting (GB). Sensitivity showed a significant boost for RFs, rising by about 9.4% (0.81-0.89), while GB exhibited a 4.8% increase (0.83-0.87). Specificity showed a significant boost for RFs, rising by similar to 24% (from 0.55 to 0.70), while GB exhibited a 37% increase (from 0.51 to 0.74). The extension analysis aligned with the initial study.Conclusion Accurate predictions of angiographic CAD can be obtained using a set of routine laboratory markers, age, sex, and smoking status, holding the potential to limit the need for invasive diagnostic techniques. The extension analysis in the YFS demonstrated the potential of these findings in a younger population, and it confirmed applicability to atherosclerotic vascular disease.Lay summary Using virtual population generation techniques, this study improved the accuracy of a machine learning model designed to identify early-stage CAD using standard laboratory tests.
引用
收藏
页码:542 / 550
页数:9
相关论文
共 50 条
  • [31] Early Diagnosis of Primary Immunodeficiency Disease Using Clinical Data and Machine Learning
    Mayampurath, Anoop
    Ajith, Aswathy
    Anderson-Smits, Colin
    Chang, Shun-Chiao
    Brouwer, Emily
    Johnson, Julie
    Baltasi, Michael
    Volchenboum, Samuel
    Devercelli, Giovanna
    Ciaccio, Christina E.
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE, 2022, 10 (11): : 3002 - +
  • [32] Predicting severe angiographic coronary artery disease using computerization of clinical and exercise test data
    Do, D
    Marcus, R
    Froelicher, V
    Janosi, A
    West, J
    Atwood, JE
    Myers, J
    Chilton, R
    Froning, J
    CHEST, 1998, 114 (05) : 1437 - 1445
  • [33] The application of machine learning for predicting recurrence in patients with early-stage endometrial cancer: a pilot study
    Akazawa, Munetoshi
    Hashimoto, Kazunori
    Noda, Katsuhiko
    Yoshida, Kaname
    OBSTETRICS & GYNECOLOGY SCIENCE, 2021, 64 (03) : 266 - 273
  • [34] Advanced Machine-Learning Technologies for Coronary Artery Disease Prediction Using Heterogeneous Data
    Alqulaity, Malak
    Yang, Po
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 54 - 61
  • [35] Predicting tumor recurrence on baseline MR imaging in patients with early-stage hepatocellular carcinoma using deep machine learning
    Kucukkaya, Ahmet Said
    Zeevi, Tal
    Chai, Nathan Xianming
    Raju, Rajiv
    Haider, Stefan Philipp
    Elbanan, Mohamed
    Petukhova-Greenstein, Alexandra
    Lin, MingDe
    Onofrey, John
    Nowak, Michal
    Cooper, Kirsten
    Thomas, Elizabeth
    Santana, Jessica
    Gebauer, Bernhard
    Mulligan, David
    Staib, Lawrence
    Batra, Ramesh
    Chapiro, Julius
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [36] Predicting tumor recurrence on baseline MR imaging in patients with early-stage hepatocellular carcinoma using deep machine learning
    Ahmet Said Kucukkaya
    Tal Zeevi
    Nathan Xianming Chai
    Rajiv Raju
    Stefan Philipp Haider
    Mohamed Elbanan
    Alexandra Petukhova-Greenstein
    MingDe Lin
    John Onofrey
    Michal Nowak
    Kirsten Cooper
    Elizabeth Thomas
    Jessica Santana
    Bernhard Gebauer
    David Mulligan
    Lawrence Staib
    Ramesh Batra
    Julius Chapiro
    Scientific Reports, 13
  • [37] Machine learning for the diagnosis of early-stage diabetes using temporal glucose profiles
    Lee, Woo Seok
    Jo, Junghyo
    Song, Taegeun
    JOURNAL OF THE KOREAN PHYSICAL SOCIETY, 2021, 78 (05) : 373 - 378
  • [38] Machine learning for the diagnosis of early-stage diabetes using temporal glucose profiles
    Woo Seok Lee
    Junghyo Jo
    Taegeun Song
    Journal of the Korean Physical Society, 2021, 78 : 373 - 378
  • [39] Novel and improved stage estimation in Parkinson's disease using clinical scales and machine learning
    Prashanth, R.
    Roy, Sumantra Dutta
    NEUROCOMPUTING, 2018, 305 : 78 - 103
  • [40] Support Vector Machine Outperforms Other Machine Learning Models in Early Diagnosis of Dengue Using Routine Clinical Data
    Qaiser, Ariba
    Manzoor, Sobia
    Hashmi, Asraf Hussain
    Javed, Hasnain
    Zafar, Anam
    Ashraf, Javed
    ADVANCES IN VIROLOGY, 2024, 2024