Improving Coronary Heart Disease Prediction Through Machine Learning and an Innovative Data Augmentation Technique

Cited by: 6
Authors
Al-Ssulami, Abdulrakeeb M. [1]
Alsorori, Randh S. [1]
Azmi, Aqil M. [2]
Aboalsamh, Hatim [2]
Affiliations
[1] Taiz Univ, Fac Appl Sci, Dept Comp Sci, Taiz, Yemen
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh 11543, Saudi Arabia
Keywords
Coronary heart disease; Bagging algorithm; Decision tree; Random forest; Dataset augmentation; NEURAL-NETWORKS; SYSTEM; DIAGNOSIS
DOI
10.1007/s12559-023-10151-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Coronary heart disease (CHD) is a leading cause of death globally, with over 382,000 deaths in the USA alone in 2020. Early detection of CHD is critical to reducing mortality. Artificial intelligence (AI) is a constantly evolving field of computer science that employs computational models to extract insights from past data and provide rapid and accurate predictions for future cases. This paper presents a novel approach that generates an augmented dataset by selectively duplicating misclassified instances during the leave-one-out cross-validation (CV) process to overfit a model. We paired machine learning models with the augmented dataset to evaluate several classifiers. The comprehensive heart disease dataset [1] served as our base dataset. Training on the augmented dataset yielded higher accuracy than training on the base dataset, with the bagged decision tree (DT) algorithm outperforming state-of-the-art models and achieving an accuracy of 97.1% in the 10-fold CV test. Further experiments using the Cleveland dataset and the same 10-fold CV test resulted in an even higher accuracy of 99.2%. Combining an augmented dataset with the bagged-DT algorithm holds great promise for early CHD prediction, which could help reduce CHD mortality rates. The use of AI in early CHD prediction could potentially make the difference between life and death for the patient.
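The augmentation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how many copies are added or whether duplicates are inserted during or after the leave-one-out pass, so the sketch assumes one copy per misclassified instance, appended after the pass. The function names (`augment_by_loo_errors`, `nearest_centroid_predict`) are hypothetical, and a simple nearest-centroid rule stands in for the bagged decision tree so that the example needs only the Python standard library.

```python
from typing import Callable, Dict, List, Tuple

Row = List[float]
# A classifier here is any function (train_X, train_y, x) -> predicted label.
Predictor = Callable[[List[Row], List[int], Row], int]


def nearest_centroid_predict(train_X: List[Row], train_y: List[int], x: Row) -> int:
    """Stand-in classifier (NOT the paper's bagged DT): pick the class whose
    feature-wise mean is closest to x in squared Euclidean distance."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for row, label in zip(train_X, train_y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1

    def sq_dist(label: int) -> float:
        n = counts[label]
        return sum((s / n - v) ** 2 for s, v in zip(sums[label], x))

    return min(sorted(sums), key=sq_dist)


def augment_by_loo_errors(
    X: List[Row], y: List[int], predict: Predictor, copies: int = 1
) -> Tuple[List[Row], List[int], List[int]]:
    """Run leave-one-out CV; duplicate each held-out instance that the model
    misclassifies. Returns (X_aug, y_aug, indices_of_misclassified)."""
    misclassified = []
    for i in range(len(X)):
        # Train on everything except instance i, then test on instance i.
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if predict(train_X, train_y, X[i]) != y[i]:
            misclassified.append(i)

    # Assumption: duplicates are appended once the full pass has finished.
    X_aug, y_aug = list(X), list(y)
    for i in misclassified:
        X_aug.extend([X[i]] * copies)
        y_aug.extend([y[i]] * copies)
    return X_aug, y_aug, misclassified


if __name__ == "__main__":
    # Toy one-feature data (made up for illustration, not from any dataset).
    X = [[0.0], [1.0], [2.0], [8.0], [9.0], [10.0], [4.0]]
    y = [0, 0, 0, 1, 1, 1, 1]
    X_aug, y_aug, wrong = augment_by_loo_errors(X, y, nearest_centroid_predict)
    print("misclassified indices:", wrong)
    print("augmented size:", len(X_aug))
```

In this toy data only the borderline instance `[4.0]` is misclassified when held out, so it is the only one duplicated. Any classifier with the same `(train_X, train_y, x)` call shape, such as a bagged decision tree, could be passed as `predict`; the augmented data would then be evaluated with a separate 10-fold CV as the abstract describes.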
Pages: 1687-1702
Page count: 16