An Efficient Computational Risk Prediction Model of Heart Diseases Based on Dual-Stage Stacked Machine Learning Approaches

被引:6
|
作者
Mondal, Subhash [1 ,2 ]
Maity, Ranjan [1 ]
Omo, Yachang [3 ]
Ghosh, Soumadip [4 ]
Nag, Amitava [1 ]
机构
[1] Cent Inst Technol Kokrajhar, Dept Comp Sci & Engn, Kokrajhar 783370, Assam, India
[2] Dayananda Sagar Univ, Dept Comp Sci & Engn AI & ML, Bengaluru 560078, Karnataka, India
[3] Cent Inst Technol Kokrajhar, Dept Civil Engn, Kokrajhar 783370, Assam, India
[4] Future Inst Engn & Management, Dept Comp Sci & Engn, Kolkata 700150, West Bengal, India
关键词
Cardiovascular disease (CVD); extreme gradient boost (XGB); hyper-parameter tuning; heart disease; random forest classifier; stacking ensemble technique;
D O I
10.1109/ACCESS.2024.3350996
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular diseases (CVDs) continue to be a prominent cause of global mortality, necessitating the development of effective risk prediction models to combat the rise in heart disease (HD) mortality rates. This work presents a novel dual-stage stacked machine learning (ML) based computational risk prediction model for cardiac disorders. Leveraging a dataset that includes eleven significant characteristics from 1190 patients from five distinct sources, five ML classifiers are utilized to create the initial prediction model. To ensure robustness and generalizability, the classifiers are cross-validated ten times. The model performance is optimized by employing two hyperparameter tuning approaches: RandomizedSearchCV and GridSearchCV. These methods aim to find the optimal estimator values. The highest-performing models, specifically Random Forest, Extreme Gradient Boost, and Decision Tree undergo additional refinement using a stacking ensemble technique. The stacking model, which leverages the capabilities of the three models, attains a remarkable accuracy rate of 96%, a recall value of 0.98, and a ROC-AUC score of 0.96. Notably, the rate of false-negative results is below 1%, demonstrating a high level of accuracy and a non-overfitted model. To evaluate the model's stability and repeatability, a comparable dataset consisting of 1000 occurrences is employed. The model consistently achieves an accuracy of 96.88% under identical experimental settings. This highlights the strength and dependability of the suggested computer model for predicting the risk of cardiac illnesses. The outcomes indicate that employing this two-step stacking ML method shows potential for prompt and precise diagnosis, hence aiding the worldwide endeavor to decrease fatalities caused by heart disease.
引用
收藏
页码:7255 / 7270
页数:16
相关论文
共 50 条
  • [1] Computational Learning Model for Prediction of Heart Disease Using Machine Learning Based on a New Regularizer
    Albahr, Abdulaziz
    Albahar, Marwan
    Thanoon, Mohammed
    Binsawad, Muhammad
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [2] A Computational Prediction Model of Blood-Brain Barrier Penetration Based on Machine Learning Approaches
    Ajabani, Deep Himmatbhai
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (12) : 485 - 493
  • [3] Model Prediction Based Dual-Stage Actuator Control in Discrete-Time Domain
    Lee, Choong Woo
    Suh, Sang Min
    IEEE TRANSACTIONS ON MAGNETICS, 2011, 47 (07) : 1830 - 1836
  • [4] Exploring potential dual-stage attention based recurrent neural network machine learning application for dosage prediction in intelligent municipal management
    Fang, Xusheng
    Zang, Jian
    Zhai, Zhengang
    Zhang, Li
    Shu, Ziyu
    Liang, Yuqi
    ENVIRONMENTAL SCIENCE-WATER RESEARCH & TECHNOLOGY, 2023, 9 (03) : 890 - 899
  • [5] Machine learning based efficient prediction of positive cases of waterborne diseases
    Hussain, Mushtaq
    Cifci, Mehmet Akif
    Sehar, Tayyaba
    Nabi, Said
    Cheikhrouhou, Omar
    Maqsood, Hasaan
    Ibrahim, Muhammad
    Mohammad, Fida
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [6] Machine learning based efficient prediction of positive cases of waterborne diseases
    Mushtaq Hussain
    Mehmet Akif Cifci
    Tayyaba Sehar
    Said Nabi
    Omar Cheikhrouhou
    Hasaan Maqsood
    Muhammad Ibrahim
    Fida Mohammad
    BMC Medical Informatics and Decision Making, 23
  • [7] Study on Machine Learning based Heart Disease Prediction Model
    Zhang, Shihan
    PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 346 - 352
  • [8] Development of Nonlaboratory-Based Risk Prediction Models for Cardiovascular Diseases Using Conventional and Machine Learning Approaches
    Sajid, Mirza Rizwan
    Almehmadi, Bader A.
    Sami, Waqas
    Alzahrani, Mansour K.
    Muhammad, Noryanti
    Chesneau, Christophe
    Hanif, Asif
    Khan, Arshad Ali
    Shahbaz, Ahmad
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (23)
  • [9] A novel deep learning carbon price short-term prediction model with dual-stage attention mechanism
    Wang, Yanfeng
    Qin, Ling
    Wang, Qingrui
    Chen, Yingqi
    Yang, Qing
    Xing, Lu
    Ba, Shusong
    APPLIED ENERGY, 2023, 347
  • [10] PCA-IEM-DARNN: An enhanced dual-stage deep learning prediction model for concrete dam deformation based on feature decomposition
    Kang, Xinyu
    Li, Yanlong
    Zhang, Ye
    Wen, Lifeng
    Sun, Xinjian
    Wang, Jing
    MEASUREMENT, 2025, 242