An Efficient Computational Risk Prediction Model of Heart Diseases Based on Dual-Stage Stacked Machine Learning Approaches

Cited by: 6
Authors
Mondal, Subhash [1, 2]
Maity, Ranjan [1 ]
Omo, Yachang [3 ]
Ghosh, Soumadip [4 ]
Nag, Amitava [1 ]
Affiliations
[1] Cent Inst Technol Kokrajhar, Dept Comp Sci & Engn, Kokrajhar 783370, Assam, India
[2] Dayananda Sagar Univ, Dept Comp Sci & Engn AI & ML, Bengaluru 560078, Karnataka, India
[3] Cent Inst Technol Kokrajhar, Dept Civil Engn, Kokrajhar 783370, Assam, India
[4] Future Inst Engn & Management, Dept Comp Sci & Engn, Kolkata 700150, West Bengal, India
Keywords
Cardiovascular disease (CVD); extreme gradient boost (XGB); hyper-parameter tuning; heart disease; random forest classifier; stacking ensemble technique
DOI
10.1109/ACCESS.2024.3350996
Chinese Library Classification (CLC) code
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Cardiovascular diseases (CVDs) remain a leading cause of global mortality, which calls for effective risk prediction models to counter rising heart disease (HD) mortality rates. This work presents a novel dual-stage stacked machine learning (ML) based computational risk prediction model for cardiac disorders. Using a dataset of eleven significant features from 1190 patients, drawn from five distinct sources, five ML classifiers are used to build the initial prediction model. To ensure robustness and generalizability, the classifiers are cross-validated ten times. Model performance is optimized with two hyperparameter tuning approaches, RandomizedSearchCV and GridSearchCV, which search for the optimal estimator values. The highest-performing models, specifically Random Forest, Extreme Gradient Boost, and Decision Tree, are then refined further using a stacking ensemble technique. The stacking model, which combines the three models, attains an accuracy of 96%, a recall of 0.98, and a ROC-AUC score of 0.96. Notably, the rate of false-negative results is below 1%, indicating high accuracy and a model that is not overfitted. To evaluate the model's stability and repeatability, a comparable dataset of 1000 instances is employed. The model consistently achieves an accuracy of 96.88% under identical experimental settings, which highlights the strength and dependability of the proposed computational model for predicting the risk of cardiac illness. The outcomes indicate that this two-step stacking ML method shows potential for prompt and precise diagnosis, thereby aiding the worldwide effort to reduce deaths caused by heart disease.
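The two stages described in the abstract (tuning the base classifiers under cross-validation, then stacking the best ones) can be sketched roughly as follows with scikit-learn. This is an illustrative sketch under stated assumptions, not the authors' implementation: the data are synthetic (`make_classification` sized to 1190 rows and 11 features), `GradientBoostingClassifier` stands in for Extreme Gradient Boost so that only scikit-learn is required, the parameter grids are arbitrary, and the logistic-regression meta-learner is an assumption because the abstract does not name one. The false-negative figure is computed as a share of all test samples, which is one possible reading of the abstract's "below 1%".

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    GradientBoostingClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (
    accuracy_score,
    confusion_matrix,
    recall_score,
    roc_auc_score,
)
from sklearn.model_selection import (
    GridSearchCV,
    RandomizedSearchCV,
    train_test_split,
)
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the 1190-patient, 11-feature dataset (NOT the real data).
X, y = make_classification(
    n_samples=1190, n_features=11, n_informative=6, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Stage 1: tune each base learner with 10-fold cross-validated search.
# The grids below are illustrative assumptions, not the paper's search space.
rf_search = RandomizedSearchCV(
    RandomForestClassifier(random_state=0),
    param_distributions={"n_estimators": [50, 100], "max_depth": [4, 8, None]},
    n_iter=3,
    cv=10,
    random_state=0,
).fit(X_train, y_train)

# GradientBoostingClassifier is a stand-in for XGBoost (assumption).
gb_search = GridSearchCV(
    GradientBoostingClassifier(n_estimators=50, random_state=0),
    param_grid={"learning_rate": [0.05, 0.1]},
    cv=10,
).fit(X_train, y_train)

dt_search = GridSearchCV(
    DecisionTreeClassifier(random_state=0),
    param_grid={"max_depth": [3, 5, 8]},
    cv=10,
).fit(X_train, y_train)

# Stage 2: stack the tuned base learners under a meta-learner.
# The logistic-regression final estimator is an assumption.
stack = StackingClassifier(
    estimators=[
        ("rf", rf_search.best_estimator_),
        ("gb", gb_search.best_estimator_),
        ("dt", dt_search.best_estimator_),
    ],
    final_estimator=LogisticRegression(max_iter=1000),
    cv=10,
)
stack.fit(X_train, y_train)

# Evaluate on the held-out split with the metrics named in the abstract.
y_pred = stack.predict(X_test)
y_prob = stack.predict_proba(X_test)[:, 1]
tn, fp, fn, tp = confusion_matrix(y_test, y_pred).ravel()

metrics = {
    "accuracy": accuracy_score(y_test, y_pred),
    "recall": recall_score(y_test, y_pred),
    "roc_auc": roc_auc_score(y_test, y_prob),
    # False negatives as a share of all test samples (one possible reading).
    "false_negative_share": fn / len(y_test),
}
print(metrics)
```

The numbers printed for synthetic data say nothing about the paper's reported results; the sketch only shows how the tuning and stacking steps fit together.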
Pages: 7255-7270
Page count: 16