An Efficient Computational Risk Prediction Model of Heart Diseases Based on Dual-Stage Stacked Machine Learning Approaches

被引:6
|
作者
Mondal, Subhash [1 ,2 ]
Maity, Ranjan [1 ]
Omo, Yachang [3 ]
Ghosh, Soumadip [4 ]
Nag, Amitava [1 ]
机构
[1] Cent Inst Technol Kokrajhar, Dept Comp Sci & Engn, Kokrajhar 783370, Assam, India
[2] Dayananda Sagar Univ, Dept Comp Sci & Engn AI & ML, Bengaluru 560078, Karnataka, India
[3] Cent Inst Technol Kokrajhar, Dept Civil Engn, Kokrajhar 783370, Assam, India
[4] Future Inst Engn & Management, Dept Comp Sci & Engn, Kolkata 700150, West Bengal, India
关键词
Cardiovascular disease (CVD); extreme gradient boost (XGB); hyper-parameter tuning; heart disease; random forest classifier; stacking ensemble technique;
D O I
10.1109/ACCESS.2024.3350996
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular diseases (CVDs) continue to be a prominent cause of global mortality, necessitating the development of effective risk prediction models to combat the rise in heart disease (HD) mortality rates. This work presents a novel dual-stage stacked machine learning (ML) based computational risk prediction model for cardiac disorders. Leveraging a dataset that includes eleven significant characteristics from 1190 patients from five distinct sources, five ML classifiers are utilized to create the initial prediction model. To ensure robustness and generalizability, the classifiers are cross-validated ten times. The model performance is optimized by employing two hyperparameter tuning approaches: RandomizedSearchCV and GridSearchCV. These methods aim to find the optimal estimator values. The highest-performing models, specifically Random Forest, Extreme Gradient Boost, and Decision Tree undergo additional refinement using a stacking ensemble technique. The stacking model, which leverages the capabilities of the three models, attains a remarkable accuracy rate of 96%, a recall value of 0.98, and a ROC-AUC score of 0.96. Notably, the rate of false-negative results is below 1%, demonstrating a high level of accuracy and a non-overfitted model. To evaluate the model's stability and repeatability, a comparable dataset consisting of 1000 occurrences is employed. The model consistently achieves an accuracy of 96.88% under identical experimental settings. This highlights the strength and dependability of the suggested computer model for predicting the risk of cardiac illnesses. The outcomes indicate that employing this two-step stacking ML method shows potential for prompt and precise diagnosis, hence aiding the worldwide endeavor to decrease fatalities caused by heart disease.
引用
收藏
页码:7255 / 7270
页数:16
相关论文
共 50 条
  • [21] An Efficient Dual-Stage Compression Model for Maritime Safety Information Based on BeiDou Short-Message Communication
    Hu, Jiwei
    Hong, Yue
    Jin, Qiwen
    Zhao, Guangpeng
    Lu, Hongyang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (08)
  • [22] An efficient plant disease prediction model based on machine learning and deep learning classifiers
    Shinde, Nirmala
    Ambhaikar, Asha
    EVOLUTIONARY INTELLIGENCE, 2025, 18 (01)
  • [23] A novel solar power prediction model based on stacked BiLSTM deep learning and improved extreme learning machine
    El Bourakadi D.
    Ramadan H.
    Yahyaouy A.
    Boumhidi J.
    International Journal of Information Technology, 2023, 15 (2) : 587 - 594
  • [24] Machine Learning-Based Aviation Meteorological Risk Prediction Model
    Miao, Shaohui
    Du, Jiaxing
    SPIN, 2025,
  • [25] Two-Stage Approaches to Accounting for Patient Heterogeneity in Machine Learning Risk Prediction Models in Oncology
    Oh, Eun Jeong
    Parikh, Ravi B.
    Chivers, Corey
    Chen, Jinbo
    JCO CLINICAL CANCER INFORMATICS, 2021, 5 : 1015 - 1023
  • [26] Machine learning based risk prediction model for asymptomatic individuals who underwent coronary artery calcium score: Comparison with traditional risk prediction approaches
    Han, Donghee
    Kolli, Kranthi K.
    Gransar, Heidi
    Lee, Ji Hyun
    Choi, Su-Yeon
    Chun, Eun Ju
    Han, Hae-Won
    Park, Sung Hak
    Sung, Jidong
    Jung, Hae Ok
    Min, James K.
    Chang, Hyuk-Jae
    JOURNAL OF CARDIOVASCULAR COMPUTED TOMOGRAPHY, 2020, 14 (02) : 168 - 176
  • [27] PREDICTION MODEL OF PATHOGENIC GENE OF CORONARY HEART DISEASE BASED ON MACHINE LEARNING
    Huang, Y. L.
    Sajid, A.
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 122 : 32 - 32
  • [28] Efficient daily solar radiation prediction with deep learning 4-phase convolutional neural network, dual stage stacked regression and support vector machine CNN-REGST hybrid model
    Ghimire, Sujan
    -Huy, Thong Nguyen
    Deo, Ravinesh C.
    Casillas-Perez, David
    Salcedo-Sanz, Sancho
    SUSTAINABLE MATERIALS AND TECHNOLOGIES, 2022, 32
  • [29] Cervical Cancer Risk Prediction Model and Analysis of Risk Factors based on Machine Learning
    Yang, Wenying
    Gou, Xin
    Xu, Tongqing
    Yi, Xiping
    Jiang, Maohong
    ICBBT 2019: 2019 11TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL TECHNOLOGY, 2019, : 50 - 54
  • [30] Heart Diseases Prediction for Optimization based Feature Selection and Classification using Machine Learning Methods
    Rajinikanth, N.
    Pavithra, L.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 636 - 643