An ensemble data mining approach to discover medical patterns and provide a system to predict the mortality in the ICU of cardiac surgery based on stacking machine learning method

被引:8
|
作者
Ghavidel, Arman [1 ]
Ghousi, Rouzbeh [1 ]
Atashi, Alireza [2 ,3 ]
机构
[1] Iran Univ Sci & Technol, Sch Ind Engn, Tehran 1684613114, Iran
[2] Univ Tehran Med Sci, Hlth Dept, Virtual Sch, Tehran, Iran
[3] ACECR, Motamed Canc Inst, Breast Canc Res Ctr, Canc Informat Res Grp,Clin Res Dept, Tehran, Iran
关键词
Classification; stacking ensemble method; heart surgery; unbalanced data problem; hybrid predictive model; machine learning in healthcare; resampling method; edited-nearest-neighbor; nonparametric test; INTENSIVE-CARE UNITS; HOSPITAL MORTALITY; RISK PREDICTION; CLASSIFICATION; PERFORMANCE; ALGORITHMS;
D O I
10.1080/21681163.2022.2063189
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The most effective approach to reduce disease mortality is to diagnose it as soon as possible. As a result, data mining by applying machine learning in the field of diseases provides good opportunities to examine the hidden patterns of this collection. An exact forecast of the mortality after heart surgery will cause successful medical treatment and fewer costs. This research wants to recommend a new stacking predictive model after utilising the random forest feature importance method to foresee the mortality after heart surgery on a highly unbalanced dataset by using the most practical features. To solve the unbalanced data problem, a combination of the SVM-SMOTE over-sampling algorithm and the Edited-Nearest-Neighbour under-sampling algorithm is used. This research compares the introduced model with some other machine learning classifiers to ensure efficiency through shuffle hold-out and 10-fold cross-validation strategies. In order to validate the performance of the implemented machine learning methods in this research, both shuffle hold-out, and 10-fold cross-validation results indicated that our model had the highest efficiency compared to the other models. Furthermore, the Friedman statistical test is applied to survey the differences between models. The result demonstrates that the introduced stacking model reached the most accurate predicting performance.
引用
收藏
页码:1316 / 1326
页数:11
相关论文
共 13 条
  • [11] Electronic Medical Record-Based Machine Learning Approach to Predict the Risk of 30-Day Major Adverse Cardiac Event After Invasive Coronary Treatment
    Kwon, Osung
    Na, Wonjun
    Yang, Hyeonkyeong
    Kweon, Jihoon
    Yang, Dong Hyun
    Chae, Jungwoo
    Hur, Cinyoung
    Cho, Yonghyun
    Kim, Young-Hak
    CIRCULATION, 2019, 140
  • [12] A Novel Preoperative Scoring System to Accurately Predict Cord-Level Intraoperative Neuromonitoring Data Loss During Spinal Deformity Surgery: A Machine-Learning Approach
    Lee, Nathan J.
    Lenke, Lawrence G.
    Arvind, Varun
    Shi, Ted
    Dionne, Alexandra C.
    Nnake, Chidebelum
    Yeary, Mitchell
    Fields, Michael
    Simhon, Matt
    Ferraro, Anastasia
    Cooney, Matthew
    Lewerenz, Erik
    Reyes, Justin L.
    Roth, Steven G.
    Hung, Chun Wai
    Scheer, Justin K.
    Zervos, Thomas
    Thuet, Earl D.
    Lombardi, Joseph M.
    Sardar, Zeeshan M.
    Lehman, Ronald A.
    Roye, Benjamin D.
    Vitale, Michael G.
    Hassan, Fthimnir M.
    JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 2025, 107 (03): : 237 - 248
  • [13] Prediction Model for 30-Day Mortality after Non-Cardiac Surgery Using Machine-Learning Techniques Based on Preoperative Evaluation of Electronic Medical Records
    Choi, Byungjin
    Oh, Ah Ran
    Lee, Seung-Hwa
    Lee, Dong Yun
    Lee, Jong-Hwan
    Yang, Kwangmo
    Kim, Ha Yeon
    Park, Rae Woong
    Park, Jungchan
    JOURNAL OF CLINICAL MEDICINE, 2022, 11 (21)