An ensemble data mining approach to discover medical patterns and provide a system to predict the mortality in the ICU of cardiac surgery based on stacking machine learning method

被引:8
|
作者
Ghavidel, Arman [1 ]
Ghousi, Rouzbeh [1 ]
Atashi, Alireza [2 ,3 ]
机构
[1] Iran Univ Sci & Technol, Sch Ind Engn, Tehran 1684613114, Iran
[2] Univ Tehran Med Sci, Hlth Dept, Virtual Sch, Tehran, Iran
[3] ACECR, Motamed Canc Inst, Breast Canc Res Ctr, Canc Informat Res Grp,Clin Res Dept, Tehran, Iran
关键词
Classification; stacking ensemble method; heart surgery; unbalanced data problem; hybrid predictive model; machine learning in healthcare; resampling method; edited-nearest-neighbor; nonparametric test; INTENSIVE-CARE UNITS; HOSPITAL MORTALITY; RISK PREDICTION; CLASSIFICATION; PERFORMANCE; ALGORITHMS;
D O I
10.1080/21681163.2022.2063189
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The most effective approach to reduce disease mortality is to diagnose it as soon as possible. As a result, data mining by applying machine learning in the field of diseases provides good opportunities to examine the hidden patterns of this collection. An exact forecast of the mortality after heart surgery will cause successful medical treatment and fewer costs. This research wants to recommend a new stacking predictive model after utilising the random forest feature importance method to foresee the mortality after heart surgery on a highly unbalanced dataset by using the most practical features. To solve the unbalanced data problem, a combination of the SVM-SMOTE over-sampling algorithm and the Edited-Nearest-Neighbour under-sampling algorithm is used. This research compares the introduced model with some other machine learning classifiers to ensure efficiency through shuffle hold-out and 10-fold cross-validation strategies. In order to validate the performance of the implemented machine learning methods in this research, both shuffle hold-out, and 10-fold cross-validation results indicated that our model had the highest efficiency compared to the other models. Furthermore, the Friedman statistical test is applied to survey the differences between models. The result demonstrates that the introduced stacking model reached the most accurate predicting performance.
引用
收藏
页码:1316 / 1326
页数:11
相关论文
共 13 条
  • [1] An ensemble machine learning approach to predict postoperative mortality in older patients undergoing emergency surgery
    Sang-Wook Lee
    Eun-Ho Lee
    In-Cheol Choi
    BMC Geriatrics, 23
  • [2] An ensemble machine learning approach to predict postoperative mortality in older patients undergoing emergency surgery
    Lee, Sang-Wook
    Lee, Eun-Ho
    Choi, In-Cheol
    BMC GERIATRICS, 2023, 23 (01)
  • [3] Machine-learning model to predict the cause of death using a stacking ensemble method for observational data
    Kim, Chungsoo
    You, Seng Chan
    Reps, Jenna M.
    Cheong, Jae Youn
    Park, Rae Woong
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (06) : 1098 - 1107
  • [4] Implementation of Real-Time Medical and Health Data Mining System Based on Machine Learning
    Wang, Pengyuan
    Li, Jie
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [5] Implementation of Real-Time Medical and Health Data Mining System Based on Machine Learning
    Wang, Pengyuan
    Li, Jie
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [6] A General Data Mining Methodology Based on a Weighted Hierarchical Adaptive Voting Ensemble (WHAVE) Machine Learning Method
    Deng, Clemen
    Perkowski, Marek
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2017, 28 (4-5) : 409 - 427
  • [7] A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
    Meng, Linxi
    Treem, Will
    Heap, Graham A.
    Chen, Jingjing
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [8] A stacking ensemble machine learning model to predict alpha-1 antitrypsin deficiency-associated liver disease clinical outcomes based on UK Biobank data
    Linxi Meng
    Will Treem
    Graham A. Heap
    Jingjing Chen
    Scientific Reports, 12
  • [9] Electronic Medical Record-Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation
    Kwon, Osung
    Na, Wonjun
    Kang, Heejun
    Jun, Tae Joon
    Kweon, Jihoon
    Park, Gyung-Min
    Cho, YongHyun
    Hur, Cinyoung
    Chae, Jungwoo
    Kang, Do-Yoon
    Lee, Pil Hyung
    Ahn, Jung-Min
    Park, Duk-Woo
    Kang, Soo-Jin
    Lee, Seung-Whan
    Lee, Cheol Whan
    Park, Seong-Wook
    Park, Seung-Jung
    Yang, Dong Hyun
    Kim, Young-Hak
    JMIR MEDICAL INFORMATICS, 2022, 10 (05)
  • [10] Improved pore structure prediction based on MICP with a data mining and machine learning system approach in Mesozoic strata of Gaoqing field, Jiyang depression
    Wang, Xidong
    Yang, Shaochun
    Zhao, Yongfu
    Wang, Ya
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2018, 171 : 362 - 393