Stroke Dataset Modeling: Comparative Study of Machine Learning Classification Methods

被引:1
|
作者
Kitova, Kalina [1 ]
Ivanov, Ivan [1 ]
Hooper, Vincent [2 ]
机构
[1] Sofia Univ St Kl Ohridski, Fac Econ & Business Adm, Sofia 1113, Bulgaria
[2] Dubai Int Acad City, SP Jain Sch Global Management, POB 502345, Dubai, U Arab Emirates
关键词
stroke prediction; machine learning modeling; classification models; imbalanced dataset;
D O I
10.3390/a17120571
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stroke prediction is a vital research area due to its significant implications for public health. This comparative study offers a detailed evaluation of algorithmic methodologies and outcomes from three recent prominent studies on stroke prediction. Ivanov et al. tackled issues of imbalanced datasets and algorithmic bias using deep learning techniques, achieving notable results with a 98% accuracy and a 97% recall rate. They utilized resampling methods to balance the classes and advanced imputation techniques to handle missing data, underscoring the critical role of data preprocessing in enhancing the performance of Support Vector Machines (SVMs). Hassan et al. addressed missing data and class imbalance using multiple imputations and the Synthetic Minority Oversampling Technique (SMOTE). They developed a Dense Stacking Ensemble (DSE) model with over 96% accuracy. Their results underscore the efficiency of ensemble learning techniques and imputation for handling imbalanced datasets in stroke prediction. Bathla et al. employed various classifiers and feature selection techniques, including SMOTE, for class balancing. Their Random Forest (RF) classifier, combined with Feature Importance (FI) selection, achieved an accuracy of 97.17%, illustrating the positive impact of RF and relevant feature selection on model performance. A comparative analysis indicated that Ivanov et al.'s method achieved the highest accuracy rate. However, the studies collectively highlight that the choice of models and techniques for stroke prediction should be tailored to the specific characteristics of the dataset used. This study emphasizes the importance of effective data management and model selection in enhancing predictive performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Comparative evaluation of machine learning classifiers with Obesity dataset
    Ramya, A.
    Rohini, K.
    2021 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS 2021), 2021, : 38 - 41
  • [32] A Comprehensive Study of Machine Learning Methods on Diabetic Retinopathy Classification
    Gurcan, Omer Faruk
    Beyca, Omer Faruk
    Dogan, Onur
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 1132 - 1141
  • [33] Comparative analysis of feature representations and machine learning methods in Android family classification
    Bai, Yude
    Xing, Zhenchang
    Ma, Duoyuan
    Li, Xiaohong
    Feng, Zhiyong
    COMPUTER NETWORKS, 2021, 184
  • [34] Comparative Analysis of Data Preprocessing Methods in Machine Learning for Breast Cancer Classification
    Stockton, Timothy
    Peddle, Brandon
    Gaulin, Angelica
    Wiechert, Emma
    Lu, Wei
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 3, AINA 2024, 2024, 201 : 268 - 279
  • [35] Modeling and Classification of Alluvial Fans with DEMs and Machine Learning Methods: A Case Study of Slovenian Torrential Fans
    Babic, Matej
    Petrovic, Dusan
    Sodnik, Jost
    Soldo, Bozo
    Komac, Marko
    Chernieva, Olena
    Kovacic, Miha
    Mikos, Matjaz
    Cali, Michele
    REMOTE SENSING, 2021, 13 (09)
  • [36] Machine Learning Modeling of Disease Treatment Default: A Comparative Analysis of Classification Models
    Owusu-Adjei, Michael
    Hayfron-Acquah, James Ben
    Twum, Frimpong
    Abdul-Salaam, Gaddafi
    ADVANCES IN PUBLIC HEALTH, 2023, 2023
  • [37] Comparison of Different Machine Learning Methods on Wisconsin Dataset
    Ivancakova, Juliana
    Babic, Frantisek
    Butka, Peter
    2018 IEEE 16TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2018): DEDICATED TO THE MEMORY OF PIONEER OF ROBOTICS ANTAL (TONY) K. BEJCZY, 2018, : 173 - 177
  • [38] Comparative Analysis of Classical Machine Learning and Deep Learning Methods for Fruit Image Recognition and Classification
    Salim, Nareen O. M.
    Mohammed, Ahmed Khorsheed
    TRAITEMENT DU SIGNAL, 2024, 41 (03) : 1331 - 1343
  • [39] Comparative study of machine learning methods to classify bowel polyps
    Cincar, Kristijan
    Ivascu, Todor
    Negru, Viorel
    2023 25TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC 2023, 2023, : 279 - 286
  • [40] A comparative study of machine learning methods for gas hydrate identification
    Tian, Dongmei
    Yang, Shengxiong
    Gong, Yuehua
    Geng, Minghui
    Li, Yuanheng
    Hu, Guang
    GEOENERGY SCIENCE AND ENGINEERING, 2023, 223