Using Machine Learning for Detection and Prediction of Chronic Diseases

被引:0
|
作者
Yanes, Nacim [1 ,2 ]
Jamel, Leila [3 ]
Alabdullah, Bayan [3 ]
Ezz, Mohamed [4 ]
Mohamed Mostafa, Ayman [4 ]
Shabana, Hossameldeen [5 ]
机构
[1] Manouba Univ, RIADI Lab, Manouba 2010, Tunisia
[2] Gabes Univ, Higher Inst Management Gabes, Gabes 6033, Tunisia
[3] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, POB 84428, Riyadh 11671, Saudi Arabia
[4] Jouf Univ, Coll Comp & Informat Sci, Sakaka 72388, Saudi Arabia
[5] Shaqra Univ, Coll Med, Shaqra 11961, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Predictive models; Accuracy; Cardiac arrest; Diseases; Heart; Medical services; Data models; Prediction algorithms; Classification algorithms; Tuning; Heart attack prediction; ensemble model; chronic diseases; class imbalance; ML classifiers; model transparency;
D O I
10.1109/ACCESS.2024.3494839
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heart attacks are a leading cause of mortality worldwide, necessitating the development of accurate predictive models to enhance early detection and intervention strategies. This study addresses the significant problem of class imbalance in medical datasets, specifically focusing on heart attack prediction using the Behavioral Risk Factor Surveillance System (BRFSS) dataset. To tackle this challenge, advanced machine learning (ML) methods are proposed to involve a refined dataset of 399,875 instances, with 47 significant features maintained through rigorous data cleaning and preparation. Balanced accuracy and macro-recall were chosen as primary metrics to ensure fair performance evaluation across classes in the imbalanced dataset. Our proposed system entails a detailed evaluation of various algorithms known for their effectiveness in managing class imbalance. The LGBM Classifier, XGB Classifier, and Logistic Regression (LR) are optimized using recursive feature elimination and hyperparameter tuning with Optuna. The results of this study are encapsulated in an ensemble model that significantly enhances predictive accuracy. The final model achieved 80.75% balanced accuracy and 79.97% recall for critical heart attack cases (class 1), along with an AUC score of 88.9%, indicating superior class distinction capability. Additionally, the application of SHAP (SHapley Additive exPlanations) analysis provided valuable insights into the contribution of each feature to heart attack likelihood, thus improving model transparency. This study's successful integration of complex ML techniques with interpretability analyses like SHAP marks a substantial advance in early detection and intervention strategies in healthcare. It demonstrates the potential of sophisticated ML approaches for early heart attack detection and prevention, highlighting their value in improving outcomes for patients with chronic diseases. These findings suggest promising pathways for employing advanced analytical tools in healthcare to enhance patient care.
引用
收藏
页码:177674 / 177691
页数:18
相关论文
共 50 条
  • [21] Analytical Approach towards Prediction of Diseases Using Machine Learning Algorithms
    Grover, Ayushi
    Kalani, Anukriti
    Dubey, Sanjay Kumar
    PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 793 - 797
  • [22] Prediction of persistent chronic cough in patients with chronic cough using machine learning
    Chen, Wansu
    Schatz, Michael
    Zhou, Yichen
    Xie, Fagen
    Bali, Vishal
    Das, Amar
    Schelfhout, Jonathan
    Stern, Julie A.
    Zeiger, Robert S.
    ERJ OPEN RESEARCH, 2023, 9 (02)
  • [23] Prediction Model for Health-Related Quality of Life of Elderly with Chronic Diseases using Machine Learning Techniques
    Lee, Soo-Kyoung
    Son, Youn-Jung
    Kim, Jeongeun
    Kim, Hong-Gee
    Lee, Jae-Il
    Kang, Bo-Yeong
    Cho, Hyeon-Sung
    Lee, Sungin
    HEALTHCARE INFORMATICS RESEARCH, 2014, 20 (02) : 125 - 134
  • [24] Diabetes Detection and Prediction Using Machine Learning/IoT: A Survey
    Sharma, Neha
    Singh, Ashima
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 471 - 479
  • [25] Prediction of Insurance Fraud Detection using Machine Learning Algorithms
    Rukhsar, Laiqa
    Bangyal, Waqas Haider
    Nisar, Kashif
    Nisar, Sana
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2022, 41 (01) : 33 - 40
  • [26] Corrosion area detection and depth prediction using machine learning
    Son, Eun-Young
    Jeong, Dayeon
    Oh, Min-Jae
    INTERNATIONAL JOURNAL OF NAVAL ARCHITECTURE AND OCEAN ENGINEERING, 2024, 16
  • [27] PCOcare: PCOS Detection and Prediction using Machine Learning Algorithms
    Thakre, Vaidehi
    Vedpathak, Shreyas
    Thakre, Kalpana
    Sonawani, Shilpa
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 240 - 244
  • [28] Air pollution prediction and hotspot detection using machine learning
    Bhatia, Shailee
    Sachdeva, Shelly
    Goswami, Puneet
    JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2022, 25 (07) : 1553 - 1564
  • [29] Detection of Plant Diseases by Machine Learning
    Korkut, Umut Baris
    Gokturk, Omer Berke
    Yildiz, Oktay
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [30] Smart Healthcare: Machine Learning Enabled WBAN for Early Detection of Chronic Diseases
    Kumaran, S.
    Princy, I. Riya Evangeline
    Agnes, J. Princy
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 998 - 1003