Prediction of cardiovascular disease based on multiple feature selection and improved PSO-XGBoost model

被引:0
|
作者
Kerang Cao [1 ]
Chang Liu [2 ]
Siqi Yang [1 ]
Yuxin Zhang [1 ]
Lili Li [1 ]
Hoekyung Jung [3 ]
Shuo Zhang [4 ]
机构
[1] Shenyang University of Chemical Technology,College of Computer Science and Technology
[2] Key Laboratory of Intelligent Technology of Chemical Process Industry in Liaoning Province,Computer Engineering Dept
[3] Shenyang Maternity and Child Health Hospital,undefined
[4] Paichai University,undefined
关键词
Cardiovascular disease; Machine learning; XGBoost algorithm; Multi feature selection; Particle swarm optimization algorithm; Model prediction;
D O I
10.1038/s41598-025-96520-7
中图分类号
学科分类号
摘要
Cardiovascular disease is a common disease that threatens human health. In order to predict it more accurately, this paper proposes a cardiovascular disease prediction model that combines multiple feature selection, improved particle swarm optimization algorithm, and extreme gradient boosting tree. Firstly, the dataset is preprocessed, and an XGBoost cardiovascular disease prediction model is constructed for model training and compare it with other algorithms. Then, combined with two factor Pearson correlation analysis and feature importance ranking, multiple feature selection is performed, with the optimal feature subset as the feature input. Finally, the improved particle swarm optimization algorithm is used to adjust the hyperparameters of the extreme gradient boosting tree algorithm, and selecting the optimal hyperparameter combination to construct the MFS-DLPSO-XGBoost model. The recall, precision, accuracy, F1 score, and area under the ROC curve (AUC) of the MFS-DLPSO-XGBoost model reached 71.4%, 76.3%, 74.7%, 73.6%, and 80.8%, respectively, which increased by 3.6%, 3.2%, 2.7%, 3.2%, and 2.3% compared to XGBoost. The results indicate that the model proposed in this article has good classification performance and can provide assistance for doctors and patients in predicting and preventing heart disease.
引用
收藏
相关论文
共 50 条
  • [21] A Heart Disease Prediction Model Based on Feature Optimization and Smote-Xgboost Algorithm
    Yang, Jian
    Guan, Jinhan
    INFORMATION, 2022, 13 (10)
  • [22] XGBLC: an improved survival prediction model based on XGBoost
    Ma, Baoshan
    Yan, Ge
    Chai, Bingjie
    Hou, Xiaoyu
    BIOINFORMATICS, 2022, 38 (02) : 410 - 418
  • [23] Feature selection algorithm based on XGBoost
    Li Z.
    Liu Z.
    Tongxin Xuebao/Journal on Communications, 2019, 40 (10): : 101 - 108
  • [24] Feature selection with Optimized XGBoost model-based paddy plant leaf disease classification
    Dubey, Ratnesh Kumar
    Choubey, Dilip Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (33) : 80281 - 80281
  • [25] An evolutionary deep learning model based on XGBoost feature selection and Gaussian data augmentation for AQI prediction
    Qian, Shijie
    Peng, Tian
    Tao, Zihan
    Li, Xi
    Nazir, Muhammad Shahzad
    Zhang, Chu
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2024, 191 : 836 - 851
  • [26] Improved PSO-Based Feature Construction Algorithm Using Feature Selection Methods
    Mahanipour, Afsaneh
    Nezamabadi-pour, Hossein
    2017 2ND CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC), 2017, : 1 - 5
  • [27] A Gas Emission Prediction Model Based on Feature Selection and Improved Machine Learning
    Shao, Liangshan
    Zhang, Kun
    PROCESSES, 2023, 11 (03)
  • [28] Integrating Correlation-Based Feature Selection and Clustering for Improved Cardiovascular Disease Diagnosis
    Wosiak, Agnieszka
    Zakrzewska, Danuta
    COMPLEXITY, 2018,
  • [29] Comparing different feature selection algorithms for cardiovascular disease prediction
    Hasan, Najmul
    Bao, Yukun
    HEALTH AND TECHNOLOGY, 2021, 11 (01) : 49 - 62
  • [30] Prediction of Cardiovascular Disease by Feature Selection and Machine Learning Techniques
    Ranade, Aditya
    Pise, Nitin
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 457 - 472