Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning

被引:0
|
作者
Shen, Meiyue [1 ]
Zhang, Yine [2 ]
Zhan, Runqing [3 ]
Du, Tingwei [1 ]
Shen, Peixuan [1 ]
Lu, Xiaochuan [1 ]
Liu, Shengnan [1 ,2 ,3 ]
Guo, Rongrong [1 ]
Shen, Xiaoli [1 ]
机构
[1] Qingdao Univ, Sch Publ Hlth, Dept Epidemiol & Hlth Stat, 308 Ningxia Rd, Qingdao 266071, Peoples R China
[2] Ningxia Ctr Dis Control & Prevent, Yinchuan, Peoples R China
[3] Qingdao Haici Hosp, Qingdao 266033, Peoples R China
关键词
Cardiovascular disease; Heavy metals; Machine learning; Random forest; AdaBoost; Partial dependence plot; MECHANISMS; CADMIUM; METAANALYSIS; HYPERTENSION; BIOMARKERS; TUNGSTEN; BARIUM; HEALTH;
D O I
10.1016/j.ecoenv.2024.117570
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Machine learning exhibits excellent performance in terms of predictive power. We aimed to construct an interpretable machine learning model utilizing National Health and Nutrition Examination Survey data to investigate the relationship between heavy metal exposure and cardiovascular disease (CVD). A total of 4600 adults were included in the analysis. The Least Absolute Shrinkage and Selection Operator regression method was employed to select relevant feature variables. Subsequently, six machine learning models were constructed, including random forest, decision tree, gradient boosting decision tree, k-nearest neighbor, support vector machine, and AdaBoost algorithms. Feature importance analysis, partial dependence plot, and shapley additive explanations were integrated to enhance the interpretability of the CVD prediction model. Among all models, the random forest exhibited the best performance, with an accuracy of 90 %, an area under the curve of 0.85, and an F1 score of 0.86. Urine cadmium (Cd), blood lead (Pb), urine thallium (Tl), and urine tungsten (W) were identified as the most significant predictors of CVD, with importance scores of 0.062, 0.057, 0.051, and 0.050, respectively. At the overall level, higher levels of urine Cd, blood Pb, and urine W were associated with an increased risk of CVD, whereas a lower level of urine Tl was linked to a reduced CVD risk. Additionally, the analysis of synergistic effects revealed that Cd was the predominant determinant of CVD risk. The random forestbased CVD prediction model demonstrated excellent predictive power and provided valuable insights for personalized patient care and optimal resource allocation in populations exposed to heavy metals.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Interpretable machine learning for predicting chronic kidney disease progression risk
    Zheng, Jin-Xin
    Li, Xin
    Zhu, Jiang
    Guan, Shi-Yang
    Zhang, Shun-Xian
    Wang, Wei-Ming
    DIGITAL HEALTH, 2024, 10
  • [2] Predicting risk of obesity in overweight adults using interpretable machine learning algorithms
    Lin, Wei
    Shi, Songchang
    Huang, Huibin
    Wen, Junping
    Chen, Gang
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [3] Predicting systemic financial risk with interpretable machine learning
    Tang, Pan
    Tang, Tiantian
    Lu, Chennuo
    NORTH AMERICAN JOURNAL OF ECONOMICS AND FINANCE, 2024, 71
  • [4] Cardiovascular Risk Assessment: An Interpretable Machine Learning Approach
    Paredes, S.
    Rocha, T.
    de Carvalho, P.
    Roseiro, I.
    Henriques, J.
    Sousa, J.
    INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS 2022, ICBHI 2022, 2024, 108 : 95 - 103
  • [5] Predicting Alzheimer's Disease with Interpretable Machine Learning
    Jia, Maoni
    Wu, Yafei
    Xiang, Chaoyi
    Fang, Ya
    DEMENTIA AND GERIATRIC COGNITIVE DISORDERS, 2023, 52 (04) : 249 - 257
  • [6] Predicting Australian Adults at High Risk of Cardiovascular Disease Mortality Using Standard Risk Factors and Machine Learning
    Sajeev, Shelda
    Champion, Stephanie
    Beleigoli, Alline
    Chew, Derek
    Reed, Richard L.
    Magliano, Dianna J.
    Shaw, Jonathan E.
    Milne, Roger L.
    Appleton, Sarah
    Gill, Tiffany K.
    Maeder, Anthony
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (06) : 1 - 14
  • [7] Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
    Alexandros C. Dimopoulos
    Mara Nikolaidou
    Francisco Félix Caballero
    Worrawat Engchuan
    Albert Sanchez-Niubo
    Holger Arndt
    José Luis Ayuso-Mateos
    Josep Maria Haro
    Somnath Chatterji
    Ekavi N. Georgousopoulou
    Christos Pitsavos
    Demosthenes B. Panagiotakos
    BMC Medical Research Methodology, 18
  • [8] Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk
    Dimopoulos, Alexandros C.
    Nikolaidou, Mara
    Felix Caballero, Francisco
    Engchuan, Worrawat
    Sanchez-Niubo, Albert
    Arndt, Holger
    Luis Ayuso-Mateos, Jose
    Maria Haro, Josep
    Chatterji, Somnath
    Georgousopoulou, Ekavi N.
    Pitsavos, Christos
    Panagiotakos, Demosthenes B.
    BMC MEDICAL RESEARCH METHODOLOGY, 2018, 18
  • [9] A Machine Learning-Based Model for Predicting the Risk of Cardiovascular Disease
    Hsiao, Chiu-Han
    Yu, Po-Chun
    Hsieh, Chia-Ying
    Zhong, Bing-Zi
    Tsai, Yu-Ling
    Cheng, Hao-min
    Chang, Wei-Lun
    Lin, Frank Yeong-Sung
    Huang, Yennun
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 1, 2022, 449 : 364 - 374
  • [10] Cardiovascular Risk on Workers Exposed to Heavy Metals, Pb, and Cd
    Fat, Ligia
    Negoescu, Radu
    EPIDEMIOLOGY, 2011, 22 (01) : S153 - S153