Machine Learning Approach to Metabolomic Data Predicts Type 2 Diabetes Mellitus Incidence

被引:3
|
作者
Leiherer, Andreas [1 ,2 ,3 ]
Muendlein, Axel [1 ]
Mink, Sylvia [2 ,3 ]
Mader, Arthur [1 ,4 ]
Saely, Christoph H. [1 ,3 ,4 ]
Festa, Andreas [1 ]
Fraunberger, Peter [2 ,3 ]
Drexel, Heinz [1 ,3 ,5 ,6 ]
机构
[1] Vorarlberg Inst Vasc Invest & Treatment VIVIT, A-6800 Feldkirch, Austria
[2] Cent Med Labs, A-6800 Feldkirch, Austria
[3] Private Univ Principal Liechtenstein, Fac Med Sci, FL-9495 Triesen, Liechtenstein
[4] Acad Teaching Hosp Feldkirch, Dept Internal Med 3, A-6800 Feldkirch, Austria
[5] Acad Teaching Hosp Feldkirch, Vorarlberger Landeskrankenhausbetriebsgesell, A-6800 Feldkirch, Austria
[6] Drexel Univ, Coll Med, Philadelphia, PA 19129 USA
关键词
ML; machine learning; artificial intelligence; diabetes; incidence; metabolomics; support vector machine; accuracy; CERAMIDES; MODEL;
D O I
10.3390/ijms25105331
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Metabolomics, with its wealth of data, offers a valuable avenue for enhancing predictions and decision-making in diabetes. This observational study aimed to leverage machine learning (ML) algorithms to predict the 4-year risk of developing type 2 diabetes mellitus (T2DM) using targeted quantitative metabolomics data. A cohort of 279 cardiovascular risk patients who underwent coronary angiography and who were initially free of T2DM according to American Diabetes Association (ADA) criteria was analyzed at baseline, including anthropometric data and targeted metabolomics, using liquid chromatography (LC)-mass spectroscopy (MS) and flow injection analysis (FIA)-MS, respectively. All patients were followed for four years. During this time, 11.5% of the patients developed T2DM. After data preprocessing, 362 variables were used for ML, employing the Caret package in R. The dataset was divided into training and test sets (75:25 ratio) and we used an oversampling approach to address the classifier imbalance of T2DM incidence. After an additional recursive feature elimination step, identifying a set of 77 variables that were the most valuable for model generation, a Support Vector Machine (SVM) model with a linear kernel demonstrated the most promising predictive capabilities, exhibiting an F1 score of 50%, a specificity of 93%, and balanced and unbalanced accuracies of 72% and 88%, respectively. The top-ranked features were bile acids, ceramides, amino acids, and hexoses, whereas anthropometric features such as age, sex, waist circumference, or body mass index had no contribution. In conclusion, ML analysis of metabolomics data is a promising tool for identifying individuals at risk of developing T2DM and opens avenues for personalized and early intervention strategies.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A Deep Reinforcement Learning Approach for Type 2 Diabetes Mellitus Treatment
    Liu, Zhuo
    Ji, Linong
    Jiang, Xuehan
    Zhao, Wei
    Liao, Xiyang
    Zhao, Tingting
    Liu, Siqi
    Sun, Xingzhi
    Hu, Gang
    Feng, Mengling
    Xie, Guotong
    2020 8TH IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2020), 2020, : 1 - 9
  • [22] Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches
    Ganie, Shahid Mohammad
    Malik, Majid Bashir
    Arif, Tasleem
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2022, 21 (01) : 339 - 352
  • [23] Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches
    Shahid Mohammad Ganie
    Majid Bashir Malik
    Tasleem Arif
    Journal of Diabetes & Metabolic Disorders, 2022, 21 : 339 - 352
  • [24] Serum Ferritin Independently Predicts the Incidence of Chronic Kidney Disease in Patients with Type 2 Diabetes Mellitus
    Wu, Yun Hong
    Wang, Su Yuan
    Li, Ming Xia
    He, Hua
    Yin, Wei Jin
    Guo, Yan Hong
    Zhang, Hui Qin
    Sun, Zeng Mei
    Zhang, Dan
    Wang, Xi
    Sun, Shu Yao
    Tang, Shu Xi
    Du, Rong
    Zhang, Cheng Hui
    DIABETES METABOLIC SYNDROME AND OBESITY-TARGETS AND THERAPY, 2020, 13 : 99 - 105
  • [25] The cost of Type II Diabetes Mellitus: A Machine Learning Perspective
    Sheikhi, G.
    Altincay, H.
    XIV MEDITERRANEAN CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING AND COMPUTING 2016, 2016, 57 : 818 - 821
  • [26] Metabolomic Signature of Coronary Artery Disease in Type 2 Diabetes Mellitus
    Stratmann, Bernd
    Richter, Katrin
    Wang, Ruichao
    Yu, Zhonghao
    Xu, Tao
    Prehn, Cornelia
    Adamski, Jerzy
    Illig, Thomas
    Tschoepe, Diethelm
    Wang-Sattler, Rui
    INTERNATIONAL JOURNAL OF ENDOCRINOLOGY, 2017, 2017
  • [27] Antiosteoporotic Drugs and Incidence of Type 2 Diabetes Mellitus
    Ippei Kanazawa
    Calcified Tissue International, 2012, 90 : 163 - 164
  • [28] Onychomycosis Incidence in Type 2 Diabetes Mellitus Patients
    Patricia Manzano-Gayosso
    Francisca Hernández-Hernández
    Luis Javier Méndez-Tovar
    Yanni Palacios-Morales
    Erika Córdova-Martínez
    Elva Bazán-Mora
    Rubén López-Martinez
    Mycopathologia, 2008, 166 : 41 - 45
  • [29] Impact of type 2 diabetes mellitus on the incidence of malaria
    Raghunath, Pendru
    JOURNAL OF INFECTION AND PUBLIC HEALTH, 2017, 10 (03) : 357 - 358
  • [30] Onychomycosis incidence in type 2 diabetes mellitus patients
    Manzano-Gayosso, Patricia
    Hernandez-Hernandez, Francisca
    Mendez-Tovar, Luis Javier
    Palacios-Morales, Yanni
    Cordova-Martinez, Erika
    Bazan-Mora, Elva
    Lopez-Martinez, Ruben
    MYCOPATHOLOGIA, 2008, 166 (01) : 41 - 45