Machine Learning Approach to Metabolomic Data Predicts Type 2 Diabetes Mellitus Incidence

被引:3
|
作者
Leiherer, Andreas [1 ,2 ,3 ]
Muendlein, Axel [1 ]
Mink, Sylvia [2 ,3 ]
Mader, Arthur [1 ,4 ]
Saely, Christoph H. [1 ,3 ,4 ]
Festa, Andreas [1 ]
Fraunberger, Peter [2 ,3 ]
Drexel, Heinz [1 ,3 ,5 ,6 ]
机构
[1] Vorarlberg Inst Vasc Invest & Treatment VIVIT, A-6800 Feldkirch, Austria
[2] Cent Med Labs, A-6800 Feldkirch, Austria
[3] Private Univ Principal Liechtenstein, Fac Med Sci, FL-9495 Triesen, Liechtenstein
[4] Acad Teaching Hosp Feldkirch, Dept Internal Med 3, A-6800 Feldkirch, Austria
[5] Acad Teaching Hosp Feldkirch, Vorarlberger Landeskrankenhausbetriebsgesell, A-6800 Feldkirch, Austria
[6] Drexel Univ, Coll Med, Philadelphia, PA 19129 USA
关键词
ML; machine learning; artificial intelligence; diabetes; incidence; metabolomics; support vector machine; accuracy; CERAMIDES; MODEL;
D O I
10.3390/ijms25105331
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Metabolomics, with its wealth of data, offers a valuable avenue for enhancing predictions and decision-making in diabetes. This observational study aimed to leverage machine learning (ML) algorithms to predict the 4-year risk of developing type 2 diabetes mellitus (T2DM) using targeted quantitative metabolomics data. A cohort of 279 cardiovascular risk patients who underwent coronary angiography and who were initially free of T2DM according to American Diabetes Association (ADA) criteria was analyzed at baseline, including anthropometric data and targeted metabolomics, using liquid chromatography (LC)-mass spectroscopy (MS) and flow injection analysis (FIA)-MS, respectively. All patients were followed for four years. During this time, 11.5% of the patients developed T2DM. After data preprocessing, 362 variables were used for ML, employing the Caret package in R. The dataset was divided into training and test sets (75:25 ratio) and we used an oversampling approach to address the classifier imbalance of T2DM incidence. After an additional recursive feature elimination step, identifying a set of 77 variables that were the most valuable for model generation, a Support Vector Machine (SVM) model with a linear kernel demonstrated the most promising predictive capabilities, exhibiting an F1 score of 50%, a specificity of 93%, and balanced and unbalanced accuracies of 72% and 88%, respectively. The top-ranked features were bile acids, ceramides, amino acids, and hexoses, whereas anthropometric features such as age, sex, waist circumference, or body mass index had no contribution. In conclusion, ML analysis of metabolomics data is a promising tool for identifying individuals at risk of developing T2DM and opens avenues for personalized and early intervention strategies.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Adipocyte size predicts incidence of type 2 diabetes in women
    Lonn, Malin
    Mehlig, Kirsten
    Bengtsson, Calle
    Lissner, Lauren
    FASEB JOURNAL, 2010, 24 (01): : 326 - 331
  • [42] Plasma aldosterone concentration predicts the incidence of diabetes mellitus
    Chang, Tien-Jyun
    Li, Hung-Yuan
    Yu, Tse-Ya
    Wei, Jung-Nan
    Chuang, Lee-Ming
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2016, 120 : S201 - S201
  • [43] Machine-Learning Approach Reveals Microbiome, Metabolome, Lipidome, and Their Interaction in Type 1 Diabetes Mellitus
    Tan, Huiling
    Shi, Yu
    Yue, Tong
    Zheng, Dongxue
    Wang, Chaofan
    Liu, Ziyu
    Yang, Daizhi
    Ding, Yu
    Xu, Wen
    Yan, Jinhua
    Luo, Sihui
    Weng, Jianping
    Zheng, Xueying
    DIABETES, 2023, 72
  • [44] A New Metabolomic Signature in Type-2 Diabetes Mellitus and Its Pathophysiology
    Padberg, Inken
    Peter, Erik
    Gonzalez-Maldonado, Sandra
    Witt, Henning
    Mueller, Matthias
    Weis, Tanja
    Bethan, Bianca
    Liebenberg, Volker
    Wiemer, Jan
    Katus, Hugo A.
    Rein, Dietrich
    Schatz, Philipp
    PLOS ONE, 2014, 9 (01):
  • [45] Accuracy of Machine Learning Classification Models for the Prediction of Type 2 Diabetes Mellitus: A Systematic Survey and Meta-Analysis Approach
    Olusanya, Micheal O.
    Ogunsakin, Ropo Ebenezer
    Ghai, Meenu
    Adeleke, Matthew Adekunle
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (21)
  • [46] Application of machine learning tools: Potential and useful approach for the prediction of type 2 diabetes mellitus based on the gut microbiome profile
    Ge, Xiaochun
    Zhang, Aimin
    Li, Lihui
    Sun, Qitian
    He, Jianqiu
    Wu, Yu
    Tan, Rundong
    Pan, Yingxia
    Zhao, Jiangman
    Xu, Yue
    Tang, Hui
    Gao, Yu
    EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2022, 23 (04)
  • [47] A machine learning method for selection of genetic variants to increase prediction accuracy of type 2 diabetes mellitus using sequencing data
    Jung, Luann C.
    Wang, Haiyan
    Li, Xukun
    Wu, Cen
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (03) : 261 - 281
  • [48] Optimising test intervals for individuals with type 2 diabetes: A machine learning approach
    Pedersen, Sasja Maria
    Damslund, Nicolai
    Kjaer, Trine
    Olsen, Kim Rose
    PLOS ONE, 2025, 20 (02):
  • [49] Bio-Inspired Machine Learning Approach to Type 2 Diabetes Detection
    Al-Tawil, Marwan
    Mahafzah, Basel A.
    Al Tawil, Arar
    Aljarah, Ibrahim
    SYMMETRY-BASEL, 2023, 15 (03):
  • [50] Serum cystatin C and the incidence of type 2 diabetes mellitus
    K. Sahakyan
    K. E. Lee
    A. Shankar
    R. Klein
    Diabetologia, 2011, 54 : 1335 - 1340