Predictive modeling for the development of diabetes mellitus using key factors in various machine learning approaches

被引:6
|
作者
Tanaka, Marenao [1 ,2 ]
Akiyama, Yukinori [3 ]
Mori, Kazuma [4 ]
Hosaka, Itaru [5 ]
Kato, Kenichi [5 ]
Endo, Keisuke [1 ]
Ogawa, Toshifumi [1 ,6 ]
Sato, Tatsuya [1 ,6 ]
Suzuki, Toru [1 ,7 ]
Yano, Toshiyuki [1 ]
Ohnishi, Hirofumi [8 ]
Hanawa, Nagisa [9 ]
Furuhashi, Masato [1 ]
机构
[1] Sapporo Med Univ, Dept Cardiovasc Renal & Metab Med, Sch Med, S-1,W-16,Chuo Ku, Sapporo 0608543, Japan
[2] Tanaka Med Clin, Yoichi, Japan
[3] Sapporo Med Univ, Dept Neurosurg, Sapporo, Japan
[4] Natl Def Med Coll, Dept Immunol & Microbiol, Tokorozawa, Japan
[5] Sapporo Med Univ, Sch Med, Dept Cardiovasc Surg, Sapporo, Japan
[6] Sapporo Med Univ, Dept Cellular Physiol & Signal Transduct, Sch Med, Sapporo, Japan
[7] Natori Toru Internal Med & Diabet Clin, Natori, Japan
[8] Sapporo Med Univ, Sch Med, Dept Publ Hlth, Sapporo, Japan
[9] Keijinkai Maruyama Clin, Dept Hlth Checkup & Promot, Sapporo, Japan
来源
基金
日本学术振兴会;
关键词
Arti ficial intelligence; Machine learning; Diabetes mellitus; Fatty liver index; FATTY LIVER INDEX; RISK;
D O I
10.1016/j.deman.2023.100191
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aims: Machine learning (ML) approaches are beneficial when automatic identification of relevant features among numerous candidates is desired. We investigated the predictive ability of several ML models for new onset of diabetes mellitus. Methods: In 10,248 subjects who received annual health examinations, 58 candidates including fatty liver index (FLI), which is calculated by using waist circumference, body mass index and levels of triglycerides and g-glutamyl transferase, were used. Results: During a 10-year follow-up period (mean period: 6.9 years), 322 subjects (6.5 %) in the training group (70 %, n=7,173) and 127 subjects (6.2 %) in the test group (30 %, n=3,075) had new onset of diabetes mellitus. Hemoglobin A1c, fasting glucose and FLI were identified as the top 3 predictors by random forest feature selection with 10-fold cross-validation. When hemoglobin A1c and FLI were used as the selected features, C-statistics analogous in receiver operating characteristic curve analysis in ML models including logistic regression, naive Bayes, extreme gradient boosting and artificial neural network were 0.874, 0.869, 0.856 and 0.869, respectively. There was no significant difference in the discriminatory capacity among the ML models. Conclusions: ML models incorporating hemoglobin A1c and FLI provide an accurate and straightforward approach for predicting the development of diabetes mellitus. (c) 2023 The Author(s). Published by Elsevier Masson SAS. This is an open access article under the CC BY-NCND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Prediction of type 2 diabetes mellitus using hematological factors based on machine learning approaches: a cohort study analysis
    Mansoori, Amin
    Sahranavard, Toktam
    Hosseini, Zeinab Sadat
    Soflaei, Sara Saffar
    Emrani, Negar
    Nazar, Eisa
    Gharizadeh, Melika
    Khorasanchi, Zahra
    Effati, Sohrab
    Ghamsary, Mark
    Ferns, Gordon
    Esmaily, Habibollah
    Mobarhan, Majid Ghayour
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [22] Integrated Embedded system for detecting diabetes mellitus using various machine learning techniques
    Konda R.
    Ramineni A.
    Jayashree J.
    Singavajhala N.
    Vanka S.A.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [23] Development and rigorous validation of antimalarial predictive models using machine learning approaches
    Danishuddin
    Madhukar, G.
    Malik, M. Z.
    Subbarao, N.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2019, 30 (08) : 543 - 560
  • [24] Predictive Modeling of Volume and Biomass in Pinus pseudostrobus Using Machine Learning and Allometric Approaches
    Antunez, Pablo
    Wehenkel, Christian
    Basave-Villalobos, Erickson
    Calixto-Valencia, Celi Gloria
    Valenzuela-Encinas, Cesar
    Ruiz-Aquino, Faustino
    Sarmiento-Bustos, David
    FOREST SCIENCE AND TECHNOLOGY, 2025,
  • [25] In silico profiling nanoparticles: predictive nanomodeling using universal nanodescriptors and various machine learning approaches
    Yan, Xiliang
    Sedykh, Alexander
    Wang, Wenyi
    Zhao, Xiaoli
    Yan, Bing
    Zhu, Hao
    NANOSCALE, 2019, 11 (17) : 8352 - 8362
  • [26] New Predictive Factors in the Development of Type 2 Diabetes Mellitus
    Gonzalez Hernandez, Maurio
    FINLAY, 2023, 13 (03):
  • [27] Comparative approaches for classification of diabetes mellitus data: Machine learning paradigm
    Maniruzzaman, Md.
    Kumar, Nishith
    Abedin, Md. Menhazul
    Islam, Md. Shaykhul
    Suri, Harman S.
    El-Baz, Ayman S.
    Suri, Jasjit S.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 152 : 23 - 34
  • [28] Exploratory analysis using machine learning of predictive factors for falls in type 2 diabetes
    Yasuhiro Suzuki
    Hiroaki Suzuki
    Tatsuya Ishikawa
    Yasunori Yamada
    Shigeru Yatoh
    Yoko Sugano
    Hitoshi Iwasaki
    Motohiro Sekiya
    Naoya Yahagi
    Yasushi Hada
    Hitoshi Shimano
    Scientific Reports, 12
  • [29] PREDICTIVE FACTORS FOR THE DEVELOPMENT OF DIABETES IN WOMEN WITH PREVIOUS GESTATIONAL DIABETES-MELLITUS
    DAMM, P
    KUHL, C
    BERTELSEN, A
    MOLSTEDPEDERSEN, L
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 1992, 167 (03) : 607 - 616
  • [30] Exploratory analysis using machine learning of predictive factors for falls in type 2 diabetes
    Suzuki, Yasuhiro
    Suzuki, Hiroaki
    Ishikawa, Tatsuya
    Yamada, Yasunori
    Yatoh, Shigeru
    Sugano, Yoko
    Iwasaki, Hitoshi
    Sekiya, Motohiro
    Yahagi, Naoya
    Hada, Yasushi
    Shimano, Hitoshi
    SCIENTIFIC REPORTS, 2022, 12 (01)