Optimizing diabetes classification with a machine learning-based framework

Cited by: 5
Authors
Feng, Xin [1, 2, 3]
Cai, Yihuai [1]
Xin, Ruihao [4, 5, 6]
Affiliations
[1] Jilin Inst Chem Technol, Sch Sci, Jilin 130000, Peoples R China
[2] Jilin Univ, Coll Chem, State Key Lab Inorgan Synth & Preparat Chem, Changchun 130012, Peoples R China
[3] Jilin Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Changchun 130012, Peoples R China
[4] Jilin Inst Chem Technol, Coll Informat & Control Engn, Jilin 130000, Peoples R China
[5] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[6] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130012, Peoples R China
Keywords
Diabetes diagnoses; Machine learning; GAN
DOI
10.1186/s12859-023-05467-x
CLC classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Diabetes is a metabolic disorder usually caused by insufficient secretion of insulin from the pancreas or insensitivity of cells to insulin, resulting in long-term elevated blood sugar levels. Patients usually present with frequent urination, thirst, and hunger. If left untreated, it can lead to complications that affect essential organs and even endanger life. Therefore, developing an intelligent diagnosis framework for diabetes is necessary.
Results: This paper proposes a machine learning-based diabetes classification framework, machine learning optimized GAN. The framework combines several methods to address the challenges encountered during analysis: a joint mean and median filling method for missing values, the cap method for outlier processing, and SMOTEENN to mitigate sample imbalance. It also incorporates the proposed Diabetes Classification Model based on Generative Adversarial Network and uses logistic regression for detailed feature analysis. The effectiveness of the framework is evaluated on both the PIMA dataset and a diabetes dataset obtained from the GEO database. The model achieved a binary classification accuracy of 96.27%, a three-class classification accuracy of 99.31%, a precision and F1 score of 0.9698, a recall of 0.9698, and an AUC of 0.9702.
Conclusion: The experimental results show that the proposed framework can accurately classify diabetes and provide new ideas for intelligent diagnosis of diabetes.
Pages: 20
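The preprocessing steps named in the abstract (mean and median filling of missing values, then capping of outliers) can be illustrated with a minimal sketch. The abstract does not give the paper's exact rules, so everything specific here is an assumption: the choice of the median for skewed features and the mean otherwise, the skewness threshold, the use of Tukey fences as the cap, and the function names `fill_missing` and `cap_outliers` are all hypothetical. The sample values are made up for illustration and are not taken from the PIMA or GEO data.

```python
import statistics


def fill_missing(values, skew_threshold=1.0):
    """Fill None entries with the mean or the median of the observed values.

    Assumed rule (not from the paper): use the median when the feature is
    strongly skewed, otherwise use the mean.
    """
    observed = [v for v in values if v is not None]
    mean = statistics.fmean(observed)
    median = statistics.median(observed)
    stdev = statistics.pstdev(observed)
    # Pearson's second skewness coefficient: 3 * (mean - median) / stdev
    skew = 0.0 if stdev == 0 else 3 * (mean - median) / stdev
    fill = median if abs(skew) > skew_threshold else mean
    return [fill if v is None else v for v in values]


def cap_outliers(values, k=1.5):
    """Clip values to the Tukey fences [Q1 - k*IQR, Q3 + k*IQR].

    Assumed capping rule (not from the paper); extreme values are pulled
    to the nearest fence instead of being dropped.
    """
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [min(max(v, low), high) for v in values]


# Illustrative feature column with one missing value and one extreme value.
glucose = [85, 90, None, 95, 100, 400]
filled = fill_missing(glucose)
capped = cap_outliers(filled)
print(filled)
print(capped)
```

The resampling step is left out to keep the sketch within the standard library. The imbalanced-learn package provides a SMOTEENN implementation (`imblearn.combine.SMOTEENN`) that could be applied to the cleaned features, although the abstract does not say which implementation the authors used.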