Machine Learning and Data Mining Methods in Diabetes Research

被引:607
|
作者
Kavakiotis, Ioannis [1 ,2 ]
Tsave, Olga [3 ]
Salifoglou, Athanasios [3 ]
Maglaveras, Nicos [2 ,4 ]
Vlahavas, Ioannis [1 ]
Chouvarda, Ioanna [2 ,4 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
[2] CERTH, Inst Appl Biosci, Thessaloniki, Greece
[3] Aristotle Univ Thessaloniki, Inorgan Chem Lab, Dept Chem Engn, Thessaloniki 54124, Greece
[4] Aristotle Univ Thessaloniki, Lab Comp & Med Informat, Sch Med, Thessaloniki 54124, Greece
关键词
Machine learning; Data mining; Diabetes mellitus; Diabetic complications; Disease prediction models; Biomarker(s) identification; PREDICTIVE MODELS; RISK-ASSESSMENT; RETINOPATHY; MELLITUS; DISEASE; DIAGNOSIS; CLASSIFICATION; OPTIMIZATION; ASSOCIATION; EXTRACTION;
D O I
10.1016/j.csbj.2016.12.005
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM. (C) 2017 The Authors. Published by Elsevier B.V.
引用
收藏
页码:104 / 116
页数:13
相关论文
共 50 条
  • [1] Data mining/machine learning methods in foodomics
    Jimenez-Carvelo, Ana M.
    Cuadros-Rodriguez, Luis
    CURRENT OPINION IN FOOD SCIENCE, 2021, 37 : 76 - 82
  • [2] Research on real estate pricing methods based on data mining and machine learning
    Yu, Yanliang
    Lu, Jingfu
    Shen, Dan
    Chen, Binbing
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (09): : 3925 - 3937
  • [3] Research on real estate pricing methods based on data mining and machine learning
    Yanliang Yu
    Jingfu Lu
    Dan Shen
    Binbing Chen
    Neural Computing and Applications, 2021, 33 : 3925 - 3937
  • [4] Fuzzy methods in machine learning and data mining: Status and prospects
    Hüllermeier, E
    FUZZY SETS AND SYSTEMS, 2005, 156 (03) : 387 - 406
  • [5] Knowledge Discovery: Methods from data mining and machine learning
    Shu, Xiaoling
    Ye, Yiwan
    SOCIAL SCIENCE RESEARCH, 2023, 110
  • [6] Italian Machine Learning and Data Mining research: The last years
    Di Mauro, Nicola
    Frasconi, Paolo
    Angiulli, Fabrizio
    Bacciu, Davide
    de Gemmis, Marco
    Esposito, Floriana
    Fanizzi, Nicola
    Ferilli, Stefano
    Gori, Marco
    Lisi, Francesca A.
    Lops, Pasquale
    Malerba, Donato
    Micheli, Alessio
    Pelillo, Marcello
    Ricci, Francesco
    Riguzzi, Fabrizio
    Saitta, Lorenza
    Semeraro, Giovanni
    INTELLIGENZA ARTIFICIALE, 2013, 7 (02) : 77 - 89
  • [7] Research on Data Mining Technology Based on Machine Learning Algorithm
    Li, Shangran
    2018 INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY, 2019, 1168
  • [8] A Research on Machine Learning Methods for Big Data Processing
    Qiu, Junfei
    Sun, Youming
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT INNOVATION, 2015, 28 : 920 - 928
  • [9] Machine learning and data mining
    Mitchell, TM
    COMMUNICATIONS OF THE ACM, 1999, 42 (11) : 30 - 36
  • [10] Data Mining and Machine Learning Methods Applied to A Numerical Clinching Model
    Goetz, Marco
    Leichsenring, Ferenc
    Kropp, Thomas
    Muller, Peter
    Falk, Tobias
    Graf, Wolfgang
    Kaliske, Michael
    Drossel, Welf-Guntram
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2018, 117 (03): : 387 - 423