Diabetes Mellitus;
Bioinformatics;
Medical Diagnosis;
Machine Learning;
Classification;
MULTIPLE IMPUTATION;
MEDICAL DIAGNOSIS;
MISSING DATA;
D O I:
暂无
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Diabetes mellitus is a chronic disease and a health challenge worldwide. According to the International Diabetes Federation, 451 million people across the globe have diabetes, with this number anticipated to rise up to 693 million people by 2045. It has been shown that 80% of the complications arising from type II diabetes can be prevented or delayed by early identification of the people who are at risk. Diabetes is difficult to diagnose in the early stages as its symptoms grow subtly and gradually. In a majority of the cases, the patients remain undiagnosed until they are admitted for a heart attack or begin to lose their sight. This paper analyzes the different classification algorithms based on a patient's health history to aid doctors identify the presence of as well as promote early diagnosis and treatment. The experiments were conducted on Pima Indian Diabetes data set. Various classifiers used include K Nearest Neighbors, Logistic Regression, Decision Trees, Random Forest, Gradient Boosting, Support Vector Machine and Neural Network. Results demonstrate that Random Forests performed well on the data set giving an accuracy of 79.7%.
机构:
Chinese Univ Hong Kong, Dept Physiol, Fac Med, Shatin, Hong Kong, Peoples R ChinaChinese Univ Hong Kong, Dept Physiol, Fac Med, Shatin, Hong Kong, Peoples R China
Chu, Kwan Yi
Leung, Po Sing
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Univ Hong Kong, Dept Physiol, Fac Med, Shatin, Hong Kong, Peoples R ChinaChinese Univ Hong Kong, Dept Physiol, Fac Med, Shatin, Hong Kong, Peoples R China
机构:
Department of Health Care Management, University of Greifswald, Friedrich-Loeffler-Straẞe 70, GreifswaldDepartment of Health Care Management, University of Greifswald, Friedrich-Loeffler-Straẞe 70, Greifswald