A Big Data framework to analyze risk factors of diabetes outbreak in Indian population using a MapReduce algorithm

被引:0
|
作者
Ramsingh, J. [1 ]
Bhuvaneswari, V. [1 ]
机构
[1] Bharathiar Univ, Dept Comp Applicat, Coimbatore, Tamil Nadu, India
关键词
Big Data; Social Media; Diabetics; Corpus; Text Mining; Map Reduce; FITNESS; GLUCOSE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increase in burden of chronic disease is hurting the economic and the prosperity of the country with the global risk, financial loss with increased expenditure, loss of productivity and likely to affect India's economic development adversely over the next couple of decades. Instantaneous measures are to be taken to create awareness to thwart epidemic among Indian Population. A Big Data unified data analysis and evaluation framework is proposed to analyze the awareness of risk factors of Diabetes among young, middle-aged Indian population. As a first phase data acquisition is done from heterogeneous data sources with different formats (Xml, Log files, Text document, Whats app, Emails) using Scoop. The data acquired is converted from different structure to a structured format using ETL and Text mining engine, Diabetic corpus is formed using with the reference of the food chart and the domain consultant for further processing its stored in HDFS. The data analysis is done as a MapReduce task using machine learning algorithms and the results are visualized. The results show devastating effects on the middle aged Indian population. High intake of refined carbohydrate foods and significant reduction of physical activity resulted in many younger generations being more prone to endemic diabetes. Rapid nutrition transition due to westernized diet and lifestyle increase the rate of diabetes. More than half of the young adolescents are more prone to diabetes. Extensive studies and clinical evidences show that type-2 diabetes is almost preventable through lifestyle changes and food habits. To hold back the growing outbreak of diabetes, the primary prevention must be through advertise of a healthy diet, food nutrition value and good physical activity as a global public policy priority.
引用
收藏
页码:1755 / 1760
页数:6
相关论文
共 50 条
  • [41] Knowledge Extraction from Big Data using MapReduce-based Parallel-Reduct Algorithm
    Chowdhury, Tapan
    Chakraborty, Susanta
    Setua, S. K.
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 240 - 246
  • [42] MapReduce based big data framework using associative Kruskal poly Kernel classifier for diabetic disease prediction
    Ramani, R.
    Raja, S. Edwin
    Dhinakaran, D.
    Jagan, S.
    Prabaharan, G.
    METHODSX, 2025, 14
  • [43] Predicting the Risk of Diabetes in Big Data Electronic Health Records by using Scalable Random Forest Classification Algorithm
    Rallapalli, Sreekanth
    Suryakanthi, T.
    2016 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND ENGINEERING (ICACCE 2016), 2016, : 281 - 284
  • [44] An efficient and scalable SPARQL query processing framework for big data using MapReduce and hybrid optimum load balancing
    Kumar, V. Naveen
    Kumar, P. S. Ashok
    DATA & KNOWLEDGE ENGINEERING, 2023, 148
  • [45] Risk factors of PAD in asymptomatic Type 2 diabetes patients in the north Indian population
    Kapoor, Rohit
    Kapoor, Atul
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2008, 79 : S93 - S93
  • [46] Risk of type 2 diabetes mellitus among urban slum population using Indian Diabetes Risk Score
    Nittoori, Sreeja
    Wilson, Vidhya
    INDIAN JOURNAL OF MEDICAL RESEARCH, 2020, 152 (03) : 308 - 311
  • [47] Derivation of a Novel Diabetes Risk Score Using Semantic Discretization for Indian Population
    Chandrakar, Omprakash
    Saini, Jatinderkumar R.
    AMBIENT COMMUNICATIONS AND COMPUTER SYSTEMS, RACCCS 2017, 2018, 696 : 331 - 340
  • [48] A Topic Mapping-based framework to analyze textual risk reports from social media big data contents
    Mohammadreza Sheikhattar
    Alireza Mansouri
    The Journal of Supercomputing, 2024, 80 : 9687 - 9712
  • [49] A Topic Mapping-based framework to analyze textual risk reports from social media big data contents
    Sheikhattar, Mohammadreza
    Mansouri, Alireza
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (07): : 9687 - 9712
  • [50] Development and Comparison of Three Data Models for Predicting Diabetes Mellitus Using Risk Factors in a Nigerian Population
    Odukoya, Oluwakemi
    Nwaneri, Solomon
    Odeniyi, Ifedayo
    Akodu, Babatunde
    Oluwole, Esther
    Olorunfemi, Gbenga
    Popoola, Oluwatoyin
    Osuntoki, Akinniyi
    HEALTHCARE INFORMATICS RESEARCH, 2022, 28 (01) : 58 - 67