A Big Data framework to analyze risk factors of diabetes outbreak in Indian population using a MapReduce algorithm

被引:0
|
作者
Ramsingh, J. [1 ]
Bhuvaneswari, V. [1 ]
机构
[1] Bharathiar Univ, Dept Comp Applicat, Coimbatore, Tamil Nadu, India
关键词
Big Data; Social Media; Diabetics; Corpus; Text Mining; Map Reduce; FITNESS; GLUCOSE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increase in burden of chronic disease is hurting the economic and the prosperity of the country with the global risk, financial loss with increased expenditure, loss of productivity and likely to affect India's economic development adversely over the next couple of decades. Instantaneous measures are to be taken to create awareness to thwart epidemic among Indian Population. A Big Data unified data analysis and evaluation framework is proposed to analyze the awareness of risk factors of Diabetes among young, middle-aged Indian population. As a first phase data acquisition is done from heterogeneous data sources with different formats (Xml, Log files, Text document, Whats app, Emails) using Scoop. The data acquired is converted from different structure to a structured format using ETL and Text mining engine, Diabetic corpus is formed using with the reference of the food chart and the domain consultant for further processing its stored in HDFS. The data analysis is done as a MapReduce task using machine learning algorithms and the results are visualized. The results show devastating effects on the middle aged Indian population. High intake of refined carbohydrate foods and significant reduction of physical activity resulted in many younger generations being more prone to endemic diabetes. Rapid nutrition transition due to westernized diet and lifestyle increase the rate of diabetes. More than half of the young adolescents are more prone to diabetes. Extensive studies and clinical evidences show that type-2 diabetes is almost preventable through lifestyle changes and food habits. To hold back the growing outbreak of diabetes, the primary prevention must be through advertise of a healthy diet, food nutrition value and good physical activity as a global public policy priority.
引用
收藏
页码:1755 / 1760
页数:6
相关论文
共 50 条
  • [31] Density-based Algorithms for Big Data Clustering Using MapReduce Framework: A Comprehensive Study
    Khader, Mariam
    Al-Naymat, Ghazi
    ACM COMPUTING SURVEYS, 2020, 53 (05)
  • [32] Sequence-Growth : A Scalable and Effective Frequent Itemset Mining Algorithm for Big Data Based on MapReduce Framework
    Liang, Yen-hui
    Wu, Shiow-yang
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 393 - 400
  • [33] A Novel Nodesets-Based Frequent Itemset Mining Algorithm for Big Data using MapReduce
    Sivaiah, Borra
    Rao, Ramisetty Rajeswara
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (09) : 1051 - 1058
  • [34] Risk of developing Diabetes Mellitus among urban poor South Indian population using Indian Diabetes Risk Score
    Oruganti, Aditya
    Kavi, Avinash
    Walvekar, Padmaja R.
    JOURNAL OF FAMILY MEDICINE AND PRIMARY CARE, 2019, 8 (02) : 487 - 492
  • [35] MapReduce with Deep Learning Framework for Student Health Monitoring System using IoT Technology for Big Data
    Akhtar, Md. Mobin
    Shatat, Abdallah Saleh Ali
    Al-Hashimi, Mukhtar
    Zamani, Abu Sarwar
    Rizwanullah, Mohammed
    Mohamed, Sara Saadeldeen Ibrahim
    Ayub, Rashid
    JOURNAL OF GRID COMPUTING, 2023, 21 (04)
  • [36] An intelligent approach to Big Data analytics for sustainable retail environment using Apriori-MapReduce framework
    Verma, Neha
    Singh, Jatinder
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2017, 117 (07) : 1503 - 1520
  • [37] MapReduce with Deep Learning Framework for Student Health Monitoring System using IoT Technology for Big Data
    Md. Mobin Akhtar
    Abdallah Saleh Ali Shatat
    Mukhtar Al-Hashimi
    Abu Sarwar Zamani
    Mohammed Rizwanullah
    Sara Saadeldeen Ibrahim Mohamed
    Rashid Ayub
    Journal of Grid Computing, 2023, 21
  • [38] Big Data Analytics: Performance Evaluation for High Availability and Fault Tolerance using MapReduce Framework with HDFS
    Verma, Jai Prakash
    Mankad, Sapan H.
    Garg, Sanjay
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 770 - 775
  • [39] Performance Analysis of Matrix and Graph Computations using Data Compression Techniques in MPI and Hadoop MapReduce in Big Data Framework
    Ramakrishnaiah, Nagendla
    Reddy, Sirigiri Konda
    2017 IEEE INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES AND MANAGEMENT FOR COMPUTING, COMMUNICATION, CONTROLS, ENERGY AND MATERIALS (ICSTM), 2017, : 54 - 62
  • [40] MapReduce and Spark-based Analytic Framework Using Social Media Data for Earlier Flu Outbreak Detection
    Al Essa, Ali
    Faezipour, Miad
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, ICDM 2017, 2017, 10357 : 246 - 257