A Novel Approach for Exploring Data-Driven Nutritional Insights Using Clustering and Dimensionality Reduction Techniques

Cited by: 0
Authors
Nandini Garg [1]
Pulkit Dwivedi [2]
Affiliations
[1] Chandigarh University, Apex Institute of Technology (CSE)
[2] IILM University, School of Computer Science and Engineering
Keywords
Dimensionality reduction; PCA; t-SNE; Nutrition; Clustering; Machine learning
DOI
10.1007/s42979-024-03397-w
Abstract
Analyzing high-dimensional datasets is challenging, particularly in big data analytics, where extracting meaningful insights is crucial. Current techniques often struggle to balance preserving global data structure with capturing local relationships. In this study, we address these challenges by integrating Principal Component Analysis (PCA) and t-distributed Stochastic Neighbor Embedding (t-SNE) for dimensionality reduction, with a dedicated focus on the domain of nutrition. Our research targets pressing health concerns among young people, such as obesity and nutritional deficiencies, by simplifying the analysis of extensive nutrition datasets. We propose a methodology that supports the selection of nutritious food alternatives through effective data simplification and analysis. We demonstrate the efficacy of the approach through comprehensive experimental results, which include detailed comparisons with state-of-the-art methods and evaluations of clustering accuracy, computational efficiency, and visualization quality. Additionally, we tune the hyperparameters of the clustering algorithms using the Elbow Method and the Silhouette Coefficient. Our findings highlight the significant role of dimensionality reduction in improving data analysis and machine learning processes. This study offers insights for researchers and practitioners, contributing to a deeper understanding of how dimensionality reduction techniques can uncover latent knowledge within large datasets. Ultimately, our research aims to facilitate more informed decision-making and drive innovation in big data analytics, with practical applications across diverse domains, particularly nutrition and health.
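The abstract names the Elbow Method and the Silhouette Coefficient as the tools for tuning the clustering step. The sketch below illustrates how such a selection of the cluster count k could work; it is not the authors' implementation. Everything in it is an assumption made for illustration: the toy 2-D points stand in for a nutrition dataset already reduced to two dimensions (for example by PCA or t-SNE), k-means is assumed as the clustering algorithm, and the helper names `farthest_first`, `kmeans`, and `silhouette` are hypothetical. It uses only the Python standard library; in practice a library implementation would normally be used.

```python
import math

def farthest_first(points, k):
    """Deterministic seeding (an assumption): start at points[0], then
    repeatedly add the point farthest from all centers chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers

def kmeans(points, k, iters=100):
    """Plain Lloyd's k-means. Returns (labels, inertia), where inertia is
    the within-cluster sum of squared distances used by the Elbow Method."""
    centers = farthest_first(points, k)
    for _ in range(iters):
        # Assignment step: index of the nearest center for every point.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                new_centers.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep the center of an empty cluster
        if new_centers == centers:
            break  # converged: assignments can no longer change
        centers = new_centers
    inertia = sum(math.dist(p, centers[l]) ** 2 for p, l in zip(points, labels))
    return labels, inertia

def silhouette(points, labels):
    """Mean silhouette coefficient; requires at least two non-empty clusters."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    scores = []
    for p, l in zip(points, labels):
        own = clusters[l]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for m, other in clusters.items() if m != l)
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

# Synthetic stand-in data: three tight groups of five points each.
blob_centers = [(0.0, 0.0), (10.0, 0.0), (5.0, 9.0)]
offsets = [(-0.5, 0.0), (0.5, 0.0), (0.0, -0.5), (0.0, 0.5), (0.0, 0.0)]
points = [(cx + dx, cy + dy) for cx, cy in blob_centers for dx, dy in offsets]

results = {}
for k in range(2, 7):
    labels, inertia = kmeans(points, k)
    results[k] = (inertia, silhouette(points, labels))
    print(f"k={k}  inertia={results[k][0]:.2f}  silhouette={results[k][1]:.3f}")

# Pick the k with the highest mean silhouette score.
best_k = max(results, key=lambda k: results[k][1])
print("best k by silhouette:", best_k)
```

The printed inertia column is what the Elbow Method inspects: the elbow is the k after which inertia stops dropping sharply, and it is usually judged by eye from a plot. The silhouette score gives a numeric criterion instead, which is why the sketch selects `best_k` from it. On this toy data the score peaks at k = 3, matching the three groups built into the points.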