Transforming Big Data into AI-ready data for nutrition and obesity research

被引:1
|
作者
Thomas, Diana M. [1 ]
Knight, Rob [2 ]
Gilbert, Jack A. [3 ,4 ]
Cornelis, Marilyn C. [5 ]
Gantz, Marie G. [6 ]
Burdekin, Kate [6 ]
Cummiskey, Kevin [1 ]
Sumner, Susan C. J. [7 ]
Pathmasiri, Wimal [7 ]
Sazonov, Edward [8 ]
Gabriel, Kelley Pettee [9 ]
Dooley, Erin E. [9 ]
Green, Mark A. [10 ]
Pfluger, Andrew [11 ]
Kleinberg, Samantha [12 ]
机构
[1] United States Mil Acad, Dept Math Sci, West Point, NY 10996 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Program, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Pediat, La Jolla, CA USA
[4] Univ Calif San Diego, Scripps Inst Oceanog, La Jolla, CA USA
[5] Northwestern Univ, Feinberg Sch Med, Dept Prevent Med, Chicago, IL USA
[6] Res Triangle Inst Int, Biostat & Epidemiol Div, Res Triangle Pk, NC USA
[7] Univ North Carolina Chapel Hill, Nutr Res Inst, Dept Nutr, Kannapolis, NC USA
[8] Univ Alabama, Elect & Comp Engn Dept, Tuscaloosa, AL USA
[9] Univ Alabama Birmingham, Dept Epidemiol, Birmingham, AL USA
[10] Univ Liverpool, Dept Geog & Planning, Liverpool, England
[11] United States Mil Acad, Dept Geog & Environm Engn, West Point, NY USA
[12] Stevens Inst Technol, Comp Sci Dept, Hoboken, NJ USA
基金
美国国家卫生研究院;
关键词
ENERGY-EXPENDITURE; METABOLOMICS; FOOD; MICROBIOME; CHALLENGES; SEQUENCES; COUNT; WRIST;
D O I
10.1002/oby.23989
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, this data in raw form are often not usable. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is required to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. Understanding the complexity of these steps by the end user is critical for reducing misunderstanding, faulty interpretation, and erroneous downstream conclusions. Methods: We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. The preprocessing pipelines, specialized software, challenges, and how decisions impact final AI- and ML-ready products were detailed. Results: Opportunities for advances to improve quality control, speed of preprocessing, and intelligent end user consumption were presented. Conclusions: Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.
引用
收藏
页码:857 / 870
页数:14
相关论文
共 50 条
  • [31] AI and Big data applications in nematology
    DiGennaro, Peter
    Mishra, S.
    JOURNAL OF NEMATOLOGY, 2023, 55 (01) : 26 - 26
  • [32] AI, big data, and the future of consent
    Adam J. Andreotta
    Nin Kirkham
    Marco Rizzi
    AI & SOCIETY, 2022, 37 : 1715 - 1728
  • [33] Big research data and data science
    Li, Jianhui
    Data Science Journal, 2015, 14
  • [34] Research on Big Data and Data Acquisition
    Huang, Ning
    PROCEEDINGS OF THE 2017 5TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTING TECHNOLOGY (ICMMCT 2017), 2017, 126 : 1268 - 1272
  • [35] Research of Big data and data security
    Ren, Qiong
    Xu, Zhongyuan
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING AND INFORMATION TECHNOLOGY APPLICATIONS, 2015, 28 : 871 - 875
  • [36] On the Research of Data Warehouse in Big Data
    Qin, Hai-fei
    Qian, Zhi-ming
    Zhao, Yong-chao
    2015 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2015, : 354 - 357
  • [37] Big data in nephrology: Are we ready for the change?
    Yang, Chao
    Kong, Guilan
    Wang, Liwei
    Zhang, Luxia
    Zhao, Ming-Hui
    NEPHROLOGY, 2019, 24 (11) : 1097 - 1102
  • [38] Research and Application Progress of Big Data and Dietary Nutrition Health in China
    Tang X.
    Zhou B.
    Yang W.
    Liu T.
    Xu G.
    Ma Z.
    Wu W.
    Science and Technology of Food Industry, 2023, 44 (02) : 19 - 28
  • [39] Are hospitality/tourism curricula ready for big data?
    Ogbeide, Godwin-Charles
    Fu, Yao-Yi
    Cecil, Amanda Kay
    JOURNAL OF HOSPITALITY AND TOURISM TECHNOLOGY, 2021, 12 (01) : 112 - 123
  • [40] Big Data and IS Research
    Goes, Paulo B.
    MIS QUARTERLY, 2014, 38 (03) : III - VIII