Challenges of big data integration in the life sciences

被引:0
|
作者
Sven Fillinger
Luis de la Garza
Alexander Peltzer
Oliver Kohlbacher
Sven Nahnsen
机构
[1] University of Tübingen,Quantitative Biology Center (QBiC)
[2] University of Tübingen,Center for Bioinformatics
[3] Applied Bioinformatics,Institute for Translational Bioinformatics
[4] Department of Computer Science,Biomolecular Interactions
[5] University Hospital of Tübingen,undefined
[6] Max Planck Institute for Developmental Biology,undefined
来源
关键词
Big data; Bioanalytics; Data integration; Bioinformatics; Scalability;
D O I
暂无
中图分类号
学科分类号
摘要
Big data has been reported to be revolutionizing many areas of life, including science. It summarizes data that is unprecedentedly large, rapidly generated, heterogeneous, and hard to accurately interpret. This availability has also brought new challenges: How to properly annotate data to make it searchable? What are the legal and ethical hurdles when sharing data? How to store data securely, preventing loss and corruption? The life sciences are not the only disciplines that must align themselves with big data requirements to keep up with the latest developments. The large hadron collider, for instance, generates research data at a pace beyond any current biomedical research center. There are three recent major coinciding events that explain the emergence of big data in the context of research: the technological revolution for data generation, the development of tools for data analysis, and a conceptual change towards open science and data. The true potential of big data lies in pattern discovery in large datasets, as well as the formulation of new models and hypotheses. Confirmation of the existence of the Higgs boson, for instance, is one of the most recent triumphs of big data analysis in physics. Digital representations of biological systems have become more comprehensive. This, in combination with advances in machine learning, creates exciting new research possibilities. In this paper, we review the state of big data in bioanalytical research and provide an overview of the guidelines for its proper usage.
引用
收藏
页码:6791 / 6800
页数:9
相关论文
共 50 条
  • [21] Recent trends in knowledge and data integration for the life sciences
    McGarry, Ken
    Garfield, Sheila
    Morris, Nick
    EXPERT SYSTEMS, 2006, 23 (05) : 330 - 341
  • [22] Data integration in the life sciences: Fun, findings and frustrations
    Paton, Norman W.
    DATA INTEGRATION IN THE LIFE SCIENCES, PROCEEDINGS, 2008, 5109 : 8 - 10
  • [23] Data integration and knowledge aggregation in life sciences discovery
    Stephens, Susie
    Scientific Computing and Instrumentation, 2005, 22 (02): : 21 - 23
  • [24] Big Data Integration: The Big Promise of Data Integration
    Gal, Avigdor
    2015 3RD INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD) AND INTERNATIONAL CONFERENCE ON OPEN AND BIG (OBD), 2015, : XLIV - XLIV
  • [25] Special Issue on "Big Data in Biology, Life Sciences and Healthcare"
    He, Q. Peter
    Wang, Jin
    PROCESSES, 2022, 10 (01)
  • [26] Genotype and phenotype data standardization, utilization and integration in the big data era for agricultural sciences
    Deng, Cecilia H.
    Naithani, Sushma
    Kumari, Sunita
    Cobo-Simon, Irene
    Quezada-Rodriguez, Elsa H.
    Skrabisova, Maria
    Gladman, Nick
    Correll, Melanie J.
    Sikiru, Akeem Babatunde
    Afuwape, Olusola O.
    Marrano, Annarita
    Rebollo, Ines
    Zhang, Wentao
    Jung, Sook
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2023, 2023
  • [27] A data management infrastructure for the integration of imaging and omics data in life sciences
    Cuellar, Luis Kuhn
    Friedrich, Andreas
    Gabernet, Gisela
    de la Garza, Luis
    Fillinger, Sven
    Seyboldt, Adrian
    Koch, Tobias
    Zur Oven-Krockhaus, Sven
    Wanke, Friederike
    Richter, Sandra
    Thaiss, Wolfgang M.
    Horger, Marius
    Malek, Nisar
    Harter, Klaus
    Bitzer, Michael
    Nahnsen, Sven
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [28] A data management infrastructure for the integration of imaging and omics data in life sciences
    Luis Kuhn Cuellar
    Andreas Friedrich
    Gisela Gabernet
    Luis de la Garza
    Sven Fillinger
    Adrian Seyboldt
    Tobias Koch
    Sven zur Oven-Krockhaus
    Friederike Wanke
    Sandra Richter
    Wolfgang M. Thaiss
    Marius Horger
    Nisar Malek
    Klaus Harter
    Michael Bitzer
    Sven Nahnsen
    BMC Bioinformatics, 23
  • [29] Life Sciences Data and Application Integration with B-Fabric
    Turker, Can
    Akal, Fuat
    Schlapbach, Ralph
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2011, 8 (02):
  • [30] Big Data, Big Challenges
    Boland, Michael V.
    OPHTHALMOLOGY, 2016, 123 (01) : 7 - 8