Metadata integrity in bioinformatics: Bridging the gap between data and knowledge

被引:2
|
作者
Caliskan, Aylin [1 ]
Dangwal, Seema [2 ]
Dandekar, Thomas [1 ]
机构
[1] Univ Wurzburg, Dept Bioinformat, Bioctr, D-97074 Wurzburg, Germany
[2] Stanford Univ, Stanford Cardiovasc Inst, Dept Med, Sch Med, Stanford, CA 94305 USA
关键词
Meta-data; Error; Annotation; Error-transfer; Wrong labelling; Patient data; Control group; Tools overview; CONTROLLED VOCABULARIES; GENE-EXPRESSION; CELL; CHALLENGES; ONTOLOGIES;
D O I
10.1016/j.csbj.2023.10.006
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data imply also a big responsibility. This is not only due to genomics data being sensitive information but also due to genomics data being shared and re-analysed among the scientific community. This saves valuable resources and can even help to find new insights in silico. To fully use these opportunities, detailed and correct metadata are imperative. This includes not only the availability of metadata but also their correctness. Metadata integrity serves as a fundamental determinant of research credibility, supporting the reliability and reproducibility of data-driven findings. Ensuring metadata availability, curation, and accuracy are therefore essential for bioinformatic research. Not only must metadata be readily available, but they must also be meticulously curated and ideally error-free. Motivated by an accidental discovery of a critical metadata error in patient data published in two high-impact journals, we aim to raise awareness for the need of correct, complete, and curated metadata. We describe how the metadata error was found, addressed, and present examples for metadata-related challenges in omics research, along with supporting measures, including tools for checking metadata and software to facilitate various steps from data analysis to published research.
引用
收藏
页码:4895 / 4913
页数:19
相关论文
共 50 条
  • [21] Bridging the Gap between Human Knowledge and Machine Learning
    Alvarado-Perez, Juan C.
    Peluffo-Ordonez, Diego H.
    Theron, Roberto
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2015, 4 (01): : 54 - 64
  • [22] BRIDGING THE GAP BETWEEN NEUROSCIENCE KNOWLEDGE AND ADDICTION TREATMENT
    Verdejo-Garcia, Antonio
    DRUG AND ALCOHOL REVIEW, 2019, 38 : S5 - S5
  • [23] Bridging the gap between the data warehouse and XML
    Burnell, D
    Al-Zobaidie, A
    Windall, G
    14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 241 - 246
  • [24] Bridging the Gap between Functional and Structural Data
    Burtscher, Verena
    Hotka, Matej
    Stockner, Thomas
    Machtens, Jan-Philipp
    Sandtner, Walter
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 557A - 557A
  • [25] Bridging the gap between overt and personality-based integrity tests
    Hogan, J
    Brinkmeyer, K
    PERSONNEL PSYCHOLOGY, 1997, 50 (03) : 587 - 599
  • [26] Self-Contained Sequence Representation: Bridging the Gap between Bioinformatics and Cheminformatics
    Chen, William L.
    Leland, Burton A.
    Durant, Joseph L.
    Grier, David L.
    Christie, Bradley D.
    Nourse, James G.
    Taylor, Keith T.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (09) : 2186 - 2208
  • [27] Bridging the gap between medical and bioinformatics: An ontological case study in colon carcinoma
    Kumar, Anand
    Yip, Yum Lina
    Smith, Barry
    Grenon, Pierre
    COMPUTERS IN BIOLOGY AND MEDICINE, 2006, 36 (7-8) : 694 - 711
  • [28] Bridging the gap in African biodiversity genomics and bioinformatics
    Abdoallah Sharaf
    Charlotte C. Ndiribe
    Taiwo Crossby Omotoriogun
    Linelle Abueg
    Bouabid Badaoui
    Fatu J. Badiane Markey
    Girish Beedessee
    Diaga Diouf
    Vincent C. Duru
    Chukwuike Ebuzome
    Samuel C. Eziuzor
    Yasmina Jaufeerally Fakim
    Giulio Formenti
    Nidhal Ghanmi
    Fatma Zahra Guerfali
    Isidore Houaga
    Justin Eze Ideozu
    Sally Mueni Katee
    Slimane Khayi
    Josiah O. Kuja
    Emmanuel Hala Kwon-Ndung
    Rose A. Marks
    Acclaim M. Moila
    Zahra Mungloo-Dilmohamud
    Sadik Muzemil
    Helen Nigussie
    Julian O. Osuji
    Verena Ras
    Yves H. Tchiechoua
    Yedomon Ange Bovys Zoclanclounon
    Krystal A. Tolley
    Cathrine Ziyomo
    Ntanganedzeni Mapholi
    Anne W. T. Muigai
    Appolinaire Djikeng
    ThankGod Echezona Ebenezer
    Nature Biotechnology, 2023, 41 : 1348 - 1354
  • [29] Bridging the gap in African biodiversity genomics and bioinformatics
    Sharaf, Abdoallah
    Ndiribe, Charlotte C.
    Omotoriogun, Taiwo Crossby
    Abueg, Linelle
    Badaoui, Bouabid
    Markey, Fatu J. Badiane
    Beedessee, Girish
    Diouf, Diaga
    Duru, Vincent C.
    Ebuzome, Chukwuike
    Eziuzor, Samuel C.
    Fakim, Yasmina Jaufeerally
    Formenti, Giulio
    Ghanmi, Nidhal
    Guerfali, Fatma Zahra
    Houaga, Isidore
    Ideozu, Justin Eze
    Katee, Sally Mueni
    Khayi, Slimane
    Kuja, Josiah O.
    Kwon-Ndung, Emmanuel Hala
    Marks, Rose A.
    Moila, Acclaim M.
    Mungloo-Dilmohamud, Zahra
    Muzemil, Sadik
    Nigussie, Helen
    Osuji, Julian O.
    Ras, Verena
    Tchiechoua, Yves H.
    Zoclanclounon, Yedomon Ange Bovys
    Tolley, Krystal A.
    Ziyomo, Cathrine
    Mapholi, Ntanganedzeni
    Muigai, Anne W. T.
    Djikeng, Appolinaire
    Ebenezer, ThankGod Echezona
    NATURE BIOTECHNOLOGY, 2023, 41 (09) : 1348 - 1354
  • [30] From knowledge to advocacy: Bridging the gap between research and action
    Reynolds, Evelyn A.
    Harrington, Shariska P.
    Bakkum-Gamez, Jamie N.
    GYNECOLOGIC ONCOLOGY REPORTS, 2024, 54