Adding open spectral data to MassBank and PubChem using open source tools to support non-targeted exposomics of mixtures

Cited by: 8
Authors
Elapavalore, Anjana [1]
Kondic, Todor [1]
Singh, Randolph R. [1, 2]
Shoemaker, Benjamin A. [3]
Thiessen, Paul A. [3]
Zhang, Jian [3]
Bolton, Evan E. [3]
Schymanski, Emma L. [1]
Affiliations
[1] Univ Luxembourg, Luxembourg Ctr Syst Biomed LCSB, 6 Ave Swing, L-4367 Belvaux, Luxembourg
[2] IFREMER Inst Francais Rech Exploitat Mer, Lab Biogeochim Contaminants Organ, Rue Ile Yeu,BP 21105, F-44311 Nantes 3, France
[3] NIH, Natl Ctr Biotechnol Informat NCBI, Natl Lib Med NLM, Bethesda, MD 20894 USA
Funding
US National Institutes of Health (NIH)
Keywords
SPECTROMETRY; CHALLENGE; CHEMICALS; MS/MS
DOI
10.1039/d3em00181d
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
The term "exposome" is defined as a comprehensive study of life-course environmental exposures and the associated biological responses. Humans are exposed to many different chemicals, which can pose a major threat to the well-being of humanity. Targeted or non-targeted mass spectrometry techniques are widely used to identify and characterize various environmental stressors when linking exposures to human health. However, identification remains challenging due to the huge chemical space applicable to exposomics, combined with the lack of sufficient relevant entries in spectral libraries. Addressing these challenges requires cheminformatics tools and database resources to share curated open spectral data on chemicals to improve the identification of chemicals in exposomics studies. This article describes efforts to contribute spectra relevant for exposomics to the open mass spectral library MassBank (https://www.massbank.eu) using various open source software efforts, including the R packages RMassBank and Shinyscreen. The experimental spectra were obtained from ten mixtures containing toxicologically relevant chemicals from the US Environmental Protection Agency (EPA) Non-Targeted Analysis Collaborative Trial (ENTACT). Following processing and curation, 5582 spectra from 783 of the 1268 ENTACT compounds were added to MassBank, and through this to other open spectral libraries (e.g., MoNA, GNPS) for community benefit. Additionally, an automated deposition and annotation workflow was developed with PubChem to enable the display of all MassBank mass spectra in PubChem, which is rerun with each MassBank release. The new spectral records have already been used in several studies to increase the confidence in identification in non-target small molecule identification workflows applied to environmental and exposomics research.
Pages: 1788-1801
Page count: 14