Adding open spectral data to MassBank and PubChem using open source tools to support non-targeted exposomics of mixtures

被引:8
|
作者
Elapavalore, Anjana [1 ]
Kondic, Todor [1 ]
Singh, Randolph R. [1 ,2 ]
Shoemaker, Benjamin A. [3 ]
Thiessen, Paul A. [3 ]
Zhang, Jian [3 ]
Bolton, Evan E. [3 ]
Schymanski, Emma L. [1 ]
机构
[1] Univ Luxembourg, Luxembourg Ctr Syst Biomed LCSB, 6 Ave Swing, L-4367 Belvaux, Luxembourg
[2] IFREMER Inst Francais Rech Exploitat Mer, Lab Biogeochim Contaminants Organ, Rue Ile Yeu,BP 21105, F-44311 Nantes 3, France
[3] NIH, Natl Ctr Biotechnol Informat NCBI, Natl Lib Med NLM, Bethesda, MD 20894 USA
基金
美国国家卫生研究院;
关键词
SPECTROMETRY; CHALLENGE; CHEMICALS; MS/MS;
D O I
10.1039/d3em00181d
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The term "exposome" is defined as a comprehensive study of life-course environmental exposures and the associated biological responses. Humans are exposed to many different chemicals, which can pose a major threat to the well-being of humanity. Targeted or non-targeted mass spectrometry techniques are widely used to identify and characterize various environmental stressors when linking exposures to human health. However, identification remains challenging due to the huge chemical space applicable to exposomics, combined with the lack of sufficient relevant entries in spectral libraries. Addressing these challenges requires cheminformatics tools and database resources to share curated open spectral data on chemicals to improve the identification of chemicals in exposomics studies. This article describes efforts to contribute spectra relevant for exposomics to the open mass spectral library MassBank (https://www.massbank.eu) using various open source software efforts, including the R packages RMassBank and Shinyscreen. The experimental spectra were obtained from ten mixtures containing toxicologically relevant chemicals from the US Environmental Protection Agency (EPA) Non-Targeted Analysis Collaborative Trial (ENTACT). Following processing and curation, 5582 spectra from 783 of the 1268 ENTACT compounds were added to MassBank, and through this to other open spectral libraries (e.g., MoNA, GNPS) for community benefit. Additionally, an automated deposition and annotation workflow was developed with PubChem to enable the display of all MassBank mass spectra in PubChem, which is rerun with each MassBank release. The new spectral records have already been used in several studies to increase the confidence in identification in non-target small molecule identification workflows applied to environmental and exposomics research.
引用
收藏
页码:1788 / 1801
页数:14
相关论文
共 44 条
  • [1] Open source software toolchain for automated non-targeted screening for toxins in alternative foods
    Breuer, S. W.
    Toppen, L.
    Schum, S. K.
    Pearce, J. M.
    METHODSX, 2021, 8
  • [2] Satellite Data Classification Using Open Source Support
    S. Biswal
    A. Ghosh
    R. Sharma
    P. K. Joshi
    Journal of the Indian Society of Remote Sensing, 2013, 41 : 523 - 530
  • [3] Satellite Data Classification Using Open Source Support
    Biswal, S.
    Ghosh, A.
    Sharma, R.
    Joshi, P. K.
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2013, 41 (03) : 523 - 530
  • [4] Chemical mixtures: File format, open source tools, example data, and mixtures InChl derivative
    Clark, Alex
    Cheung, Philip
    Darlington, Janice
    McEwen, Leah
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
  • [5] BIG DATA, OPEN SOURCE TOOLS, AND CLINICAL DECISION SUPPORT IN A PEDIATRIC ICU
    Kennedy, Curtis
    Arikan, Ayse
    Williams, Eric
    CRITICAL CARE MEDICINE, 2014, 42 (12)
  • [6] Open Source Synergy: Developing and Validating PMU Data Analysis Techniques Using Open Source Tools and Datasets
    Etingov, Pavel
    Follum, Jim
    Biswas, Shuchismita
    Yin, Tianzhixi
    2024 INTERNATIONAL CONFERENCE ON SMART GRID SYNCHRONIZED MEASUREMENTS AND ANALYTICS, SGSMA 2024, 2024,
  • [7] Spatial Data Warehouses and SOLAP Using Open-Source Tools
    Bogantes Gonzalez, Diana
    Pandolfi Gonzalez, Leonardo
    PROCEEDINGS OF THE 2013 XXXIX LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2013,
  • [8] Data Anonymization: An Experimental Evaluation Using Open-Source Tools
    Tomas, Joana
    Rasteiro, Deolinda
    Bernardino, Jorge
    FUTURE INTERNET, 2022, 14 (06):
  • [9] Shock tube data processing tools using open source hardware and software platforms
    Thirumalesh, K.
    Raju, Salgeri Puttaswamy
    Somashekarappa, Hiriyur Mallaiah
    Swaroop, Kumaraswamy
    ENGINEERING REPORTS, 2021, 3 (09)
  • [10] Big Data Analytics for Power Distribution Systems using AMI and Open Source Tools
    Duggan, Gerald P.
    Zimmerle, Daniel
    Upadhyay, Sonu
    2020 IEEE/PES TRANSMISSION AND DISTRIBUTION CONFERENCE AND EXPOSITION (T&D), 2020,