Combining Experimental with Computational Infrared and Mass Spectra for High-Throughput Nontargeted Chemical Structure Identification

被引:2
|
作者
Karunaratne, Erandika [1 ]
Hill, Dennis W. [1 ]
Duhrkop, Kai [2 ]
Bocker, Sebastian
Grant, David F. [1 ]
机构
[1] Univ Connecticut, Dept Pharmaceut Sci, Storrs, CT 06269 USA
[2] Friedrich Schiller Univ Jena, Fac Math & Comp Sci, Chair Bioinformat, D-07743 Jena, Germany
关键词
INTEGRATED GAS-CHROMATOGRAPHY; METABOLOMICS; DATABASE; MS; FRAGMENTATION; METLIN;
D O I
10.1021/acs.analchem.3c00937
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Theinability to identify the structures of most metabolites detectedin environmental or biological samples limits the utility of nontargetedmetabolomics. The most widely used analytical approaches combine massspectrometry and machine learning methods to rank candidate structurescontained in large chemical databases. Given the large chemical spacetypically searched, the use of additional orthogonal data may improvethe identification rates and reliability. Here, we present resultsof combining experimental and computational mass and IR spectral datafor high-throughput nontargeted chemical structure identification.Experimental MS/MS and gas-phase IR data for 148 test compounds wereobtained from NIST. Candidate structures for each of the test compoundswere obtained from PubChem (mean = 4444 candidate structures per testcompound). Our workflow used CSI:FingerID to initially score and rankthe candidate structures. The top 1000 ranked candidates were subsequentlyused for IR spectra prediction, scoring, and ranking using densityfunctional theory (DFT-IR). Final ranking of the candidates was basedon a composite score calculated as the average of the CSI:FingerIDand DFT-IR rankings. This approach resulted in the correct identificationof 88 of the 148 test compounds (59%). 129 of the 148 test compounds(87%) were ranked within the top 20 candidates. These identificationrates are the highest yet reported when candidate structures are usedfrom PubChem. Combining experimental and computational MS/MS and IRspectral data is a potentially powerful option for prioritizing candidatesfor final structure verification.
引用
收藏
页码:11901 / 11907
页数:7
相关论文
共 50 条
  • [41] Computational identification of protein binding sites on RNAs using high-throughput RNA structure-probing data
    Hu, Xihao
    Wong, Thomas K. F.
    Lu, Zhi John
    Chan, Ting Fung
    Lau, Terrence Chi Kong
    Yiu, Siu Ming
    Yip, Kevin Y.
    BIOINFORMATICS, 2014, 30 (08) : 1049 - 1055
  • [42] Comment on 'PRISMA: A robust and intuitive tool for high-throughput processing of chemical spectra'
    Meier, Robert J.
    CHEMISTRYMETHODS, 2022, 2 (11):
  • [43] High-throughput computational-experimental screening protocol for the discovery of bimetallic catalysts
    Yeo, Byung Chul
    Nam, Hyunji
    Nam, Hyobin
    Kim, Min-Cheol
    Lee, Hong Woo
    Kim, Sung-Chul
    Won, Sung Ok
    Kim, Donghun
    Lee, Kwan-Young
    Lee, Seung Yong
    Han, Sang Soo
    NPJ COMPUTATIONAL MATERIALS, 2021, 7 (01)
  • [44] High-throughput computational-experimental screening protocol for the discovery of bimetallic catalysts
    Byung Chul Yeo
    Hyunji Nam
    Hyobin Nam
    Min-Cheol Kim
    Hong Woo Lee
    Sung-Chul Kim
    Sung Ok Won
    Donghun Kim
    Kwan-Young Lee
    Seung Yong Lee
    Sang Soo Han
    npj Computational Materials, 7
  • [45] High-throughput computational and experimental biology strategy in identifying tumor expressing CAMs
    Sadanandam, A
    Varney, ML
    Singh, RK
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 450 - 451
  • [46] High-Throughput Computational Screening of Hydrocarbon Molecules for Long-Wavelength Infrared Imaging
    Tameh, Maliheh Shaban
    Coropceanu, Veaceslav
    Coen, Addison G.
    Pyun, Jeffrey
    Lichtenberger, Dennis L.
    Bredas, Jean-Luc
    ACS MATERIALS LETTERS, 2024, 6 (09): : 4371 - 4378
  • [47] LassoHTP: A High-Throughput Computational Tool for Lasso Peptide Structure Construction and Modeling
    Juarez, Reecan J.
    Jiang, Yaoyukun
    Tremblay, Matthew
    Shao, Qianzhen
    Link, A. James
    Yang, Zhongyue J.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (02) : 522 - 530
  • [48] CHEMICAL STRUCTURE IDENTIFICATION IN METABOLOMICS: COMPUTATIONAL MODELING OF EXPERIMENTAL FEATURES
    Menikarachchi, Lochana C.
    Hamdalla, Mai A.
    Hill, Dennis W.
    Grant, David F.
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2013, 5 (06):
  • [49] High-Throughput Mass Spectrometry for Hit Identification: Current Landscape and Future Perspectives
    McLaren, David G.
    Shah, Vinit
    Wisniewski, Thomas
    Ghislain, Lucien
    Liu, Chang
    Zhang, Hui
    Saldanha, S. Adrian
    SLAS DISCOVERY, 2021, 26 (02) : 168 - 191
  • [50] A computational framework to integrate high-throughput '-omics' datasets for the identification of potential mechanistic links
    Pedersen, Helle Krogh
    Forslund, Sofia K.
    Gudmundsdottir, Valborg
    Petersen, Anders Ostergaard
    Hildebrand, Falk
    Hyotylainen, Tuulia
    Nielsen, Trine
    Hansen, Torben
    Bork, Peer
    Ehrlich, S. Dusko
    Brunak, Soren
    Oresic, Matej
    Pedersen, Oluf
    Nielsen, Henrik Bjorn
    NATURE PROTOCOLS, 2018, 13 (12) : 2781 - 2800