Combining Experimental with Computational Infrared and Mass Spectra for High-Throughput Nontargeted Chemical Structure Identification

被引:2
|
作者
Karunaratne, Erandika [1 ]
Hill, Dennis W. [1 ]
Duhrkop, Kai [2 ]
Bocker, Sebastian
Grant, David F. [1 ]
机构
[1] Univ Connecticut, Dept Pharmaceut Sci, Storrs, CT 06269 USA
[2] Friedrich Schiller Univ Jena, Fac Math & Comp Sci, Chair Bioinformat, D-07743 Jena, Germany
关键词
INTEGRATED GAS-CHROMATOGRAPHY; METABOLOMICS; DATABASE; MS; FRAGMENTATION; METLIN;
D O I
10.1021/acs.analchem.3c00937
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Theinability to identify the structures of most metabolites detectedin environmental or biological samples limits the utility of nontargetedmetabolomics. The most widely used analytical approaches combine massspectrometry and machine learning methods to rank candidate structurescontained in large chemical databases. Given the large chemical spacetypically searched, the use of additional orthogonal data may improvethe identification rates and reliability. Here, we present resultsof combining experimental and computational mass and IR spectral datafor high-throughput nontargeted chemical structure identification.Experimental MS/MS and gas-phase IR data for 148 test compounds wereobtained from NIST. Candidate structures for each of the test compoundswere obtained from PubChem (mean = 4444 candidate structures per testcompound). Our workflow used CSI:FingerID to initially score and rankthe candidate structures. The top 1000 ranked candidates were subsequentlyused for IR spectra prediction, scoring, and ranking using densityfunctional theory (DFT-IR). Final ranking of the candidates was basedon a composite score calculated as the average of the CSI:FingerIDand DFT-IR rankings. This approach resulted in the correct identificationof 88 of the 148 test compounds (59%). 129 of the 148 test compounds(87%) were ranked within the top 20 candidates. These identificationrates are the highest yet reported when candidate structures are usedfrom PubChem. Combining experimental and computational MS/MS and IRspectral data is a potentially powerful option for prioritizing candidatesfor final structure verification.
引用
收藏
页码:11901 / 11907
页数:7
相关论文
共 50 条
  • [1] High-Throughput Non-targeted Chemical Structure Identification Using Gas-Phase Infrared Spectra
    Karunaratne, Erandika
    Hill, Dennis W.
    Pracht, Philipp
    Gascon, Jose A.
    Grimme, Stefan
    Grant, David F.
    ANALYTICAL CHEMISTRY, 2021, 93 (30) : 10688 - 10696
  • [2] Combining High-Throughput Synthesis and High-Throughput Protein Crystallography for Accelerated Hit Identification
    Sutanto, Fandi
    Shaabani, Shabnam
    Oerlemans, Rick
    Eris, Deniz
    Patil, Pravin
    Hadian, Mojgan
    Wang, Meitian
    Sharpe, May Elizabeth
    Groves, Matthew R.
    Domling, Alexander
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2021, 60 (33) : 18231 - 18239
  • [3] Nontargeted in vitro metabolomics for high-throughput identification of novel enzymes in Escherichia coli
    Sevin, Daniel C.
    Fuhrer, Tobias
    Zamboni, Nicola
    Sauer, Uwe
    NATURE METHODS, 2017, 14 (02) : 187 - 194
  • [4] Nontargeted in vitro metabolomics for high-throughput identification of novel enzymes in Escherichia coli
    Sévin D.C.
    Fuhrer T.
    Zamboni N.
    Sauer U.
    Nature Methods, 2017, 14 (2) : 187 - 194
  • [5] Phenyx: Combining high-throughput and pertinence in protein identification
    Masselot, Alexandre
    Binz, Pierre-Alain
    Cambria, Lorenzo
    Appel, Ron D.
    MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (10) : S257 - S257
  • [6] MetaUniDec: High-Throughput Deconvolution of Native Mass Spectra
    Reid, Deseree J.
    Diesing, Jessica M.
    Miller, Matthew A.
    Perry, Scott M.
    Wales, Jessica A.
    Montfort, William R.
    Marty, Michael T.
    JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2019, 30 (01) : 118 - 127
  • [7] High-throughput computational and experimental techniques in structural genomics
    Chance, MR
    Fiser, A
    Sali, A
    Pieper, U
    Eswar, N
    Xu, GP
    Fajardo, JE
    Radhakannan, T
    Marinkovic, N
    GENOME RESEARCH, 2004, 14 (10B) : 2145 - 2154
  • [8] High-throughput, nontargeted metabolite fingerprinting using nominal mass flow injection electrospray mass spectrometry
    Manfred Beckmann
    David Parker
    David P Enot
    Emilie Duval
    John Draper
    Nature Protocols, 2008, 3 : 486 - 504
  • [9] High-throughput, nontargeted metabolite fingerprinting using nominal mass flow injection electrospray mass spectrometry
    Beckmann, Manfred
    Parker, David
    Enot, David P.
    Duval, Emilie
    Draper, John
    NATURE PROTOCOLS, 2008, 3 (03) : 486 - 504
  • [10] High-throughput experimental and computational technologies at the National Center for Computational Toxicology
    Williams, Antony
    Wambaugh, John
    Houck, Keith
    Judson, Richard
    Paul-Friedman, Katie
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258