Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for rnetabolonnics

被引:16
|
作者
Mitchell, Joshua M. [1 ]
Fan, Teresa W. -M. [1 ]
Lane, Andrew N. [1 ]
Moseley, Hunter N. B. [1 ]
机构
[1] Univ Kentucky, Dept Mol & Cellular Biochem, Lucille P Markey Canc Ctr, Lexington, KY 40536 USA
来源
FRONTIERS IN GENETICS | 2014年 / 5卷
基金
美国国家科学基金会;
关键词
ISOTOPE-RESOLVED METABOLOMICS; MASS-SPECTROMETRY; SUBGRAPH ISOMORPHISM; LUNG-CANCER; DATABASE; PATHWAYS; NMR; QUANTIFICATION; STRATEGIES; HMDB;
D O I
10.3389/fgene.2014.00237
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Large-scale identification of metabolites is key to elucidating and modeling metabolism at the systems level. Advances in metabolomics technologies, particularly ultra-high resolution mass spectrometry (MS) enable comprehensive and rapid analysis of metabolites. However, a significant barrier to meaningful data interpretation is the identification of a wide range of metabolites including unknowns and the determination of their role(s) in various metabolic networks. Chemoselective (CS) probes to tag metabolite functional groups combined with high mass accuracy provide additional structural constraints for metabolite identification and quantification. We have developed a novel algorithm, Chemically Aware Substructure Search (CASS) that efficiently detects functional groups within existing metabolite databases, allowing for combined molecular formula and functional group (from CS tagging) queries to aid in metabolite identification without a priori knowledge. Analysis of the isomeric compounds in both Human Metabolome Database (HMDB) and KEGG Ligand demonstrated a high percentage of isomeric molecular formulae (43 and 28%, respectively), indicating the necessity for techniques such as CS-tagging. Furthermore, these two databases have only moderate overlap in molecular formulae. Thus, it is prudent to use multiple databases in metabolite assignment, since each major metabolite database represents different portions of metabolism within the biosphere. In silico analysis of various CS-tagging strategies under different conditions for adduct formation demonstrate that combined FT-MS derived molecular formulae and CS-tagging can uniquely identify up to 71% of KEGG and 37% of the combined KEGG/HMDB database vs. 41 and 17%, respectively without adduct formation. This difference between database isomer disambiguation highlights the strength of CS-tagging for non-lipid metabolite identification. However, unique identification of complex lipids still needs additional information.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Development and in silico Evaluation of Large-Scale Metabolite Identification Methods Using Functional Group Detection for Metabolomics
    Mitchell, Joshua
    Fan, Teresa
    Lane, Andrew
    Moseley, Hunter
    FASEB JOURNAL, 2015, 29
  • [2] Development and in silico evaluation of large-scale metabolite identifcation methods using functional group detection for metabolomics
    Mitchell, Joshua
    Fan, Teresa
    Lane, Andrew
    Moseley, Hunter
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 249
  • [3] Development of large-scale metabolite identification methods for metabolomics
    Joshua M Mitchell
    Teresa W-M Fan
    Andrew N Lane
    Hunter NB Moseley
    BMC Bioinformatics, 15 (Suppl 10)
  • [4] Development of large-scale metabolite identification methods for metabolomics
    Mitchell, Joshua M.
    Fan, Teresa W-M
    Lane, Andrew N.
    Moseley, Hunter N. B.
    BMC BIOINFORMATICS, 2014, 15
  • [5] Development of large-scale metabolite identification methods for metabolomics
    Moseley, Hunter N.B. (hunter.moseley@uky.edu), 1600, BioMed Central Ltd. (15):
  • [6] Development of large-scale metabolite identification methods for metabolomics
    Mitchell, Joshua M.
    Fan, Teresa W-M
    Lane, Andrew N.
    Moseley, Hunter N. B.
    BMC BIOINFORMATICS, 2014, 15
  • [7] Large-scale identification of polymorphic microsatellites using an in silico approach
    Tang, Jifeng
    Baldwin, Samantha J.
    Jacobs, Jeanne M. E.
    van der Linden, C. Gerard
    Voorrips, Roeland E.
    Leunissen, Jack A. M.
    van Eck, Herman
    Vosman, Ben
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [8] Large-scale identification of polymorphic microsatellites using an in silico approach
    Jifeng Tang
    Samantha J Baldwin
    Jeanne ME Jacobs
    C Gerard van der Linden
    Roeland E Voorrips
    Jack AM Leunissen
    Herman van Eck
    Ben Vosman
    BMC Bioinformatics, 9
  • [9] Identification of large-scale genomic variation in cancer genomes using in silico reference models
    Killcoyne, Sarah
    del Sol, Antonio
    NUCLEIC ACIDS RESEARCH, 2016, 44 (01) : e5
  • [10] A Large-scale, multicenter serum metabolite biomarker identification study for the early detection of hepatocellular carcinoma
    Luo, Ping
    Yin, Peiyuan
    Hua, Rui
    Tan, Yexiong
    Li, Zaifang
    Qiu, Gaokun
    Yin, Zhenyu
    Xie, Xingwang
    Wang, Xiaomei
    Chen, Wenbin
    Zhou, Lina
    Wang, Xiaolin
    Li, Yanli
    Chen, Hongsong
    Gao, Ling
    Lu, Xin
    Wu, Tangchun
    Wang, Hongyang
    Niu, Junqi
    Xu, Guowang
    HEPATOLOGY, 2018, 67 (02) : 662 - 675