Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for rnetabolonnics

被引:16
|
作者
Mitchell, Joshua M. [1 ]
Fan, Teresa W. -M. [1 ]
Lane, Andrew N. [1 ]
Moseley, Hunter N. B. [1 ]
机构
[1] Univ Kentucky, Dept Mol & Cellular Biochem, Lucille P Markey Canc Ctr, Lexington, KY 40536 USA
来源
FRONTIERS IN GENETICS | 2014年 / 5卷
基金
美国国家科学基金会;
关键词
ISOTOPE-RESOLVED METABOLOMICS; MASS-SPECTROMETRY; SUBGRAPH ISOMORPHISM; LUNG-CANCER; DATABASE; PATHWAYS; NMR; QUANTIFICATION; STRATEGIES; HMDB;
D O I
10.3389/fgene.2014.00237
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Large-scale identification of metabolites is key to elucidating and modeling metabolism at the systems level. Advances in metabolomics technologies, particularly ultra-high resolution mass spectrometry (MS) enable comprehensive and rapid analysis of metabolites. However, a significant barrier to meaningful data interpretation is the identification of a wide range of metabolites including unknowns and the determination of their role(s) in various metabolic networks. Chemoselective (CS) probes to tag metabolite functional groups combined with high mass accuracy provide additional structural constraints for metabolite identification and quantification. We have developed a novel algorithm, Chemically Aware Substructure Search (CASS) that efficiently detects functional groups within existing metabolite databases, allowing for combined molecular formula and functional group (from CS tagging) queries to aid in metabolite identification without a priori knowledge. Analysis of the isomeric compounds in both Human Metabolome Database (HMDB) and KEGG Ligand demonstrated a high percentage of isomeric molecular formulae (43 and 28%, respectively), indicating the necessity for techniques such as CS-tagging. Furthermore, these two databases have only moderate overlap in molecular formulae. Thus, it is prudent to use multiple databases in metabolite assignment, since each major metabolite database represents different portions of metabolism within the biosphere. In silico analysis of various CS-tagging strategies under different conditions for adduct formation demonstrate that combined FT-MS derived molecular formulae and CS-tagging can uniquely identify up to 71% of KEGG and 37% of the combined KEGG/HMDB database vs. 41 and 17%, respectively without adduct formation. This difference between database isomer disambiguation highlights the strength of CS-tagging for non-lipid metabolite identification. However, unique identification of complex lipids still needs additional information.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Large-scale protein identification using mass spectrometry
    Lin, D
    Tabb, DL
    Yates, JR
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2003, 1646 (1-2): : 1 - 10
  • [22] Large-Scale Data Analysis Using Heuristic Methods
    Dzemyda, Gintautas
    Sakalauskas, Leonidas
    INFORMATICA, 2011, 22 (01) : 1 - 10
  • [23] Large-Scale System Identification Using a Randomized SVD
    Wang, Han
    Anderson, James
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2178 - 2185
  • [24] An Evaluation of Large-scale Methods for Image Instance and Class Discovery
    Douze, Matthijs
    Jegou, Herve
    Johnson, Jeff
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 1 - 9
  • [25] Large-scale in silico identification of drugs exerting sex-specific effects in the heart
    Changting Cui
    Chuanbo Huang
    Kejia Liu
    Guoheng Xu
    Jichun Yang
    Yong Zhou
    Yingmei Feng
    Georgios Kararigas
    Bin Geng
    Qinghua Cui
    Journal of Translational Medicine, 16
  • [26] Large-scale in silico identification of drugs exerting sex-specific effects in the heart
    Cui, Changting
    Huang, Chuanbo
    Liu, Kejia
    Xu, Guoheng
    Yang, Jichun
    Zhou, Yong
    Feng, Yingmei
    Kararigas, Georgios
    Geng, Bin
    Cui, Qinghua
    JOURNAL OF TRANSLATIONAL MEDICINE, 2018, 16
  • [27] Development and Evaluation of Geostatistical Methods for Estimating Weather Related Collisions: A Large-Scale Case Study
    Wong, Andy H.
    Kwon, Tae J.
    TRANSPORTATION RESEARCH RECORD, 2021, 2675 (11) : 828 - 840
  • [28] Functional Development of Large-Scale Sensorimotor Cortical Networks in the Brain
    Quairiaux, Charles
    Megevand, Pierre
    Kiss, Jozsef Z.
    Michel, Christoph M.
    JOURNAL OF NEUROSCIENCE, 2011, 31 (26): : 9574 - 9584
  • [29] Development of large-scale mosquito densovirus production by in vivo methods
    Sun, Yan
    Dong, Yunqiao
    Li, Jing
    Lai, Zetian
    Hao, Yanqiang
    Liu, Peiwen
    Chen, Xiaoguang
    Gu, Jinbao
    PARASITES & VECTORS, 2019, 12 (1)
  • [30] Development of large-scale mosquito densovirus production by in vivo methods
    Yan Sun
    Yunqiao Dong
    Jing Li
    Zetian Lai
    Yanqiang Hao
    Peiwen Liu
    Xiaoguang Chen
    Jinbao Gu
    Parasites & Vectors, 12