Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for rnetabolonnics

被引：16

作者：

Mitchell, Joshua M. ^{[1
]}

Fan, Teresa W. -M. ^{[1
]}

Lane, Andrew N. ^{[1
]}

Moseley, Hunter N. B. ^{[1
]}

机构：

[1] Univ Kentucky, Dept Mol & Cellular Biochem, Lucille P Markey Canc Ctr, Lexington, KY 40536 USA

来源：

FRONTIERS IN GENETICS | 2014年 / 5卷

基金：

美国国家科学基金会;

关键词：

ISOTOPE-RESOLVED METABOLOMICS; MASS-SPECTROMETRY; SUBGRAPH ISOMORPHISM; LUNG-CANCER; DATABASE; PATHWAYS; NMR; QUANTIFICATION; STRATEGIES; HMDB;

D O I：

10.3389/fgene.2014.00237

中图分类号：

Q3 [遗传学];

学科分类号：

071007 ; 090102 ;

摘要：

Large-scale identification of metabolites is key to elucidating and modeling metabolism at the systems level. Advances in metabolomics technologies, particularly ultra-high resolution mass spectrometry (MS) enable comprehensive and rapid analysis of metabolites. However, a significant barrier to meaningful data interpretation is the identification of a wide range of metabolites including unknowns and the determination of their role(s) in various metabolic networks. Chemoselective (CS) probes to tag metabolite functional groups combined with high mass accuracy provide additional structural constraints for metabolite identification and quantification. We have developed a novel algorithm, Chemically Aware Substructure Search (CASS) that efficiently detects functional groups within existing metabolite databases, allowing for combined molecular formula and functional group (from CS tagging) queries to aid in metabolite identification without a priori knowledge. Analysis of the isomeric compounds in both Human Metabolome Database (HMDB) and KEGG Ligand demonstrated a high percentage of isomeric molecular formulae (43 and 28%, respectively), indicating the necessity for techniques such as CS-tagging. Furthermore, these two databases have only moderate overlap in molecular formulae. Thus, it is prudent to use multiple databases in metabolite assignment, since each major metabolite database represents different portions of metabolism within the biosphere. In silico analysis of various CS-tagging strategies under different conditions for adduct formation demonstrate that combined FT-MS derived molecular formulae and CS-tagging can uniquely identify up to 71% of KEGG and 37% of the combined KEGG/HMDB database vs. 41 and 17%, respectively without adduct formation. This difference between database isomer disambiguation highlights the strength of CS-tagging for non-lipid metabolite identification. However, unique identification of complex lipids still needs additional information.

引用

页数：18

共 50 条

[41] Structural and functional analytics for community detection in large-scale complex networks
Chopade P.
Zhan J.
Journal of Big Data, 2 (1)
[42] Development of a large-scale TEG for evaluation and analysis of yield and variation
Yamamoto, M
Endo, H
Masuda, H
IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2004, 17 (02) : 111 - 122
[43] Large-scale functional analysis using peptide or protein arrays
Emili, AQ
Cagney, G
NATURE BIOTECHNOLOGY, 2000, 18 (04) : 393 - 397
[44] Large-scale functional analysis using peptide or protein arrays
Alia Qureshi Emili
Gerard Cagney
Nature Biotechnology, 2000, 18 : 393 - 397
[45] Large-scale SOP minimization using decomposition and functional properties
Mishchenko, A
Sasao, T
40TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2003, 2003, : 149 - 154
[46] Development of a large-scale TEG for evaluation and analysis of yield and variation
Yamamoto, M
Endo, H
Masuda, H
ICMTS 2003: PROCEEDINGS OF THE 2003 INTERNATIONAL CONFERENCE ON MICROELECTRONIC TEST STRUCTURES, 2003, : 53 - 58
[47] Large-Scale Mobile App Identification Using Deep Learning
Rezaei, Shahbaz
Kroencke, Bryce
Liu, Xin
IEEE ACCESS, 2020, 8 : 348 - 362
[48] DECOMPOSITION METHODS USING COMPOUND PROPOSALS FOR LARGE-SCALE OPTIMIZATION
KRIVONOZHKO, VE
LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1992, 180 : 231 - 240
[49] IDENTIFICATION OF AN ACTIVE GENE BY USING LARGE-SCALE CDNA SEQUENCING
ITOH, K
MATSUBARA, K
OKUBO, K
GENE, 1994, 140 (02) : 295 - 296
[50] Accelerating fingerprint identification using FPGA for large-scale applications
Shafiq, Mohsin
Taj, Imtiaz A.
Ghafoor, Mubeen
Tariq, Syed Ali
Abbas, Assad
Zomaya, Albert Y.
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 141 : 35 - 48

← 1 2 3 4 5 →