Understanding the improved sensitivity of spectral library searching over sequence database searching in proteomics data analysis

Cited by: 64
Authors
Zhang, Xin [1]
Li, Yunzi [1]
Shao, Wenguang [1]
Lam, Henry [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Chem & Biomol Engn, Clear Water Bay, Hong Kong, Peoples R China
Keywords
Bioinformatics; Sequence searching; Spectral library; Spectral searching; INDUCED DISSOCIATION SPECTRA; PEPTIDE IDENTIFICATION; PROTEIN IDENTIFICATION; MS/MS SPECTRA; TANDEM; VALIDATION; PREDICTION; STRATEGY;
DOI
10.1002/pmic.201000492
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Spectral library searching has been recently proposed as an alternative to sequence database searching for peptide identification from MS/MS. We performed a systematic comparison between spectral library searching and sequence database searching using a wide variety of data to better demonstrate, and understand, the superior sensitivity of the former observed in preliminary studies. By decoupling the effect of search space, we demonstrated that the success of spectral library searching is primarily attributable to the use of real library spectra for matching, without which the sensitivity advantage largely disappears. We further determined the extent to which the use of real peak intensities and non-canonical fragments, both under-utilized information in sequence database searching, contributes to the sensitivity advantage. Lastly, we showed that spectral library searching is disproportionately more successful in identifying low-quality spectra, and complex spectra of higher-charged precursors, both important frontiers in peptide sequencing. Our results answered important outstanding questions about this promising yet unproven method using well-controlled computational experiments and sound statistical approaches.
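As a rough illustration of why real peak intensities can matter, the sketch below scores a query spectrum against (a) a library spectrum with observed intensities and (b) a theoretical spectrum with the same fragment m/z values but uniform intensities. This is not the scoring used in the paper. The normalized dot product, the 1.0 m/z bin width, the function names, and all peak values are assumptions chosen for illustration only.

```python
import math

def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into fixed-width m/z bins; returns {bin_index: intensity}."""
    binned = {}
    for mz, intensity in peaks:
        idx = int(mz / bin_width)
        binned[idx] = binned.get(idx, 0.0) + intensity
    return binned

def normalized_dot(a, b):
    """Cosine similarity between two binned spectra (0 = no shared bins, 1 = same shape)."""
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    shared = sum(v * b[k] for k, v in a.items() if k in b)
    return shared / (norm_a * norm_b)

# Hypothetical toy peaks as (m/z, intensity); not taken from any real data set.
query = [(175.1, 20.0), (262.2, 100.0), (375.3, 35.0), (488.3, 60.0)]
library = [(175.1, 25.0), (262.2, 95.0), (375.3, 30.0), (488.3, 65.0)]
# Theoretical spectrum: same fragment positions, but every peak has equal intensity.
theoretical = [(mz, 1.0) for mz, _ in library]

lib_score = normalized_dot(bin_spectrum(query), bin_spectrum(library))
theo_score = normalized_dot(bin_spectrum(query), bin_spectrum(theoretical))

print(f"library match:     {lib_score:.3f}")
print(f"theoretical match: {theo_score:.3f}")
```

In this toy case the library spectrum scores higher because its intensity pattern resembles the query, while the uniform-intensity spectrum only matches peak positions. Real search engines use more elaborate scoring, so the gap shown here is only indicative.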
Pages: 1075-1085
Number of pages: 11
Related papers
40 in total
  • [31] GlycoSLASH: Concurrent Glycopeptide Identification from Multiple Related LC-MS/MS Data Sets by Using Spectral Clustering and Library Searching
    Li, Sujun
    Zhu, Jianhui
    Lubman, David M.
    Zhou, He
    Tang, Haixu
    JOURNAL OF PROTEOME RESEARCH, 2023, 22 (05) : 1501 - 1509
  • [32] Pattern Recognition-Assisted Infrared Library Searching of the Paint Data Query Database to Enhance Lead Information from Automotive Paint Trace Evidence
    Lavine, Barry K.
    White, Collin G.
    Allen, Matthew D.
    Weakley, Andrew
    APPLIED SPECTROSCOPY, 2017, 71 (03) : 480 - 495
  • [33] MacroSEQUEST: Efficient Candidate-Centric Searching and High-Resolution Correlation Analysis for Large-Scale Proteomics Data Sets
    Faherty, Brendan K.
    Gerber, Scott A.
    ANALYTICAL CHEMISTRY, 2010, 82 (16) : 6821 - 6829
  • [34] Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures
    Li, Guo-Zhong
    Vissers, Johannes P. C.
    Silva, Jeffrey C.
    Golick, Dan
    Gorenstein, Marc V.
    Geromanos, Scott J.
    PROTEOMICS, 2009, 9 (06) : 1696 - 1719
  • [35] Identification of glycosylation sites on peptides and proteins by LC/ESI/MS/MS and cross-correlation of MS/MS data with protein sequence database searching
    Wheeler, K
    Chaudhary, T
    Land, A
    Mylchreest, I
    Sweeney, M
    GLYCOBIOLOGY, 1996, 6 (07) : 106 - 106
  • [36] Untargeted, spectral library-free analysis of data-independent acquisition proteomics data generated using Orbitrap mass spectrometers
    Tsou, Chih-Chiang
    Tsai, Chia-Feng
    Teo, Guo Ci
    Chen, Yu-Ju
    Nesvizhskii, Alexey I.
    PROTEOMICS, 2016, 16 (15-16) : 2257 - 2271
  • [37] An evaluation for cross-species proteomics research by publicly available expressed sequence tag database search using tandem mass spectral data
    Huang, Mei
    Chen, Tong
    Chan, ZhuLong
    RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2006, 20 (18) : 2635 - 2640
  • [38] Proteome Analysis of Sorangium cellulosum Employing 2D-HPLC-MS/MS and Improved Database Searching Strategies for CID and ETD Fragment Spectra
    Leinenbach, Andreas
    Hartmer, Ralf
    Lubeck, Markus
    Kneissl, Benny
    Elnakady, Yasser A.
    Baessmann, Carsten
    Mueller, Rolf
    Huber, Christian G.
    JOURNAL OF PROTEOME RESEARCH, 2009, 8 (09) : 4350 - 4361
  • [39] Rapid identification and screening of proteins from whole cell lysates of human erythroleukemia cells in the liquid phase, using non-porous reversed phase high-performance liquid chromatography separations of proteins followed by matrix-assisted laser desorption/ionization mass spectrometry analysis and sequence database searching
    Chen, YJ
    Wall, D
    Lubman, DM
    RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 1998, 12 (24) : 1994 - 2003
  • [40] Sensitivity of Band-Pass Filtered In Situ Low-Earth Orbit and Ground-Based Ionosphere Observations to Lithosphere-Atmosphere-Ionosphere Coupling Over the Aegean Sea: Spectral Analysis of Two-Year Ionospheric Data Series
    Jarmolowski, Wojciech
    Belehaki, Anna
    Wielgosz, Pawel
    SENSORS, 2024, 24 (23)