Comparative Shotgun Proteomics Using Spectral Count Data and Quasi-Likelihood Modeling

被引:85
|
作者
Li, Ming [2 ]
Gray, William [2 ]
Zhang, Haixia [3 ,4 ]
Chung, Christine H. [1 ,6 ]
Billheimer, Dean [5 ]
Yarbrough, Wendell G. [1 ,7 ]
Liebler, Daniel C. [3 ,4 ]
Shyr, Yu [2 ]
Slebos, Robbert J. C. [1 ,4 ]
机构
[1] Vanderbilt Univ, Dept Canc Biol, Sch Med, Nashville, TN 37232 USA
[2] Vanderbilt Univ, Dept Biostat, Sch Med, Nashville, TN 37232 USA
[3] Vanderbilt Univ, Dept Biochem, Sch Med, Nashville, TN 37232 USA
[4] Vanderbilt Univ, Sch Med, Jim Ayers Inst Precanc Detect & Diag, Nashville, TN 37232 USA
[5] Univ Arizona, Dept Agr & Biosyst Engn, Tucson, AZ 85721 USA
[6] Vanderbilt Univ, Sch Med, Dept Med, Nashville, TN 37232 USA
[7] Vanderbilt Univ, Sch Med, Dept Otolaryngol, Nashville, TN 37232 USA
基金
美国国家卫生研究院;
关键词
LC-MS/MS; shotgun proteomics; multiple reaction monitoring (MRM); head and neck carcinoma; Generalized Linear Model; spectral counting; ACID-BINDING-PROTEIN; GENE-EXPRESSION PATTERNS; PEPTIDE IDENTIFICATION; MASS-SPECTROMETRY; IN-VIVO; DISCOVERY; TISSUES; ABUNDANCE; BETA; SKIN;
D O I
10.1021/pr100527g
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Shotgun proteomics provides the most powerful analytical platform for global inventory of complex proteomes using liquid chromatography-tandem mass spectrometry (LC-MS/MS) and allows a global analysis of protein changes. Nevertheless, sampling of complex proteomes by current shotgun proteomics platforms is incomplete, and this contributes to variability in assessment of peptide and protein inventories by spectral counting approaches. Thus, shotgun proteomics data pose challenges in comparing proteomes from different biological states. We developed an analysis strategy using quasi-likelihood Generalized Linear Modeling (GLM), included in a graphical interface software package (Quasi-Tel) that reads standard output from protein assemblies created by IDPicker, an HTML-based user interface to query shotgun proteomic data sets. This approach was compared to four other statistical analysis strategies: Student t test, Wilcoxon rank test, Fisher's Exact test, and Poisson-based GLM. We analyzed the performance of these tests to identify differences in protein levels based on spectral counts in a shotgun data set in which equimolar amounts of 48 human proteins were spiked at different levels into whole yeast lysates. Both GLM approaches and the Fisher Exact test performed adequately, each with their unique limitations. We subsequently compared the proteomes of normal tonsil epithelium and HNSCC using this approach and identified 86 proteins with differential spectral counts between normal tonsil epithelium and HNSCC. We selected 18 proteins from this comparison for verification of protein levels between the individual normal and tumor tissues using liquid chromatography- multiple reaction monitoring mass spectrometry (LC-MRM-MS). This analysis confirmed the magnitude and direction of the protein expression differences in all 6 proteins for which reliable data could be obtained. Our analysis demonstrates that shotgun proteomic data sets from different tissue phenotypes are sufficiently rich in quantitative information and that statistically significant differences in proteins spectral counts reflect the underlying biology of the samples.
引用
收藏
页码:4295 / 4305
页数:11
相关论文
共 50 条
  • [1] Comparative Shotgun Proteomics Using Spectral Count Data and Quasi-Likelihood Modeling (vol 9, pg 4295, 2010)
    Li, Ming
    Gray, William
    Zhang, Haixia
    Chung, Christine H.
    Billheimer, Dean
    Yarbrough, Wendell G.
    Liebler, Daniel C.
    Shyr, Yu
    Slebos, Robbert J. C.
    JOURNAL OF PROTEOME RESEARCH, 2010, 9 (11) : 6090 - 6090
  • [2] A Bayesian Mixture Model for Comparative Spectral Count Data in Shotgun Proteomics
    Booth, James G.
    Eilertson, Kirsten E.
    Olinares, Paul Dominic B.
    Yu, Haiyuan
    MOLECULAR & CELLULAR PROTEOMICS, 2011, 10 (08)
  • [3] A flexible quasi-likelihood model for microbiome abundance count data
    Shi, Yiming
    Li, Huilin
    Wang, Chan
    Chen, Jun
    Jiang, Hongmei
    Shih, Ya-Chen T.
    Zhang, Haixiang
    Song, Yizhe
    Feng, Yang
    Liu, Lei
    STATISTICS IN MEDICINE, 2023, 42 (25) : 4632 - 4643
  • [4] Reliability prediction through degradation data modeling using a quasi-likelihood approach
    Jayaram, JSR
    Girish, T
    ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2005 PROCEEDINGS, 2005, : 193 - 199
  • [5] Significance Analysis of Spectral Count Data in Label-free Shotgun Proteomics
    Choi, Hyungwon
    Fermin, Damian
    Nesvizhskii, Alexey I.
    MOLECULAR & CELLULAR PROTEOMICS, 2008, 7 (12) : 2373 - 2385
  • [6] Nonparametric quasi-likelihood for right censored data
    Lili Yu
    Lifetime Data Analysis, 2011, 17 : 594 - 607
  • [7] Penalized quasi-likelihood with spatially correlated data
    Dean, CB
    Ugarte, MD
    Militino, AF
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 45 (02) : 235 - 248
  • [8] Nonparametric quasi-likelihood for right censored data
    Yu, Lili
    LIFETIME DATA ANALYSIS, 2011, 17 (04) : 594 - 607
  • [9] Semiparametric quasi-likelihood estimation with missing data
    Bravo, Francesco
    Jacho-Chavez, David T.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (05) : 1345 - 1369
  • [10] Estimating intraclass correlation for binary data using extended quasi-likelihood
    Lee, Y
    STATISTICAL MODELLING, 2004, 4 (02) : 113 - 126