Identifying differences in protein expression levels by spectral counting and feature selection

被引:58
|
作者
Carvalho, P. C. [1 ]
Hewel, J. [2 ]
Barbosa, V. C. [1 ]
Yates, J. R., III [2 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, Programa Engn Sistemas & Computacao, BR-21945 Rio De Janeiro, Brazil
[2] Scripps Res Inst, Dept Cell Biol, La Jolla, CA USA
来源
GENETICS AND MOLECULAR RESEARCH | 2008年 / 7卷 / 02期
关键词
MudPIT; feature selection; support vector machine; spectral counting; feature ranking;
D O I
10.4238/vol7-2gmr426
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Spectral counting is a strategy to quantify relative protein concentrations in pre-digested protein mixtures analyzed by liquid chromatography online with tandem mass spectrometry. In the present study, we used combinations of normalization and statistical (feature selection) methods on spectral counting data to verify whether we could pinpoint which and how many proteins were differentially expressed when comparing complex protein mixtures. These combinations were evaluated on real, but controlled, experiments (yeast lysates were spiked with protein markers at different concentrations to simulate differences), which were therefore verifiable. The following normalization methods were applied: total signal, Z-normalization, hybrid normalization, and log preprocessing. The feature selection methods were: the Golub index, the Student t-test, a strategy based on the weighting used in a forward-support vector machine (SVM-F) model, and SVM recursive feature elimination. The results showed that Z-normalization combined with SVM-F correctly identified which and how many protein markers were added to the yeast lysates for all different concentrations. The software we used is available at http://pcarvalho.com/patternlab.
引用
收藏
页码:342 / 356
页数:15
相关论文
共 50 条
  • [1] Feature Selection and Spectral Indices for Identifying Maize Stress Types
    Li, Yanru
    Yang, Keming
    Wu, Bing
    APPLIED SPECTROSCOPY, 2025, 79 (02) : 306 - 319
  • [2] FEATURE SELECTION FOR IDENTIFYING PROTEIN-DISORDERED REGIONS
    Hsu, Hui-Huang
    Hsieh, Cheng-Wei
    BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2010, 22 (02): : 119 - 125
  • [3] Balanced Spectral Feature Selection
    Zhou, Peng
    Chen, Jiangyong
    Du, Liang
    Li, Xuejun
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (07) : 4232 - 4244
  • [4] Mutual information for feature selection: estimation or counting?
    Nguyen H.B.
    Xue B.
    Andreae P.
    Evolutionary Intelligence, 2016, 9 (3) : 95 - 110
  • [5] Two-Step Feature Selection for Identifying Developmental Differences in Resting fMRI Intrinsic Connectivity Networks
    Qiao, Chen
    Gao, Bin
    Lu, Lu-Jia
    Calhoun, Vince D.
    Wang, Yu-Ping
    APPLIED SCIENCES-BASEL, 2019, 9 (20):
  • [6] Feature selection for spectral sensors with overlapping noisy spectral bands
    Paskaleva, Biliana
    Hayat, Majeed M.
    Tyo, J. Scott
    Wang, Zhipeng
    Martinez, Monica
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XII PTS 1 AND 2, 2006, 6233
  • [7] Identifying Diagnostic Biomarkers of Breast Cancer Based on Gene Expression Data and Ensemble Feature Selection
    Li, Lingyu
    Algabri, Yousif A.
    Liu, Zhi-Ping
    CURRENT BIOINFORMATICS, 2023, 18 (03) : 232 - 246
  • [8] DIFFERENCES BETWEEN COUNTING AND NON-COUNTING DIAMONDS .1. COUNTING PROPERTIES AND SPECTRAL TRANSMISSION
    FREEMAN, GP
    VANDERVELDEN, HA
    PHYSICA, 1952, 18 (01): : 1 - 8
  • [9] Unsupervised Spectral Feature Selection for Face Recognition
    Zhang, Zhihong
    Hancock, Edwin R.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1787 - 1790
  • [10] Dynamic graph learning for spectral feature selection
    Wei Zheng
    Xiaofeng Zhu
    Yonghua Zhu
    Rongyao Hu
    Cong Lei
    Multimedia Tools and Applications, 2018, 77 : 29739 - 29755