Identifying differences in protein expression levels by spectral counting and feature selection

Cited by: 58
Authors
Carvalho, P. C. [1 ]
Hewel, J. [2 ]
Barbosa, V. C. [1 ]
Yates, J. R., III [2 ]
Affiliations
[1] Univ Fed Rio de Janeiro, COPPE, Programa Engn Sistemas & Computacao, BR-21945 Rio De Janeiro, Brazil
[2] Scripps Res Inst, Dept Cell Biol, La Jolla, CA USA
Source
GENETICS AND MOLECULAR RESEARCH | 2008 / Vol. 7 / Issue 02
Keywords
MudPIT; feature selection; support vector machine; spectral counting; feature ranking
DOI
10.4238/vol7-2gmr426
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Spectral counting is a strategy to quantify relative protein concentrations in pre-digested protein mixtures analyzed by liquid chromatography online with tandem mass spectrometry. In the present study, we used combinations of normalization and statistical (feature selection) methods on spectral counting data to verify whether we could pinpoint which and how many proteins were differentially expressed when comparing complex protein mixtures. These combinations were evaluated on real, but controlled, experiments (yeast lysates were spiked with protein markers at different concentrations to simulate differences), which were therefore verifiable. The following normalization methods were applied: total signal, Z-normalization, hybrid normalization, and log preprocessing. The feature selection methods were: the Golub index, the Student t-test, a strategy based on the weighting used in a forward-support vector machine (SVM-F) model, and SVM recursive feature elimination. The results showed that Z-normalization combined with SVM-F correctly identified which and how many protein markers were added to the yeast lysates for all different concentrations. The software we used is available at http://pcarvalho.com/patternlab.
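The abstract names its normalization and ranking methods without defining them, so the following is a minimal sketch of three of them under stated assumptions, not the PatternLab implementation. It assumes a spectral-count matrix with one row per sample and one column per protein, that total signal normalization divides each sample by its total count, that Z-normalization standardizes each sample to zero mean and unit variance, and that the Golub index is the usual signal-to-noise ratio (difference of class means over the sum of class standard deviations). The function names and the toy counts are hypothetical.

```python
from statistics import mean, pstdev, stdev

def total_signal(counts):
    """Divide each sample's counts by that sample's total (rows sum to 1)."""
    return [[c / sum(row) for c in row] for row in counts]

def z_normalize(counts):
    """Standardize each sample (row) to zero mean and unit variance.
    Assumption: normalization is applied per sample, not per protein."""
    out = []
    for row in counts:
        mu, sd = mean(row), pstdev(row)
        out.append([(c - mu) / sd if sd > 0 else 0.0 for c in row])
    return out

def golub_index(class_a, class_b):
    """Per-protein signal-to-noise score: (mean_a - mean_b) / (std_a + std_b).
    Needs at least two samples per class for the sample standard deviation."""
    scores = []
    for col_a, col_b in zip(zip(*class_a), zip(*class_b)):
        diff = mean(col_a) - mean(col_b)
        denom = stdev(col_a) + stdev(col_b)
        if denom > 0:
            scores.append(diff / denom)
        else:
            # No within-class spread: score is unbounded unless the means agree.
            scores.append(0.0 if diff == 0 else float("inf") * (1 if diff > 0 else -1))
    return scores

# Hypothetical toy data: 3 replicates per condition, 4 proteins;
# the protein at column index 2 is "spiked" in the second condition.
control = [[10, 20, 5, 15], [12, 18, 6, 14], [11, 19, 4, 16]]
spiked = [[10, 20, 15, 15], [12, 18, 17, 14], [11, 19, 16, 16]]

norm_control = total_signal(control)
norm_spiked = total_signal(spiked)
scores = golub_index(norm_control, norm_spiked)

# Rank proteins by absolute score, most discriminative first.
ranked = sorted(range(len(scores)), key=lambda j: -abs(scores[j]))
print("ranking (column indices):", ranked)
```

Ranking by absolute score treats increases and decreases in abundance alike. Deciding how many of the top-ranked proteins are truly differential is the harder question the study addresses, and this sketch does not attempt it.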
Pages: 342-356
Page count: 15