Identifying differences in protein expression levels by spectral counting and feature selection

被引:58
|
作者
Carvalho, P. C. [1 ]
Hewel, J. [2 ]
Barbosa, V. C. [1 ]
Yates, J. R., III [2 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, Programa Engn Sistemas & Computacao, BR-21945 Rio De Janeiro, Brazil
[2] Scripps Res Inst, Dept Cell Biol, La Jolla, CA USA
来源
GENETICS AND MOLECULAR RESEARCH | 2008年 / 7卷 / 02期
关键词
MudPIT; feature selection; support vector machine; spectral counting; feature ranking;
D O I
10.4238/vol7-2gmr426
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Spectral counting is a strategy to quantify relative protein concentrations in pre-digested protein mixtures analyzed by liquid chromatography online with tandem mass spectrometry. In the present study, we used combinations of normalization and statistical (feature selection) methods on spectral counting data to verify whether we could pinpoint which and how many proteins were differentially expressed when comparing complex protein mixtures. These combinations were evaluated on real, but controlled, experiments (yeast lysates were spiked with protein markers at different concentrations to simulate differences), which were therefore verifiable. The following normalization methods were applied: total signal, Z-normalization, hybrid normalization, and log preprocessing. The feature selection methods were: the Golub index, the Student t-test, a strategy based on the weighting used in a forward-support vector machine (SVM-F) model, and SVM recursive feature elimination. The results showed that Z-normalization combined with SVM-F correctly identified which and how many protein markers were added to the yeast lysates for all different concentrations. The software we used is available at http://pcarvalho.com/patternlab.
引用
收藏
页码:342 / 356
页数:15
相关论文
共 50 条
  • [41] Balanced Spectral Clustering Algorithm Based on Feature Selection
    Luo, Qimin
    Lu, Guangquan
    Wen, Guoqiu
    Su, Zidong
    Liu, Xingyi
    Wei, Jian
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT II, 2022, 13088 : 356 - 367
  • [42] Spectral Feature Selection Optimization for Water Quality Estimation
    Van Nguyen, Manh
    Lin, Chao-Hung
    Chu, Hone-Jay
    Muhamad Jaelani, Lalu
    Aldila Syariz, Muhammad
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (01)
  • [43] Spectral Clustering Based Unsupervised Feature Selection Algorithms
    Xie J.-Y.
    Ding L.-J.
    Wang M.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1009 - 1024
  • [44] Discernible neighborhood counting based incremental feature selection for heterogeneous data
    Yanyan Yang
    Shiji Song
    Degang Chen
    Xiao Zhang
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 1115 - 1127
  • [45] Discernible neighborhood counting based incremental feature selection for heterogeneous data
    Yang, Yanyan
    Song, Shiji
    Chen, Degang
    Zhang, Xiao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (05) : 1115 - 1127
  • [46] Unsupervised feature selection with joint self-expression and spectral analysis via adaptive graph constraints
    Mengbo You
    Lujie Ban
    Yuhan Wang
    Juan Kang
    Guorui Wang
    Aihong Yuan
    Multimedia Tools and Applications, 2023, 82 : 5879 - 5898
  • [47] Unsupervised feature selection with joint self-expression and spectral analysis via adaptive graph constraints
    You, Mengbo
    Ban, Lujie
    Wang, Yuhan
    Kang, Juan
    Wang, Guorui
    Yuan, Aihong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 5879 - 5898
  • [48] FEATURE SELECTION FOR IDENTIFYING SKILLS AND PERSONAL AFFINITIES IN TUTORS AND STUDENTS
    Urbina-Najera, Argelia B.
    de la Calleja, Jorge
    Auxilio Medina, Ma.
    ADVED 15: INTERNATIONAL CONFERENCE ON ADVANCES IN EDUCATION AND SOCIAL SCIENCES, 2015, : 719 - 724
  • [49] A novel feature extraction method for identifying quality seed selection
    Suganthi, M.
    Sathiaseelan, G. R.
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2022, 10 (05) : 359 - 378
  • [50] A model for identifying neuropeptides by feature selection based on hybrid features
    Zhou F.-F.
    Yan Z.-W.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2023, 53 (11): : 3238 - 3245