Improving protein-protein interaction prediction based on phylogenetic information using a least-squares support vector machine

被引:9
|
作者
Craig, Roger A. [1 ]
Liao, Li [1 ]
机构
[1] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
关键词
protein-protein interaction; phylogenetic vectors; least-squares support vector machines;
D O I
10.1196/annals.1407.005
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Predicting protein-protein interactions has become a key step of reverse-engineering biological networks to better understand cellular functions. The experimental methods in determining protein-protein interactions are time-consuming and costly, which has motivated vigorous development of computational approaches for predicting protein-protein interactions. A set of recently developed bioinformatics methods utilizes coevolutionary information of the interacting partners (e.g., as exhibited in the form of correlations between distance matrices, where, for each protein, a matrix stores the pair-wise distances between the protein and its orthologs in a group of reference genomes). We proposed a novel method to account for the intra-matrix correlations in improving predictive accuracy. The distance matrices for a pair of proteins are transformed and concatenated into a phylogenetic vector. A least-squares support vector machine is trained and tested on pairs of proteins, represented as phylogenetic vectors, whose interactions are known. The intra-matrix correlations are accounted for by introducing a weighted linear kernel, which determines the dot product of two phylogenetic vectors. The performance, measured as receiver operator characteristic (ROC) score in cross-validation experiments, shows significant improvement of our method (ROC score 0.928) over that obtained by Pearson correlations (0.659).
引用
收藏
页码:154 / 167
页数:14
相关论文
共 50 条
  • [21] Using Topology Information for Protein-Protein Interaction Prediction
    Birlutiu, Adriana
    Heskes, Tom
    PATTERN RECOGNITION IN BIOINFORMATICS, PRIB 2014, 2014, 8626 : 10 - 22
  • [22] Prediction of Coal Ash Fusion Temperature by Least-Squares Support Vector Machine Model
    Zhao, Bingtao
    Zhang, Zhongxiao
    Wu, Xiaojiang
    ENERGY & FUELS, 2010, 24 (05) : 3066 - 3071
  • [23] Prediction of protein-protein interactions using support vector machines
    Dohkan, S
    Koike, A
    Takagi, T
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 576 - 583
  • [24] A Dual-Based Pruning Method for the Least-Squares Support Vector Machine
    Xia, Xiao-Lei
    Zhou, Shang-Ming
    Ouyang, Mingxing
    Xiang, Dafang
    Zhang, Zhijun
    Zhou, Zexiang
    IFAC PAPERSONLINE, 2023, 56 (02): : 10377 - 10383
  • [25] Improved sparse least-squares support vector machine classifiers
    Li, Yuangui
    Lin, Chen
    Zhang, Weidong
    NEUROCOMPUTING, 2006, 69 (13-15) : 1655 - 1658
  • [26] Sensitivity prediction of sensor based on least squares support vector machine
    Wang, Z. (zhixuewangg@126.com), 2012, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (08):
  • [27] Coal consumption prediction based on least squares support vector machine
    Zhang, Li
    Zhou, Liansheng
    Zhang, Yingtian
    Wang, Kun
    Zhang, Yu
    E, Zhijun
    Gan, Zhiyong
    Wang, Ziyue
    Qu, Bin
    Li, Guohao
    THIRD INTERNATIONAL CONFERENCE ON ENERGY ENGINEERING AND ENVIRONMENTAL PROTECTION, 2019, 227
  • [28] Prediction of protein-protein binding site by using core interface residue and support vector machine
    Li, Nan
    Sun, Zhonghua
    Jiang, Fan
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [29] Prediction of protein-protein binding site by using core interface residue and support vector machine
    Nan Li
    Zhonghua Sun
    Fan Jiang
    BMC Bioinformatics, 9
  • [30] Discussion of "Improving Prediction Accuracy of Hydrologic Time Series by Least-Squares Support Vector Machine Using Decomposition Reconstruction and Swarm Intelligence "
    Kisi, Ozgur
    Liepelt, Kai
    Kulls, Christoph
    JOURNAL OF HYDROLOGIC ENGINEERING, 2023, 28 (04)