Prediction of protein-protein interactions based on deep neural networks

被引:0
|
作者
Liu G.-X. [1 ,2 ]
Wang M.-Y. [1 ,2 ]
Su L.-T. [1 ,2 ]
Wu C.-G. [1 ,2 ]
Sun L.-Y. [1 ,2 ]
Wang R.-Q. [1 ,2 ]
机构
[1] College of Computer Science and Technology, Jilin University, Changchun
[2] Symbol Computation and Knowledge Engineering of Ministry Education, Jilin University, Changchun
关键词
Artificial intelligence; Deep neural network; Protein features; Protein sequence; Protein-protein interaction;
D O I
10.13229/j.cnki.jdxbgxb20171101
中图分类号
学科分类号
摘要
In order to deal with the high false-positive to false-negative rate in experimental methods, a Deep Neural Network (DNN) is constructed based on several biology features. Protein features, including GO term semantic similarity, sequence similarity, essentiality and subcellular localization information, are integrated from diverse databases to form a fixed-length eigenvector. This vector contains a great deal of related information and can be used as the input of a classifier to predict protein interactions. Then the DNN which is data driven is constructed. It is used to automatically learn information from the input data and predict whether the unknown protein pairs interact or not. Dropout is used during the training phase to prevent co-adaption and improve its performance. The method achieves a prediction accuracy of 95.67% with 96.38% precision on the S. cerevisae dataset. Experimental results show that the extracted features are suitable for the prediction of PPIs, and many commonly used machine learning models can predict interaction effectively and efficiently based on this eigenvector. Moreover the DNN has good generalization capacity and shows high performance on various feature data. © 2019, Jilin University Press. All right reserved.
引用
收藏
页码:570 / 577
页数:7
相关论文
共 23 条
  • [1] Ho Y., Gruhler A., Heilbut A., Et al., Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry, Nature, 415, 6868, pp. 180-183, (2002)
  • [2] Zhu H., Bilgin M., Bangham R., Et al., Global analysis of protein activities using proteome chips, Science, 293, 5537, pp. 2101-2105, (2001)
  • [3] Ito T., Chiba T., Ozawa R., Et al., A comprehensive two-hybrid analysis to explore the yeast protein interactome, Proceedings of the National Academy of Sciences of the United States of America, 98, 8, pp. 4569-4574, (2001)
  • [4] Huang Y.A., You Z.H., Gao X., Et al., Using weighted sparse representation model combined with discrete cosine transformation to predict protein-protein interactions from protein sequence, Biomed Research International, 2015, pp. 1-10, (2015)
  • [5] You Z.H., Zhu L., Zheng C.H., Et al., Prediction of protein-protein interactions from amino acid sequences using a novel multi-scale continuous and discontinuous feature set, BMC Bioinformatics, 15, pp. 1-9, (2014)
  • [6] Wong L., You Z.H., Ming Z., Et al., Detection of interactions between proteins through rotation forest and local phase quantization descriptors, International Journal of Molecular Sciences, 17, 1, pp. 21-31, (2015)
  • [7] Huang Y.A., You Z.H., Xing C., Et al., Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding, BMC Bioinformatics, 17, 1, pp. 184-194, (2016)
  • [8] Chatterjee P., Basu S., Kundu M., Et al., PPI_SVM: prediction of protein-protein interactions using machine learning, domain-domain affinities and frequency tables, Cellular & Molecular Biology Letters, 16, 2, pp. 264-278, (2011)
  • [9] Saha I., Zubek J., Klingstrom T., Et al., Ensemble learning prediction of protein-protein interactions using proteins functional annotations, Molecular Biosystems, 10, 4, pp. 820-830, (2014)
  • [10] Xenarios I., Salwinski L., Duan X.J., Et al., DIP, the database of interacting proteins: a research tool for studying cellular networks of protein interactions, Nucleic Acids Research, 30, 1, pp. 303-305, (2002)