A novel conjoint triad auto covariance (CTAC) coding method for predicting protein-protein interaction based on amino acid sequence

被引:9
|
作者
Wang Xue [1 ,2 ,3 ]
Wang Rujing [2 ]
Wei Yuanyuan [2 ]
Gui Yuanmiao [2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Tech Biol & Agr Engn, Hefei 230031, Anhui, Peoples R China
[2] Chinese Acad Sci, Inst Intelligent Machine, Hefei Inst Phys Sci, Hefei 230031, Anhui, Peoples R China
[3] Univ Sci & Technol China, Hefei 230026, Anhui, Peoples R China
关键词
Deep neural networks; Protein-protein interaction; Conjoint triad auto covariance; DEEP NEURAL-NETWORKS; FOREST;
D O I
10.1016/j.mbs.2019.04.002
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein-protein interactions (PPIs) play a crucial role in the life-sustaining activities of organisms. Although various methods for the prediction of PPIs have been developed in the past decades, their robustness and prediction accuracy need to be improved. Therefore, it is necessary to develop an effective and accurate method to predict PPIs. Aiming at making sure that PPIs can be predicted effectively, in this paper, we propose a new sequence-based approach based on deep neural network (DNN) and conjoint triad auto covariance (CTAC) to improve the effectiveness of predicting PPIs. The coding method of CTAC combines the advantages of conjoint triad and auto covariance. Therefore, the CTAC can obtain more PPIs information from the amino acid sequence. The model of DNN-CTAC achieved an accuracy of 98.37%, recall of 99.41%, area under the curve (AUC) of 99.24% and loss of 22.7%, respectively, on human dataset. These results indicate that DNN-CTAC can enhance the predictive power of PPIs and can significantly enhance the accuracy of the prediction. And, it has proved to be a useful complement to future proteomics research. The source codes and all datasets are available at https://github.com/smalltalkman/hppi-tensorflow.
引用
收藏
页码:41 / 47
页数:7
相关论文
共 50 条
  • [21] Iteration method for predicting essential proteins based on orthology and protein-protein interaction networks
    Peng, Wei
    Wang, Jianxin
    Wang, Weiping
    Liu, Qing
    Wu, Fang-Xiang
    Pan, Yi
    BMC SYSTEMS BIOLOGY, 2012, 6
  • [22] A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites
    Wang, Pan
    Zhang, Guiyang
    Yu, Zu-Guo
    Huang, Guohua
    FRONTIERS IN GENETICS, 2021, 12
  • [23] Sequence-based machine learning method for predicting the effects of phosphorylation on protein-protein interactions
    Hong, Xiaokun
    Lv, Jiyang
    Li, Zhengxin
    Xiong, Yi
    Zhang, Jian
    Chen, Hai-Feng
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 243
  • [24] Clustered complementary amino acid pairing (CCAAP) for protein-protein interaction
    Baek, Christina Kyung Eun
    Baek, Chang-Ho
    BIOTECHNOLOGY LETTERS, 2019, 41 (01) : 79 - 90
  • [25] Geometric and amino acid type determinants for protein-protein interaction interfaces
    Yang, Yongxiao
    Wang, Wei
    Lou, Yuan
    Yin, Jianxin
    Gong, Xinqi
    QUANTITATIVE BIOLOGY, 2018, 6 (02) : 163 - 174
  • [26] Geometric and amino acid type determinants for protein-protein interaction interfaces
    Yongxiao Yang
    Wei Wang
    Yuan Lou
    Jianxin Yin
    Xinqi Gong
    Quantitative Biology, 2018, 6 (02) : 163 - 174
  • [27] An algorithm for predicting protein-protein interaction sites: Abnormally exposed amino acid residues and secondary structure elements
    Hoskins, J
    Lovell, S
    Blundell, TL
    PROTEIN SCIENCE, 2006, 15 (05) : 1017 - 1029
  • [28] PREDICTING PROTEIN SECONDARY STRUCTURE BASED ON AMINO-ACID-SEQUENCE
    NISHIKAWA, K
    NOGUCHI, T
    METHODS IN ENZYMOLOGY, 1991, 202 : 31 - 44
  • [29] A novel intra-molecular protein-protein interaction code based on partial complementary coding of co-locating amino acids
    Biro, JC
    MEDICAL HYPOTHESES, 2006, 66 (01) : 137 - 142
  • [30] Prediction of potential protein-protein interaction sites from amino acid sequence - Identification of a fibrin polymerization site
    Kini, RM
    Evans, HJ
    FEBS LETTERS, 1996, 385 (1-2) : 81 - 86