A novel conjoint triad auto covariance (CTAC) coding method for predicting protein-protein interaction based on amino acid sequence

被引:9
|
作者
Wang Xue [1 ,2 ,3 ]
Wang Rujing [2 ]
Wei Yuanyuan [2 ]
Gui Yuanmiao [2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Tech Biol & Agr Engn, Hefei 230031, Anhui, Peoples R China
[2] Chinese Acad Sci, Inst Intelligent Machine, Hefei Inst Phys Sci, Hefei 230031, Anhui, Peoples R China
[3] Univ Sci & Technol China, Hefei 230026, Anhui, Peoples R China
关键词
Deep neural networks; Protein-protein interaction; Conjoint triad auto covariance; DEEP NEURAL-NETWORKS; FOREST;
D O I
10.1016/j.mbs.2019.04.002
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein-protein interactions (PPIs) play a crucial role in the life-sustaining activities of organisms. Although various methods for the prediction of PPIs have been developed in the past decades, their robustness and prediction accuracy need to be improved. Therefore, it is necessary to develop an effective and accurate method to predict PPIs. Aiming at making sure that PPIs can be predicted effectively, in this paper, we propose a new sequence-based approach based on deep neural network (DNN) and conjoint triad auto covariance (CTAC) to improve the effectiveness of predicting PPIs. The coding method of CTAC combines the advantages of conjoint triad and auto covariance. Therefore, the CTAC can obtain more PPIs information from the amino acid sequence. The model of DNN-CTAC achieved an accuracy of 98.37%, recall of 99.41%, area under the curve (AUC) of 99.24% and loss of 22.7%, respectively, on human dataset. These results indicate that DNN-CTAC can enhance the predictive power of PPIs and can significantly enhance the accuracy of the prediction. And, it has proved to be a useful complement to future proteomics research. The source codes and all datasets are available at https://github.com/smalltalkman/hppi-tensorflow.
引用
收藏
页码:41 / 47
页数:7
相关论文
共 50 条
  • [31] A novel method for predicting protein subcellular localization based on pseudo amino acid composition
    Ma, Junwei
    Gu, Hong
    BMB REPORTS, 2010, 43 (10) : 670 - 676
  • [32] Identification of Protein-Protein Interactions via a Novel Matrix-Based Sequence Representation Model with Amino Acid Contact Information
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (10)
  • [33] Using the augmented Chou's pseudo amino acid composition for predicting protein submitochondria locations based on auto covariance approach
    Zeng, Yu-hong
    Guo, Yan-zhi
    Xiao, Rong-quan
    Yang, Li
    Yu, Le-zheng
    Li, Meng-long
    JOURNAL OF THEORETICAL BIOLOGY, 2009, 259 (02) : 366 - 372
  • [34] A Novel Feature Extraction Scheme with Ensemble Coding for Protein-Protein Interaction Prediction
    Du, Xiuquan
    Cheng, Jiaxing
    Zheng, Tingting
    Duan, Zheng
    Qian, Fulan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (07) : 12731 - 12749
  • [35] Comment on 'protein-protein binding affinity prediction from amino acid sequence'
    Moal, Iain H.
    Fernandez-Recio, Juan
    BIOINFORMATICS, 2015, 31 (04) : 614 - 615
  • [36] Prediction of Protein-Protein Interactions Using Local Description of Amino Acid Sequence
    Zhou, Yu Zhen
    Gao, Yun
    Zheng, Ying Ying
    ADVANCES IN COMPUTER SCIENCE AND EDUCATION APPLICATIONS, PT II, 2011, 202 : 254 - +
  • [37] Human branched-chain amino acid metabolon: A novel protein-protein interaction in a supramolecular complex
    Islam, MM
    Wallin, R
    Wynn, R
    Mobley, J
    Chuang, D
    Hutson, SM
    FASEB JOURNAL, 2006, 20 (04): : A530 - A530
  • [38] A Method for Predicting Protein Complexes from Dynamic Weighted Protein-Protein Interaction Networks
    Liu, Lizhen
    Sun, Xiaowu
    Song, Wei
    Du, Chao
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2018, 25 (06) : 586 - 605
  • [39] GOR method for predicting protein secondary structure from amino acid sequence
    Garnier, J
    Gibrat, JF
    Robson, B
    COMPUTER METHODS FOR MACROMOLECULAR SEQUENCE ANALYSIS, 1996, 266 : 540 - 553
  • [40] A statistical method for predicting protein unfolding rates from amino acid sequence
    Gromiha, M. Michael
    Selvaraj, S.
    Thangakani, A. Mary
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (03) : 1503 - 1508