Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network

被引:19
|
作者
Zhao, Lingling [1 ]
Wang, Junjie [1 ]
Hu, Yang [2 ]
Cheng, Liang [3 ,4 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Sch Life Sci & Technol, Dept Comp Sci, Harbin 150001, Peoples R China
[3] Harbin Med Univ, NHC & CAMS Key Lab Mol Probe & Targeted Theranost, Harbin 150028, Heilongjiang, Peoples R China
[4] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin 150081, Heilongjiang, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
NEURAL-NETWORK;
D O I
10.1016/j.omtn.2020.08.025
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Protein-protein interactions (PPIs) are pivotal for cellular functions and biological processes. In the past years, computational methods using amino acid sequences and gene ontology (GO) annotations of proteins for prioritizing PPIs have provided important references for biological experiments in the wet lab. Despite the current success, sequence information and ontological annotation in semantic representation have not been integrated into current methods. We propose a deep-learning-based PPI prediction methodology conjointly featuring sequence information and GO annotation. First, we adopt a word-embedding tool, the NCBI-blueBERT model pre-trained on PubMed, to map the GO terms into their semantic vectors. Then, the GO semantic vectors and protein sequence vector serve as the input of the proposed inception recurrent neural network (RNN) attention network (IRAN). The IRAN captures the spatial relationship and the potential sequential feature of the protein sequence and ontological annotation semantics. The extensive experimental results on 12 benchmarks demonstrate that our method achieves superiority over state-of-the-art baselines. In the yeast dataset of a binary PPI prediction, our method improved the performance with the Matthews correlation coefficient increasing from 94.2% to 98.2% and the accuracy from 97.1% to 98.2%. The analogous results were also obtained in other comparison evaluations.
引用
收藏
页码:198 / 208
页数:11
相关论文
共 50 条
  • [31] Sequence Generation Network Based on Hierarchical Attention for Multi-Charge Prediction
    Zhu, Kongfan
    Ma, Baosen
    Huang, Tianhuan
    Li, Zeqiang
    Ma, Haoyang
    Li, Yujun
    IEEE ACCESS, 2020, 8 : 109315 - 109324
  • [32] Sequence representation approaches for sequence-based protein prediction tasks that use deep learning
    Cui, Feifei
    Zhang, Zilong
    Zou, Quan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2021, 20 (01) : 61 - 73
  • [33] PPI-Detect: A Support Vector Machine Model for Sequence-Based Prediction of Protein-Protein Interactions
    Romero-Molina, Sandra
    Ruiz-Blanco, Yasser B.
    Harms, Mirja
    Muench, Jan
    Sanchez-Garcia, Elsa
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2019, 40 (11) : 1233 - 1242
  • [34] DSSGNN-PPI: A Protein–Protein Interactions prediction model based on Double Structure and Sequence graph neural networks
    Zhang F.
    Chang S.
    Wang B.
    Zhang X.
    Computers in Biology and Medicine, 2024, 177
  • [35] ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning
    Jiao, Shihu
    Chen, Zheng
    Zhang, Lichao
    Zhou, Xun
    Shi, Lei
    AMINO ACIDS, 2022, 54 (05) : 799 - 809
  • [36] ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning
    Shihu Jiao
    Zheng Chen
    Lichao Zhang
    Xun Zhou
    Lei Shi
    Amino Acids, 2022, 54 : 799 - 809
  • [37] Sequence-Based Prediction with Feature Representation Learning and Biological Function Analysis of Channel Proteins
    Chen, Zheng
    Jiao, Shihu
    Zhao, Da
    Hesham, Abd El-Latif
    Zou, Quan
    Xu, Lei
    Sun, Mingai
    Zhang, Lijun
    FRONTIERS IN BIOSCIENCE-LANDMARK, 2022, 27 (06):
  • [38] Prediction of biomarker-disease associations based on graph attention network and text representation
    Yang, Minghao
    Huang, Zhi-An
    Gu, Wenhao
    Han, Kun
    Pan, Wenying
    Yang, Xiao
    Zhu, Zexuan
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [39] Decision Tree Based Approaches for Detecting Protein Complex in Protein Protein Interaction Network (PPI) via Link and Sequence Analysis
    Sikandar, Aisha
    Anwar, Waqas
    Bajwa, Usama Ijaz
    Wang, Xuan
    Sikandar, Misba
    Yao, Lin
    Jiang, Zoe L.
    Zhang Chunkai
    IEEE ACCESS, 2018, 6 : 22108 - 22120
  • [40] MFAGCN: Multi-Feature Based Attention Graph Convolutional Network for Traffic Prediction
    Li, Haoran
    Li, Jianbo
    Lv, Zhiqiang
    Xu, Zhihao
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 227 - 239