Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network

被引:19
|
作者
Zhao, Lingling [1 ]
Wang, Junjie [1 ]
Hu, Yang [2 ]
Cheng, Liang [3 ,4 ]
机构
[1] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Sch Life Sci & Technol, Dept Comp Sci, Harbin 150001, Peoples R China
[3] Harbin Med Univ, NHC & CAMS Key Lab Mol Probe & Targeted Theranost, Harbin 150028, Heilongjiang, Peoples R China
[4] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin 150081, Heilongjiang, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
NEURAL-NETWORK;
D O I
10.1016/j.omtn.2020.08.025
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Protein-protein interactions (PPIs) are pivotal for cellular functions and biological processes. In the past years, computational methods using amino acid sequences and gene ontology (GO) annotations of proteins for prioritizing PPIs have provided important references for biological experiments in the wet lab. Despite the current success, sequence information and ontological annotation in semantic representation have not been integrated into current methods. We propose a deep-learning-based PPI prediction methodology conjointly featuring sequence information and GO annotation. First, we adopt a word-embedding tool, the NCBI-blueBERT model pre-trained on PubMed, to map the GO terms into their semantic vectors. Then, the GO semantic vectors and protein sequence vector serve as the input of the proposed inception recurrent neural network (RNN) attention network (IRAN). The IRAN captures the spatial relationship and the potential sequential feature of the protein sequence and ontological annotation semantics. The extensive experimental results on 12 benchmarks demonstrate that our method achieves superiority over state-of-the-art baselines. In the yeast dataset of a binary PPI prediction, our method improved the performance with the Matthews correlation coefficient increasing from 94.2% to 98.2% and the accuracy from 97.1% to 98.2%. The analogous results were also obtained in other comparison evaluations.
引用
收藏
页码:198 / 208
页数:11
相关论文
共 50 条
  • [41] Multi-feature Urban Traffic Prediction Based on Unconstrained Graph Attention Network
    He, Hangtao
    Ye, Kejiang
    Xu, Cheng-Zhong
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1409 - 1417
  • [42] SAINT-Angle: self-attention augmented inception-inside-inception network and transfer learning improve protein backbone torsion angle prediction
    Hasan, A. K. M. Mehedi
    Ahmed, Ajmain Yasar
    Mahbub, Sazan
    Rahman, M. Saifur
    Bayzid, Md Shamsuzzoha
    BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [43] DeepCF-PPI: improved prediction of protein-protein interactions by combining learned and handcrafted features based on attention mechanisms
    Hoai-Nhan Tran
    Quynh Nguyen Phuc Xuan
    Tuong-Tri Nguyen
    Applied Intelligence, 2023, 53 : 17887 - 17902
  • [44] DeepCF-PPI: improved prediction of protein-protein interactions by combining learned and handcrafted features based on attention mechanisms
    Tran, Hoai-Nhan
    Xuan, Quynh Nguyen Phuc
    Nguyen, Tuong-Tri
    APPLIED INTELLIGENCE, 2023, 53 (14) : 17887 - 17902
  • [45] A Sub-Sequence Based Approach to Protein Function Prediction via Multi-Attention Based Multi-Aspect Network
    Ranjan, Ashish
    Tiwari, Archana
    Deepak, Akshay
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 94 - 105
  • [47] SEGT-GO: a graph transformer method based on PPI serialization and explanatory artificial intelligence for protein function prediction
    Wang, Yansong
    Sun, Yundong
    Lin, Baohui
    Zhang, Haotian
    Luo, Xiaoling
    Liu, Yumeng
    Jin, Xiaopeng
    Zhu, Dongjie
    BMC BIOINFORMATICS, 2025, 26 (01):
  • [48] Prediction of protein solubility based on sequence physicochemical patterns and distributed representation information with DeepSoluE
    Chao Wang
    Quan Zou
    BMC Biology, 21
  • [49] Prediction of protein solubility based on sequence physicochemical patterns and distributed representation information with DeepSoluE
    Wang, Chao
    Zou, Quan
    BMC BIOLOGY, 2023, 21 (01)
  • [50] Discriminative Feature Representation Based on Cascaded Attention Network with Adversarial Joint Loss for Speech Emotion Recognition
    Liu, Yang
    Sun, Haoqin
    Guan, Wenbo
    Xia, Yuqi
    Zhao, Zhen
    INTERSPEECH 2022, 2022, : 4750 - 4754