Protein-protein interaction and site prediction using transfer learning

被引:9
|
作者
Liu, Tuoyu [1 ]
Gao, Han [2 ]
Ren, Xiaopu [1 ]
Xu, Guoshun [2 ]
Liu, Bo [1 ]
Wu, Ningfeng [1 ]
Luo, Huiying [2 ]
Wang, Yuan [2 ]
Tu, Tao [2 ]
Yao, Bin [2 ]
Guan, Feifei
Teng, Yue [3 ]
Huang, Huoqing [2 ]
Tian, Jian [2 ]
机构
[1] Chinese Acad Agr Sci, Biotechnol Res Inst, Beijing, Peoples R China
[2] Chinese Acad Agr Sci, Inst Anim Sci, Beijing, Peoples R China
[3] Acad Mil Med Sci, Beijing Inst Microbiol & Epidemiol, State Key Lab Pathogen & Biosecur, Beijing 100071, Peoples R China
基金
中国国家自然科学基金;
关键词
protein-protein interaction; transformer; PPI site; transfer learning; BERT; FINGERPRINTS; DATABASE; UNIREF; SYSTEM;
D O I
10.1093/bib/bbad376
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The advanced language models have enabled us to recognize protein-protein interactions (PPIs) and interaction sites using protein sequences or structures. Here, we trained the MindSpore ProteinBERT (MP-BERT) model, a Bidirectional Encoder Representation from Transformers, using protein pairs as inputs, making it suitable for identifying PPIs and their respective interaction sites. The pretrained model (MP-BERT) was fine-tuned as MPB-PPI (MP-BERT on PPI) and demonstrated its superiority over the state-of-the-art models on diverse benchmark datasets for predicting PPIs. Moreover, the model's capability to recognize PPIs among various organisms was evaluated on multiple organisms. An amalgamated organism model was designed, exhibiting a high level of generalization across the majority of organisms and attaining an accuracy of 92.65%. The model was also customized to predict interaction site propensity by fine-tuning it with PPI site data as MPB-PPISP. Our method facilitates the prediction of both PPIs and their interaction sites, thereby illustrating the potency of transfer learning in dealing with the protein pair task.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Algorithmic approaches to protein-protein interaction site prediction
    Tristan T Aumentado-Armstrong
    Bogdan Istrate
    Robert A Murgita
    Algorithms for Molecular Biology, 10
  • [2] Algorithmic approaches to protein-protein interaction site prediction
    Aumentado-Armstrong, Tristan T.
    Istrate, Bogdan
    Murgita, Robert A.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10
  • [3] EGRET: edge aggregated graph attention networks and transfer learning improve protein-protein interaction site prediction
    Mahbub, Sazan
    Bayzid, Md Shamsuzzoha
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [4] Protein-protein interaction site prediction using random forest proximity distance
    Qiu, Zhijun
    Liu, Qingjie
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2021, 19 (01)
  • [5] Active learning for human protein-protein interaction prediction
    Mohamed, Thahir P.
    Carbonell, Jaime G.
    Ganapathiraju, Madhavi K.
    BMC BIOINFORMATICS, 2010, 11
  • [6] Revolutionizing protein-protein interaction prediction with deep learning
    Zhang, Jing
    Durham, Jesse
    Cong, Qian
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2024, 85
  • [7] Active learning for human protein-protein interaction prediction
    Thahir P Mohamed
    Jaime G Carbonell
    Madhavi K Ganapathiraju
    BMC Bioinformatics, 11
  • [8] Deep learning frameworks for protein-protein interaction prediction
    Hu, Xiaotian
    Feng, Cong
    Ling, Tianyi
    Chen, Ming
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 3223 - 3233
  • [9] Classification and prediction of protein-protein interaction interface using machine learning algorithm
    Das, Subhrangshu
    Chakrabarti, Saikat
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [10] Improving protein-protein interaction site prediction using deep residual neural network
    Hu, Jun
    Dong, Ming
    Tang, Yu-Xuan
    Zhang, Gui-Jun
    ANALYTICAL BIOCHEMISTRY, 2023, 670