TUnA: an uncertainty-aware transformer model for sequence-based protein-protein interaction prediction

被引:1
|
作者
Ko, Young Su [1 ]
Parkinson, Jonathan [1 ]
Liu, Cong [1 ]
Wang, Wei [1 ,2 ]
机构
[1] Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Cellular & Mol Med, La Jolla, CA 92093 USA
基金
美国国家卫生研究院;
关键词
protein-protein interaction prediction; deep learning; uncertainty awareness;
D O I
10.1093/bib/bbae359
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein-protein interactions (PPIs) are important for many biological processes, but predicting them from sequence data remains challenging. Existing deep learning models often cannot generalize to proteins not present in the training set and do not provide uncertainty estimates for their predictions. To address these limitations, we present TUnA, a Transformer-based uncertainty-aware model for PPI prediction. TUnA uses ESM-2 embeddings with Transformer encoders and incorporates a Spectral-normalized Neural Gaussian Process. TUnA achieves state-of-the-art performance and, importantly, evaluates uncertainty for unseen sequences. We demonstrate that TUnA's uncertainty estimates can effectively identify the most reliable predictions, significantly reducing false positives. This capability is crucial in bridging the gap between computational predictions and experimental validation.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Sequence-Based Prediction of Fuzzy Protein Interactions
    Miskei, Marton
    Horvath, Attila
    Vendruscolo, Michele
    Fuxreiter, Monika
    JOURNAL OF MOLECULAR BIOLOGY, 2020, 432 (07) : 2289 - 2303
  • [42] Prediction of membrane protein types by fusing protein-protein interaction and protein sequence information
    Zhang, Xiaolin
    Chen, Lei
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2020, 1868 (12):
  • [43] Sequence-based Prediction of Protein-Protein Interactions Using Gray Wolf Optimizer-Based Relevance Vector Machine
    An, Ji-Yong
    You, Zhu-Hong
    Zhou, Yong
    Wang, Da-Fu
    EVOLUTIONARY BIOINFORMATICS, 2019, 15
  • [44] In Silico Protein-Protein Interaction Prediction with Sequence Alignment and Classifier Stacking
    Marini, Simone
    Xu, Qian
    Yang, Qiang
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2011, 12 (07) : 614 - 620
  • [45] Evolutionary profiles improve protein-protein interaction prediction from sequence
    Hamp, Tobias
    Rost, Burkhard
    BIOINFORMATICS, 2015, 31 (12) : 1945 - 1950
  • [46] Protein-Protein Interaction Prediction Model Based on ProtBert-BiGRU-Attention
    Gao, Qian
    Zhang, Chi
    Li, Ming
    Yu, Tianfei
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2024, 31 (09) : 797 - 814
  • [47] Human disease-gene classification with integrative sequence-based and topological features of protein-protein interaction networks
    Smalter, Aaron
    Lei, Seak Fei
    Chen, Xue-wen
    2007 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2007, : 209 - 214
  • [48] Seq-BEL: Sequence-Based Ensemble Learning for Predicting Virus-Human Protein-Protein Interaction
    Ma, Yingjun
    He, Tingting
    Tan, Yuting
    Jiang, Xingpeng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1322 - 1333
  • [49] Prediction of Protein-Protein Interactions Using Vision Transformer and Language Model
    Jha, Kanchan
    Saha, Sriparna
    Karmakar, Sourav
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (05) : 3215 - 3225
  • [50] Combining sequence and network information to enhance protein-protein interaction prediction
    Liu, Leilei
    Zhu, Xianglei
    Ma, Yi
    Piao, Haiyin
    Yang, Yaodong
    Hao, Xiaotian
    Fu, Yue
    Wang, Li
    Peng, Jiajie
    BMC BIOINFORMATICS, 2020, 21 (Suppl 16)