TUnA: an uncertainty-aware transformer model for sequence-based protein-protein interaction prediction

被引:1
|
作者
Ko, Young Su [1 ]
Parkinson, Jonathan [1 ]
Liu, Cong [1 ]
Wang, Wei [1 ,2 ]
机构
[1] Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Cellular & Mol Med, La Jolla, CA 92093 USA
基金
美国国家卫生研究院;
关键词
protein-protein interaction prediction; deep learning; uncertainty awareness;
D O I
10.1093/bib/bbae359
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein-protein interactions (PPIs) are important for many biological processes, but predicting them from sequence data remains challenging. Existing deep learning models often cannot generalize to proteins not present in the training set and do not provide uncertainty estimates for their predictions. To address these limitations, we present TUnA, a Transformer-based uncertainty-aware model for PPI prediction. TUnA uses ESM-2 embeddings with Transformer encoders and incorporates a Spectral-normalized Neural Gaussian Process. TUnA achieves state-of-the-art performance and, importantly, evaluates uncertainty for unseen sequences. We demonstrate that TUnA's uncertainty estimates can effectively identify the most reliable predictions, significantly reducing false positives. This capability is crucial in bridging the gap between computational predictions and experimental validation.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] MULTIMODAL PRE-TRAINING MODEL FOR SEQUENCE-BASED PREDICTION OF PROTEIN-PROTEIN INTERACTION
    Xue, Yang
    Liu, Zijing
    Fang, Xiaomin
    Wang, Fan
    MACHINE LEARNING IN COMPUTATIONAL BIOLOGY, VOL 165, 2021, 165 : 34 - 46
  • [2] Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction
    Khatun, Mst Shamima
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    CURRENT GENOMICS, 2020, 21 (06) : 454 - 463
  • [3] Cracking the black box of deep sequence-based protein-protein interaction prediction
    Bernett, Judith
    Blumenthal, David B.
    List, Markus
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [4] Sequence-based protein-protein interaction prediction via support vector machine
    Yongcui Wang
    Jiguang Wang
    Zhixia Yang
    Naiyang Deng
    Journal of Systems Science and Complexity, 2010, 23 : 1012 - 1023
  • [5] DeNovo: virus-host sequence-based protein-protein interaction prediction
    Eid, Fatma-Elzahraa
    ElHefnawi, Mahmoud
    Heath, Lenwood S.
    BIOINFORMATICS, 2016, 32 (08) : 1144 - 1150
  • [6] Sequence-based protein-protein interaction prediction via support vector machine
    Wang, Yongcui
    Wang, Jiguang
    Yang, Zhixia
    Deng, Naiyang
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2010, 23 (05) : 1012 - 1023
  • [7] Sequence-based protein-protein interaction prediction optimized for target selection in biological experiments
    Ye, Jiankuan
    Kulikowski, Casimir
    Muchnik, Ilya
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 236 - 239
  • [8] Recent developments of sequence-based prediction of protein-protein interactions
    Murakami, Yoichi
    Mizuguchi, Kenji
    BIOPHYSICAL REVIEWS, 2022, 14 (06) : 1393 - 1411
  • [9] Sequence-based prediction of protein-protein interaction sites with L1-logreg classifier
    Dhole, Kaustubh
    Singh, Gurdeep
    Pai, Priyadarshini P.
    Mondal, Sukanta
    JOURNAL OF THEORETICAL BIOLOGY, 2014, 348 : 47 - 54
  • [10] Sequence-based prediction of protein-protein interactions by means of codon usage
    Najafabadi, Hamed Shateri
    Salavati, Reza
    GENOME BIOLOGY, 2008, 9 (05)