Contrastive learning of protein representations with graph neural networks for structural and functional annotations

被引:0
|
作者
Luo, Jiaqi [1 ]
Luo, Yunan [2 ]
机构
[1] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing, Peoples R China
[2] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
关键词
Protein annotation; Protein structure and function; Deep learning; Graph neural network; Contrastive learning; Representation learning; SEQUENCE;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Although protein sequence data is growing at an ever-increasing rate, the protein universe is still sparsely annotated with functional and structural annotations. Computational approaches have become efficient solutions to infer annotations for unlabeled proteins by transferring knowledge from proteins with experimental annotations. Despite the increasing availability of protein structure data and the high coverage of high-quality predicted structures, e.g., by AlphaFold, many existing computational tools still only rely on sequence data to predict structural or functional annotations, including alignment algorithms such as BLAST and several sequence-based deep learning models. Here, we develop PenLight, a general deep learning framework for protein structural and functional annotations. PenLight uses a graph neural network (GNN) to integrate 3D protein structure data and protein language model representations. In addition, PenLight applies a contrastive learning strategy to train the GNN for learning protein representations that reflect similarities beyond sequence identity, such as semantic similarities in the function or structure space. We bench-marked PenLight on a structural classification task and a functional annotation task, where PenLight achieved higher prediction accuracy and coverage than state-of-the-art methods.
引用
收藏
页码:109 / 120
页数:12
相关论文
共 50 条
  • [21] Entropy Neural Estimation for Graph Contrastive Learning
    Ma, Yixuan
    Zhang, Xiaolin
    Zhang, Peng
    Zhan, Kun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 435 - 443
  • [22] Neural Graph Similarity Computation with Contrastive Learning
    Hu, Shengze
    Zeng, Weixin
    Zhang, Pengfei
    Tang, Jiuyang
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [23] Learning Invariant Representations of Graph Neural Networks via Cluster Generalization
    Xia, Donglin
    Wang, Xiao
    Liu, Nian
    Shi, Chuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [24] Representations of Graph States with Neural Networks
    Ying Yang
    Acta Mathematica Sinica, English Series, 2023, 39 : 685 - 694
  • [25] Graph Convolutional Neural Networks for Learning Attribute Representations for Word Spotting
    Wolf, Fabian
    Fischer, Andreas
    Fink, Gernot A.
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 50 - 64
  • [26] Representations of Graph States with Neural Networks
    Yang, Ying
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2023, 39 (04) : 685 - 694
  • [27] Representations of Graph States with Neural Networks
    Ying YANG
    ActaMathematicaSinica,EnglishSeries, 2023, (04) : 685 - 694
  • [28] Uncovering the Structural Fairness in Graph Contrastive Learning
    Wang, Ruijia
    Wang, Xiao
    Shi, Chuan
    Song, Le
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [29] Probing Negative Sampling for Contrastive Learning to Learn Graph Representations
    Chen, Shiyi
    Wang, Ziao
    Zhang, Xinni
    Zhang, Xiaofeng
    Peng, Dan
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 434 - 449
  • [30] Contrastive Learning with Generated Representations for Inductive Knowledge Graph Embedding
    Li, Qian
    Joty, Shafiq
    Wang, Daling
    Feng, Shi
    Zhang, Yifei
    Qin, Chengwei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 14273 - 14287