GPTrans: A Biological Language Model-Based Approach for Predicting Disease-Associated Mutations in G Protein-Coupled Receptors

被引:0
|
作者
Wang, Xiaohua [1 ]
Zhang, Ming [1 ]
Yang, Xibei [1 ]
Yu, Dong-Jun [2 ]
Ge, Fang [3 ,4 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212100, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Nanjing Univ Posts & Telecommun, State Key Lab Organ Elect & Informat Displays, Nanjing 210023, Peoples R China
[4] Nanjing Univ Posts & Telecommun, Inst Adv Mat IAM, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
AMINO-ACID SUBSTITUTIONS; DRUG DISCOVERY; VARIANTS; SERVER;
D O I
10.1021/acs.jcim.4c01999
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Accurately predicting mutations in G protein-coupled receptors (GPCRs) is critical for advancing disease diagnosis and drug discovery. In response to this imperative, GPTrans has emerged as a highly accurate predictor of disease-related mutations in GPCRs. The core innovation of GPTrans resides in the design of a novel feature extraction network, that is capable of integrating features from both wildtype and mutant protein variant sites, utilizing multifeature connections within a transformer framework to ensure comprehensive feature extraction. A key aspect of GPTrans's effectiveness is our introduction of an innovative deep feature integration strategy, which merges embeddings and class tokens from multiple protein language models, including evolutionary scale modeling and ProtTrans, thus shedding light on the biochemical properties of proteins. Leveraging transformer components and a self-attention mechanism, GPTrans captures higher-level representations of protein features. Employing both wildtype and mutation site information for feature fusion not only enriches the predictive feature set but also avoids the common issue of overestimation associated with sequence-based predictions. This approach distinguishes GPTrans, enabling it to significantly outperform existing methods. Our evaluations across diverse GPCR data sets, including ClinVar and MutHTP, demonstrate GPTrans's superior performance, with average AUC values of 0.874 and 0.590 in 10-fold cross-validation. Notably, compared to the AlphaMissense method, GPTrans exhibited a remarkable 38.03% improvement in accuracy when predicting disease-associated mutations in the MutHTP data set. A thorough analysis of the predicted results further validates the model's effectiveness. The source code, data sets, and prediction results for GPTrans are available for academic use at https://github.com/EduardWang/GPTrans.
引用
收藏
页码:9626 / 9642
页数:17
相关论文
共 50 条
  • [41] Conformational Ensemble View of G Protein-Coupled Receptors and the Effect of Mutations and Ligand Binding
    Abrol, Ravinder
    Kim, Soo-Kyung
    Bray, Jenelle K.
    Trzaskowski, Bartosz
    Goddard, William A., III
    G PROTEIN COUPLED RECEPTORS: STRUCTURE, 2013, 520 : 31 - 48
  • [42] Pan-cancer functional analysis of somatic mutations in G protein-coupled receptors
    B. J. Bongers
    M. Gorostiola González
    X. Wang
    H. W. T. van Vlijmen
    W. Jespers
    H. Gutiérrez-de-Terán
    K. Ye
    A. P. IJzerman
    L. H. Heitman
    G. J. P. van Westen
    Scientific Reports, 12
  • [43] Pan-cancer functional analysis of somatic mutations in G protein-coupled receptors
    Bongers, B. J.
    Gonzalez, M. Gorostiola
    Wang, X.
    van Vlijmen, H. W. T.
    Jespers, W.
    Gutierrez-de-Teran, H.
    Ye, K.
    IJzerman, A. P.
    Heitman, L. H.
    van Westen, G. J. P.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [44] Progress in Structure Based Drug Design for G Protein-Coupled Receptors
    Congreve, Miles
    Langmead, Christopher J.
    Mason, Jonathan S.
    Marshall, Fiona H.
    JOURNAL OF MEDICINAL CHEMISTRY, 2011, 54 (13) : 4283 - 4311
  • [45] G Protein-Coupled Receptors: Target-Based In Silico Screening
    Senderowitz, Hanoch
    Marantz, Yael
    CURRENT PHARMACEUTICAL DESIGN, 2009, 15 (35) : 4049 - 4068
  • [46] Which preconditioning-associated G protein-coupled receptors are expressed on the sarcolemma?
    Xin, Wenkuan
    Cohen, Michael V.
    Rich, Thomas C.
    Downey, James M.
    FASEB JOURNAL, 2009, 23
  • [47] Towards understanding the structure and function of g protein-coupled receptors: a multidisciplinary approach
    Shukla, A.
    Reinhart, C.
    Michel, H.
    FEBS JOURNAL, 2006, 273 : 89 - 89
  • [48] Drug Repurposing on G Protein-Coupled Receptors Using a Computational Profiling Approach
    de Felice, Alessandra
    Aureli, Simone
    Limongelli, Vittorio
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2021, 8
  • [49] Functionalized Congener Approach to the Design of Ligands for G Protein-Coupled Receptors (GPCRs)
    Jacobson, Kenneth A.
    BIOCONJUGATE CHEMISTRY, 2009, 20 (10) : 1816 - 1835
  • [50] Orphan G protein-coupled receptors (GPCRs): biological functions and potential drug targets
    Xiao-long Tang
    Ying Wang
    Da-li Li
    Jian Luo
    Ming-yao Liu
    Acta Pharmacologica Sinica, 2012, 33 : 363 - 371