Classifying protein kinase conformations with machine learning

被引:2
|
作者
Reveguk, Ivan [1 ]
Simonson, Thomas [1 ]
机构
[1] Ecole Polytech, Lab Biol Struct Cellule, CNRS, UMR7654, Palaiseau, France
关键词
ATPase; data mining; structural biology; XGBoost; CRYSTAL-STRUCTURE; C-ABL; ACTIVATION; SELECTION; INHIBITION; BINDING; DOMAIN; MECHANISMS; TRANSITION; PLASTICITY;
D O I
10.1002/pro.4918
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein kinases are key actors of signaling networks and important drug targets. They cycle between active and inactive conformations, distinguished by a few elements within the catalytic domain. One is the activation loop, whose conserved DFG motif can occupy DFG-in, DFG-out, and some rarer conformations. Annotation and classification of the structural kinome are important, as different conformations can be targeted by different inhibitors and activators. Valuable resources exist; however, large-scale applications will benefit from increased automation and interpretability of structural annotation. Interpretable machine learning models are described for this purpose, based on ensembles of decision trees. To train them, a set of catalytic domain sequences and structures was collected, somewhat larger and more diverse than existing resources. The structures were clustered based on the DFG conformation and manually annotated. They were then used as training input. Two main models were constructed, which distinguished active/inactive and in/out/other DFG conformations. They considered initially 1692 structural variables, spanning the whole catalytic domain, then identified ("learned") a small subset that sufficed for accurate classification. The first model correctly labeled all but 3 of 3289 structures as active or inactive, while the second assigned the correct DFG label to all but 17 of 8826 structures. The most potent classifying variables were all related to well-known structural elements in or near the activation loop and their ranking gives insights into the conformational preferences. The models were used to automatically annotate 3850 kinase structures predicted recently with the Alphafold2 tool, showing that Alphafold2 reproduced the active/inactive but not the DFG-in proportions seen in the Protein Data Bank. We expect the models will be useful for understanding and engineering kinases.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Classifying kinase conformations using a machine learning approach
    McSkimming, Daniel Ian
    Rasheed, Khaled
    Kannan, Natarajan
    BMC BIOINFORMATICS, 2017, 18
  • [2] Classifying kinase conformations using a machine learning approach
    Daniel Ian McSkimming
    Khaled Rasheed
    Natarajan Kannan
    BMC Bioinformatics, 18
  • [3] Multitask Machine Learning for Classifying Highly and Weakly Potent Kinase Inhibitors
    Rodriguez-Perez, Raquel
    Bajorath, Juergen
    ACS OMEGA, 2019, 4 (02): : 4367 - 4375
  • [4] The ABC of protein kinase conformations
    Moebitz, Henrik
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2015, 1854 (10): : 1555 - 1566
  • [5] Machine learning for classifying learning objects
    Ranganathan, Girish R.
    Biletskiy, Yevgen
    MacIsaac, Dawn
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 739 - +
  • [6] Classifying "kinase inhibitor-likeness" by using machine-learning methods
    Briem, H
    Günther, J
    CHEMBIOCHEM, 2005, 6 (03) : 558 - 566
  • [7] Applying Machine Learning Techniques for Classifying Cyclin-Dependent Kinase Inhibitors
    Abdelbaky, Ibrahim Z.
    Al-Sadek, Ahmed F.
    Badr, Amr A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 229 - 235
  • [8] Redefining the protein kinase conformational space with machine learning
    Rahman, Rayees
    Ung, Peter
    Schlessinger, Avner
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [9] Redefining the Protein Kinase Conformational Space with Machine Learning
    Ung, Peter Man-Un
    Rahman, Rayees
    Schlessinger, Avner
    CELL CHEMICAL BIOLOGY, 2018, 25 (07): : 916 - +
  • [10] Redefining the Protein Kinase Conformational Space with Machine Learning
    Ung, Peter Man-Un
    Rahman, Rayees
    Schlessinger, Avner
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 58A - 59A