Machine Learning Approaches for Predicting Protein Complex Similarity

Cited by: 0

Authors:

Farhoodi, Roshanak ^{[1]}

Akbal-Delibas, Bahar ^{[2]}

Haspel, Nurit ^{[1]}

Affiliations:

[1] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA

[2] Kadir Has Univ, Dept Comp Engn, Istanbul, Turkey

Source:

JOURNAL OF COMPUTATIONAL BIOLOGY | 2017 / Vol. 24 / Issue 01

Keywords:

machine learning; neural networks; protein docking and refinement; RMSD prediction; scoring functions; EVOLUTIONARY TRACE; WEB SERVER; DOCKING; ELECTROSTATICS; DESOLVATION; REFINEMENT; ALGORITHMS;

DOI:

10.1089/cmb.2016.0137

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification code:

071010 ; 081704 ;

Abstract:

Discriminating native-like structures from false positives with high accuracy is one of the biggest challenges in protein-protein docking. While there is an agreement on the existence of a relationship between various favorable intermolecular interactions (e.g., Van der Waals, electrostatic, and desolvation forces) and the similarity of a conformation to its native structure, the precise nature of this relationship is not known. Existing protein-protein docking methods typically formulate this relationship as a weighted sum of selected terms and calibrate their weights by using a training set to evaluate and rank candidate complexes. Despite improvements in the predictive power of recent docking methods, producing a large number of false positives by even state-of-the-art methods often leads to failure in predicting the correct binding of many complexes. With the aid of machine learning methods, we tested several approaches that not only rank candidate structures relative to each other but also predict how similar each candidate is to the native conformation. We trained a two-layer neural network, a multilayer neural network, and a network of Restricted Boltzmann Machines against extensive data sets of unbound complexes generated by RosettaDock and PyDock. We validated these methods with a set of refinement candidate structures. We were able to predict the root mean squared deviations (RMSDs) of protein complexes with a very small, often less than 1.5 angstrom, error margin when trained with structures that have RMSD values of up to 7 angstrom. In our most recent experiments with the protein samples having RMSD values up to 27 angstrom, the average prediction error was still relatively small, attesting to the potential of our approach in predicting the correct binding of protein-protein complexes.
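
The two quantities the abstract contrasts can be illustrated with a minimal Python sketch: a classical weighted-sum score over interaction terms, and the RMSD between a candidate and the native structure, which is the value the networks are trained to predict. This is not the authors' implementation. The function names, term names, weights, and coordinates are all hypothetical, and the RMSD helper assumes the two structures are already superimposed with atoms paired by index.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equally sized lists of (x, y, z) points.

    Assumption: the structures are already superimposed and the
    atoms are paired by index (no alignment step is performed).
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def weighted_sum_score(terms, weights):
    """Classical docking score: a weighted sum of selected interaction terms."""
    return sum(weights[name] * value for name, value in terms.items())


# Hypothetical toy values, chosen only to make the arithmetic easy to follow.
native = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
candidate = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]  # shifted by 1 unit along z

terms = {"vdw": -2.0, "elec": -1.0, "desolv": 0.5}
weights = {"vdw": 1.0, "elec": 0.5, "desolv": 2.0}

print(rmsd(native, candidate))             # 1.0
print(weighted_sum_score(terms, weights))  # -1.5
```

A weighted-sum score only orders candidates relative to each other, whereas a regressor trained on the same interaction terms with RMSD as the target would also estimate how far each candidate is from the native conformation.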


Pages: 40 - 51

Number of pages: 12
