Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction

被引:35
|
作者
Liu, Xiang [1 ]
Feng, Huitao [1 ,2 ]
Wu, Jie [3 ]
Xia, Kelin [4 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
[2] Chongqing Univ Technol, Math Sci Res Ctr, Chongqing, Peoples R China
[3] Hebei Normal Univ, Shijiazhuang, Hebei, Peoples R China
[4] Nanyang Technol Univ, Singapore, Singapore
关键词
Persistent spectral hypergraph; Machine learning; Hodge Laplacian; Drug design; HOMOLOGY; DESCRIPTORS; DIGRAPHS; GRAPHS;
D O I
10.1093/bib/bbab127
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints for the first time. Our PSH-based molecular descriptors are used in the characterization of molecular structures and interactions, and further combined with machine learning models, in particular gradient boosting tree (GBT), for protein-ligand binding affinity prediction. Different from traditional molecular descriptors, which are usually based on molecular graph models, a hypergraph-based topological representation is proposed for protein-ligand interaction characterization. Moreover, a filtration process is introduced to generate a series of nested hypergraphs in different scales. For each of these hypergraphs, its eigen spectrum information can be obtained from the corresponding (Hodge) Laplacain matrix. PSH studies the persistence and variation of the eigen spectrum of the nested hypergraphs during the filtration process. Molecular descriptors or fingerprints can be generated from persistent attributes, which are statistical or combinatorial functions of PSH, and combined with machine learning models, in particular, GBT. We test our PSH-GBT model on three most commonly used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. Our results, for all these databases, are better than all existing machine learning models with traditional molecular descriptors, as far as we know.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Binding Affinity Prediction for Protein-Ligand Complexes Based on β Contacts and B Factor
    Liu, Qian
    Kwoh, Chee Keong
    Li, Jinyan
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (11) : 3076 - 3085
  • [32] Exploring protein-ligand binding affinity prediction with electron density-based geometric deep learning
    Isert, Clemens
    Atz, Kenneth
    Riniker, Sereina
    Schneider, Gisbert
    RSC ADVANCES, 2024, 14 (07) : 4492 - 4502
  • [33] A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking
    Ballester, Pedro J.
    Mitchell, John B. O.
    BIOINFORMATICS, 2010, 26 (09) : 1169 - 1175
  • [34] Protein-ligand binding affinity prediction model based on graph attention network
    Yuan, Hong
    Huang, Jing
    Li, Jin
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (06) : 9148 - 9162
  • [35] Effects of data quality and quantity on deep learning for protein-ligand binding affinity prediction
    Fan, Frankie J.
    Shi, Yun
    BIOORGANIC & MEDICINAL CHEMISTRY, 2022, 72
  • [36] Join Persistent Homology (JPH)-Based Machine Learning for Metalloprotein-Ligand Binding Affinity Prediction
    Wang, Yaxing
    Liu, Xiang
    Zhang, Yipeng
    Wang, Xiangjun
    Xia, Kelin
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2025,
  • [37] A Folding-Docking-Affinity framework for protein-ligand binding affinity prediction
    Ming-Hsiu Wu
    Ziqian Xie
    Degui Zhi
    Communications Chemistry, 8 (1)
  • [38] Learning protein-ligand binding affinity with atomic environment vectors
    Meli, Rocco
    Anighoro, Andrew
    Bodkin, Mike J.
    Morris, Garrett M.
    Biggin, Philip C.
    JOURNAL OF CHEMINFORMATICS, 2021, 13 (01)
  • [39] Learning protein-ligand binding affinity with atomic environment vectors
    Rocco Meli
    Andrew Anighoro
    Mike J. Bodkin
    Garrett M. Morris
    Philip C. Biggin
    Journal of Cheminformatics, 13
  • [40] Predicting the impacts of mutations on protein-ligand binding affinity based on molecular dynamics simulations and machine learning methods
    Wang, Debby D.
    Le Ou-Yang
    Xie, Haoran
    Zhu, Mengxu
    Hong Yan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 439 - 454