Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction

被引:35
|
作者
Liu, Xiang [1 ]
Feng, Huitao [1 ,2 ]
Wu, Jie [3 ]
Xia, Kelin [4 ]
机构
[1] Nankai Univ, Tianjin, Peoples R China
[2] Chongqing Univ Technol, Math Sci Res Ctr, Chongqing, Peoples R China
[3] Hebei Normal Univ, Shijiazhuang, Hebei, Peoples R China
[4] Nanyang Technol Univ, Singapore, Singapore
关键词
Persistent spectral hypergraph; Machine learning; Hodge Laplacian; Drug design; HOMOLOGY; DESCRIPTORS; DIGRAPHS; GRAPHS;
D O I
10.1093/bib/bbab127
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints for the first time. Our PSH-based molecular descriptors are used in the characterization of molecular structures and interactions, and further combined with machine learning models, in particular gradient boosting tree (GBT), for protein-ligand binding affinity prediction. Different from traditional molecular descriptors, which are usually based on molecular graph models, a hypergraph-based topological representation is proposed for protein-ligand interaction characterization. Moreover, a filtration process is introduced to generate a series of nested hypergraphs in different scales. For each of these hypergraphs, its eigen spectrum information can be obtained from the corresponding (Hodge) Laplacain matrix. PSH studies the persistence and variation of the eigen spectrum of the nested hypergraphs during the filtration process. Molecular descriptors or fingerprints can be generated from persistent attributes, which are statistical or combinatorial functions of PSH, and combined with machine learning models, in particular, GBT. We test our PSH-GBT model on three most commonly used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. Our results, for all these databases, are better than all existing machine learning models with traditional molecular descriptors, as far as we know.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Persistent spectral-based machine learning (PerSpect ML) for protein-ligand binding affinity prediction
    Meng, Zhenyu
    Xia, Kelin
    SCIENCE ADVANCES, 2021, 7 (19)
  • [2] Persistent Path-Spectral (PPS) Based Machine Learning for Protein-Ligand Binding Affinity Prediction
    Liu, Ran
    Liu, Xiang
    Wu, Jie
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (03) : 1066 - 1075
  • [3] Persistent Directed Flag Laplacian (PDFL)-Based Machine Learning for Protein-Ligand Binding Affinity Prediction
    Zia, Mushal
    Jones, Benjamin
    Feng, Hongsong
    Wei, Guo-Wei
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2025,
  • [4] Ollivier Persistent Ricci Curvature-Based Machine Learning for the Protein-Ligand Binding Affinity Prediction
    Wee, JunJie
    Xia, Kelin
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (04) : 1617 - 1626
  • [5] Integration of element specific persistent homology and machine learning for protein-ligand binding affinity prediction
    Cang, Zixuan
    Wei, Guo-Wei
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING, 2018, 34 (02)
  • [6] Forman persistent Ricci curvature (FPRC)-based machine learning models for protein-ligand binding affinity prediction
    Wee, JunJie
    Xia, Kelin
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [7] Protein-Ligand Binding Affinity Prediction Based on Deep Learning
    Lu, Yaoyao
    Liu, Junkai
    Jiang, Tengsheng
    Guan, Shixuan
    Wu, Hongjie
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 310 - 316
  • [8] Prediction of protein-ligand binding affinity with deep learning
    Wang, Yuxiao
    Jiao, Qihong
    Wang, Jingxuan
    Cai, Xiaojun
    Zhao, Wei
    Cui, Xuefeng
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 5796 - 5806
  • [9] Dowker complex based machine learning (DCML) models for protein-ligand binding affinity prediction
    Liu, Xiang
    Feng, Huitao
    Wu, Jie
    Xia, Kelin
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (04)
  • [10] An Analysis of Proteochemometric and Conformal Prediction Machine Learning Protein-Ligand Binding Affinity Models
    Parks, Conor
    Gaieb, Zied
    Amaro, Rommie E.
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2020, 7