Synergizing GA-XGBoost and QSAR modeling: Breaking down activity aliffs in HDAC1 inhibitors

被引:0
|
作者
Jawarkar, Rahul D. [1 ]
Mali, Suraj [2 ]
Deshmukh, Prashant K. [3 ]
Ingle, Rahul G. [4 ]
Al-Hussain, Sami A.
Al-Mutairi, Aamal A. [5 ]
Zaki, Magdi E. A. [5 ]
机构
[1] Dr Rajendra Gode Inst Pharm, Dept Med Chem, Univ Mardi Rd, Amravati 444602, MS, India
[2] DY Patil Deemed Be Univ, Sch Pharm, Sect 7, Navi Mumbai 400706, India
[3] Dr Rajendra Gode Coll Pharm, Dept Pharmaceut, Buldana Rd, Malkapur 443101, India
[4] Datta Meghe Inst Higher Educ & Res, Datta Meghe Coll Pharm, Wardha, India
[5] Imam Mohammad Ibn Saud Islamic Univ IMSIU, Coll Sci, Dept Chem, Riyadh 11623, Saudi Arabia
来源
JOURNAL OF MOLECULAR GRAPHICS & MODELLING | 2025年 / 135卷
关键词
HDAC1; Extreme gradient boosting analysis; Shapley additive explanations; Genetic algorithms; Activity cliff; HISTONE DEACETYLASE INHIBITORS; POLAR SURFACE-AREA; DRUG DISCOVERY; PHARMACOPHORE; CANCER; ACETYLATION; MECHANISMS; DOCKING; BIOLOGY; TARGET;
D O I
10.1016/j.jmgm.2024.108915
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The work being presented now combines severe gradient boosting with Shapley values, a thriving merger within the field of explainable artificial intelligence. We also use a genetic algorithm to analyse the HDAC1 inhibitory activity of a broad pool of 1274 molecules experimentally reported for HDAC1 inhibition. We conduct this analysis to ascertain the HDAC1 inhibitory activity of these molecules. Based on a rigorous investigation of extreme gradient boosting, the proposed method suggests using a genetic algorithm to identify pharmacophoric features. The statistical acceptability of extreme gradient boosting analysis is robust, with parameters such as R2 tr = 0.8797, R 2 L10 %= 0.8831, Q 2 F1 = 0.9459, Q 2 F2 = 0.9452, and Q 2 F3 = 0.9474. This is the driving force behind the invention of nine Py-descriptor-containing genetic algorithms. Shapley additive explanations formed the basis for the interpretation, assigning a significant value to each variable in the model. This is followed by the use of counterfactual cases to analyse the impact of the discovered molecular descriptors on HDAC1 inhibition. An examination of the molecular descriptors, which include acc_N_3B, fsp2NringC8B, fsp3NC7B, and sp2N_sp3C_3B, demonstrates that these descriptors provide insight into the function that the nitrogen atom plays in influencing HDAC1's inhibitory activity. Furthermore, the investigation shed light on the significant role that the hybridized carbon atoms located in sp2 and sp3 play in HDAC1 inhibition. Thus, the QSAR results are in conformity with the reported findings. In addition, activity cliff analysis supports the QSAR findings. Thus, the genetic algorithm- extreme gradient-boosting GA-XGBoost model is easy to understand and makes decent predictions. Based on this, it indicates that "explainable AI" may prove to be beneficial in the future for the purpose of identifying and using structural features in the process of medication development.
引用
收藏
页数:17
相关论文
共 22 条
  • [21] Pyrazolo[1,5-a][1,3,5]triazin-2-thioxo-4-ones derivatives as thymidine phosphorylase inhibitors: Structure, drug-like calculations and quantitative structure-activity relationships (QSAR) modeling
    Manachou, Marwa
    Gouid, Zied
    Almi, Zineb
    Belaidi, Salah
    Boughdiri, Salima
    Hochlaf, Majdi
    JOURNAL OF MOLECULAR STRUCTURE, 2020, 1199
  • [22] Energy-Based Pharmacophore and Three-Dimensional Quantitative Structure-Activity Relationship (3D-QSAR) Modeling Combined with Virtual Screening To Identify Novel Small-Molecule Inhibitors of Silent Mating-Type Information Regulation 2 Homologue 1 (SIRT1)
    Pulla, Venkat Koushik
    Sriram, Dinavahi Saketh
    Viswanadha, Srikant
    Sriram, Dharmarajan
    Yogeeswari, Perumal
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (01) : 173 - 187