Uncovering and tackling fundamental limitations of compound potency predictions using machine learning models

被引:1
|
作者
Janela, Tiago [1 ,2 ]
Bajorath, Juergen [1 ,2 ,3 ]
机构
[1] Univ Bonn, Dept Life Sci Informat & Data Sci, B IT, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
[2] Univ Bonn, Lamarr Inst Machine Learning & Artificial Intellig, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
[3] Univ Bonn, Limes Inst, Program Unit Chem Biol & Med Chem, B IT, Friedrich Hirzebruch Allee 5-6, D-53115 Bonn, Germany
来源
CELL REPORTS PHYSICAL SCIENCE | 2024年 / 5卷 / 06期
关键词
DRUG DISCOVERY; QSAR;
D O I
10.1016/j.xcrp.2024.101988
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Molecular property predictions play a central role in computer-aided drug discovery. Although a variety of physicochemical (e.g., solubility or chemical reactivity) or physiological properties (e.g., metabolic stability or toxicity) can be predicted, biological activity is by far the most frequently investigated compound feature. Activity predictions are carried out in a qualitative (target-based activity, through compound classification) or quantitative (compound potency or studies have evaluated and compared different machine learning methods for activity and potency predictions, recently with a focus on deep learning. Regardless of the methods used, these studies generally rely on conventional benchmark settings. Recent work has shown that potency prediction benchmarks have severe general limitations that have long been unnoticed but prevent a reliable assessment of different methods and their relative performance. In this perspective, we outline general limitations of benchmark settings for compound potency predictions, introduce potential alternatives enabling a more realistic assessment of state-of-the-art predictive models, and discuss future directions for elucidating predictions and further increasing their impact.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Analysis of Conditions for Reliable Predictions by Moodle Machine Learning Models
    Bognar, Laszlo
    Fauszt, Tibor
    Nagy, Gabor Zsolt
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2021, 16 (06) : 106 - 121
  • [22] Machine learning proteochemometric models for Cereblon glue activity predictions
    Prael, Francis J.
    Cox, Jiayi
    Sturm, Noe
    Kutchukian, Peter
    Forrester, William C.
    Michaud, Gregory
    Blank, Jutta
    Shen, Lingling
    Rodriguez-Perez, Raquel
    ARTIFICIAL INTELLIGENCE IN THE LIFE SCIENCES, 2024, 6
  • [23] MolPredictX: Online Biological Activity Predictions by Machine Learning Models
    Scotti, Marcus Tullius
    Herrera-Acevedo, Chonny
    Barros de Menezes, Renata Priscila
    Martin, Holli-Joi
    Muratov, Eugene N.
    de Souza Silva, Avilla Italo
    Albuquerque, Emmanuella Faustino
    Calado, Lucas Ferreira
    Coy-Barrera, Ericsson
    Scotti, Luciana
    MOLECULAR INFORMATICS, 2022, 41 (12)
  • [24] An Empirical Study on Machine Learning Models for Wind Power Predictions
    Liu, Yiqian
    Zhang, Huajie
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 758 - 763
  • [25] Toward interpretable machine learning: evaluating models of heterogeneous predictions
    Zhang, Ruixun
    ANNALS OF OPERATIONS RESEARCH, 2024,
  • [26] On Human Predictions with Explanations and Predictions of Machine Learning Models: A Case Study on Deception Detection
    Lai, Vivian
    Tan, Chenhao
    FAT*'19: PROCEEDINGS OF THE 2019 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2019, : 29 - 38
  • [27] Tackling pandemics in smart cities using machine learning architecture
    Ngabo, Desire
    Dong, Wang
    Ibeke, Ebuka
    Iwendi, Celestine
    Masabo, Emmanuel
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (06) : 8444 - 8461
  • [28] On Some Limitations of Current Machine Learning Weather Prediction Models
    Bonavita, Massimo
    GEOPHYSICAL RESEARCH LETTERS, 2024, 51 (12)
  • [29] Predictions of solute transport in a compound channel using turbulence models
    Shiono, K
    Scott, CF
    Kearney, D
    JOURNAL OF HYDRAULIC RESEARCH, 2003, 41 (03) : 247 - 258
  • [30] Explainable haemoglobin deferral predictions using machine learning models: Interpretation and consequences for the blood supply
    Vinkenoog, Marieke
    van Leeuwen, Matthijs
    Janssen, Mart P.
    VOX SANGUINIS, 2022, 117 (11) : 1262 - 1270