Assessing machine learning approaches for predicting failures of investigational drug candidates during clinical trials

被引:7
|
作者
John, Lijo [1 ,2 ,3 ]
Mahanta, Hridoy Jyoti [1 ,3 ]
Soujanya, Y. [2 ,3 ]
Sastry, G. Narahari [1 ,2 ,3 ]
机构
[1] CSIR North East Inst Sci & Technol, Adv Computat & Data Sci Div, Jorhat 785006, Assam, India
[2] CSIR Indian Inst Chem Technol, Polymers & Funct Mat Div, Hyderabad 500007, India
[3] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, Uttar Pradesh, India
关键词
Machine learning; Clinical trials; Molecular descriptors; Fingerprints; Feature selection; FEATURE-SELECTION; SUCCESS; ALERTS; SYSTEM;
D O I
10.1016/j.compbiomed.2022.106494
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One of the major challenges in drug development is having acceptable levels of efficacy and safety throughout all the phases of clinical trials followed by the successful launch in the market. While there are many factors such as molecular properties, toxicity parameters, mechanism of action at the target site, etc. that regulates the thera-peutic action of a compound, a holistic approach directed towards data-driven studies will invariably strengthen the predictive toxicological sciences. Our quest for the current study is to find out various reasons as to why an investigational candidate would fail in the clinical trials after multiple iterations of refinement and optimization. We have compiled a dataset that comprises of approved and withdrawn drugs as well as toxic compounds and essentially have used time-split based approach to generate the training and validation set. Five highly robust and scalable machine learning binary classifiers were used to develop the predictive models that were trained with features like molecular descriptors and fingerprints and then validated rigorously to achieve acceptable performance in terms of a set of performance metrics. The mean AUC scores for all the five classifiers with the hold-out test set were obtained in the range of 0.66-0.71. The models were further used to predict the probability score for the clinical candidate dataset. The top compounds predicted to be toxic were analyzed to estimate different dimensions of toxicity. Apparently, through this study, we propose that with the appropriate use of feature extraction and machine learning methods, one can estimate the likelihood of success or failure of investigational drugs candidates thereby opening an avenue for future trends in computational toxicological studies. The models developed in the study can be accessed at https://github.com/gnsastry/predicting_clinical_t rials.git.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Predicting Cascading Failures in Power Grids using Machine Learning Algorithms
    Shuvro, Rezoan A.
    Das, Pankaz
    Hayat, Majeed M.
    Talukder, Mitun
    2019 51ST NORTH AMERICAN POWER SYMPOSIUM (NAPS), 2019,
  • [42] Predicting Football Match Outcomes With Machine Learning Approaches
    Choi B.S.
    Foo L.K.
    Chua S.-L.
    Mendel, 2023, 29 (02) : 229 - 236
  • [43] Predicting US bank failures and stress testing with machine learning algorithms
    Hu, Wendi
    Shao, Chujian
    Zhang, Wenyu
    FINANCE RESEARCH LETTERS, 2025, 75
  • [44] Enhancing manufacturing process by predicting component failures using machine learning
    Saadat, Raihanus
    Syed-Mohamad, Sharifah Mashita
    Azmi, Athira
    Keikhosrokiani, Pantea
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (20): : 18155 - 18169
  • [45] A machine learning framework for predicting drug-drug interactions
    Mei, Suyu
    Zhang, Kun
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [46] IDA 2016 Industrial Challenge: Using Machine Learning for Predicting Failures
    Costa, Camila Ferreira
    Nascimento, Mario A.
    ADVANCES IN INTELLIGENT DATA ANALYSIS XV, 2016, 9897 : 381 - 386
  • [47] Predicting Cargo Train Failures: A Machine Learning Approach for a Lightweight Prototype
    Kauschke, Sebastian
    Fuernkranz, Johannes
    Janssen, Frederik
    DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 151 - 166
  • [48] "Memory Loss" in Commodity Hardware? Predicting DIMM Failures with Machine Learning
    Giurgiu, Ioana
    Wiesmann, Dorothea
    Bird, John
    SYSTOR'17: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, 2017,
  • [49] Machine Learning Tools for Improving the Efficiency of Drug Development Clinical Trials in Neurodegenerative Diseases
    Ennist, David
    Beaulieu, Danielle
    Jahandideh, Samad
    Taylor, Albert
    NEUROLOGY, 2018, 90
  • [50] Predicting Employability of Candidates: Comparative Study of Different Machine Learning Models
    Hitharth, K. B. Sai
    Dhanya, N. M.
    PROCEEDINGS OF EMERGING TRENDS AND TECHNOLOGIES ON INTELLIGENT SYSTEMS (ETTIS 2021), 2022, 1371 : 179 - 190