Assessing machine learning approaches for predicting failures of investigational drug candidates during clinical trials

被引:7
|
作者
John, Lijo [1 ,2 ,3 ]
Mahanta, Hridoy Jyoti [1 ,3 ]
Soujanya, Y. [2 ,3 ]
Sastry, G. Narahari [1 ,2 ,3 ]
机构
[1] CSIR North East Inst Sci & Technol, Adv Computat & Data Sci Div, Jorhat 785006, Assam, India
[2] CSIR Indian Inst Chem Technol, Polymers & Funct Mat Div, Hyderabad 500007, India
[3] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, Uttar Pradesh, India
关键词
Machine learning; Clinical trials; Molecular descriptors; Fingerprints; Feature selection; FEATURE-SELECTION; SUCCESS; ALERTS; SYSTEM;
D O I
10.1016/j.compbiomed.2022.106494
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One of the major challenges in drug development is having acceptable levels of efficacy and safety throughout all the phases of clinical trials followed by the successful launch in the market. While there are many factors such as molecular properties, toxicity parameters, mechanism of action at the target site, etc. that regulates the thera-peutic action of a compound, a holistic approach directed towards data-driven studies will invariably strengthen the predictive toxicological sciences. Our quest for the current study is to find out various reasons as to why an investigational candidate would fail in the clinical trials after multiple iterations of refinement and optimization. We have compiled a dataset that comprises of approved and withdrawn drugs as well as toxic compounds and essentially have used time-split based approach to generate the training and validation set. Five highly robust and scalable machine learning binary classifiers were used to develop the predictive models that were trained with features like molecular descriptors and fingerprints and then validated rigorously to achieve acceptable performance in terms of a set of performance metrics. The mean AUC scores for all the five classifiers with the hold-out test set were obtained in the range of 0.66-0.71. The models were further used to predict the probability score for the clinical candidate dataset. The top compounds predicted to be toxic were analyzed to estimate different dimensions of toxicity. Apparently, through this study, we propose that with the appropriate use of feature extraction and machine learning methods, one can estimate the likelihood of success or failure of investigational drugs candidates thereby opening an avenue for future trends in computational toxicological studies. The models developed in the study can be accessed at https://github.com/gnsastry/predicting_clinical_t rials.git.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Machine learning approaches for drug combination therapies
    Paltun, Betul Guvenc
    Kaski, Samuel
    Mamitsuka, Hiroshi
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [32] Investigational antibody-drug conjugates in clinical trials for the treatment of breast cancer
    Okines, Alicia F. C.
    Ulrich, Lara
    EXPERT OPINION ON INVESTIGATIONAL DRUGS, 2021, 30 (08) : 789 - 795
  • [33] A Comparison of Machine Learning Approaches for Predicting Employee Attrition
    Guerranti, Filippo
    Dimitri, Giovanna Maria
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [34] Application of machine learning approaches for predicting hemophilia A severity
    Rawal, Atul
    Kidchob, Christopher
    Ou, Jiayi
    Sauna, Zuben E.
    JOURNAL OF THROMBOSIS AND HAEMOSTASIS, 2024, 22 (07) : 1909 - 1918
  • [35] Predicting Employee Attrition Using Machine Learning Approaches
    Raza, Ali
    Munir, Kashif
    Almutairi, Mubarak
    Younas, Faizan
    Fareed, Mian Muhammad Sadiq
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [36] MACHINE LEARNING APPROACHES TO UNDERSTANDING AND PREDICTING PATTERNS OF ADHERENCE
    Chakraborty, Shayok
    Bhattacharya, Aditya
    Tian, Shubo
    Roque, Nelson
    He, Zhe
    Boot, Walter
    INNOVATION IN AGING, 2021, 5 : 551 - 551
  • [37] Machine Learning Approaches for Predicting Protein Complex Similarity
    Farhoodi, Roshanak
    Akbal-Delibas, Bahar
    Haspel, Nurit
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2017, 24 (01) : 40 - 51
  • [38] Predicting Location of Tweets Using Machine Learning Approaches
    Alsaqer, Mohammed
    Alelyani, Salem
    Mohana, Mohamed
    Alreemy, Khalid
    Alqahtani, Ali
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [39] Machine Learning for Predicting Infrastructure Faults and Job Failures in Clouds: A Survey
    Shayesteh, Behshid
    Ebrahimzadeh, Amin
    Glitho, Roch
    IEEE COMMUNICATIONS MAGAZINE, 2025, 63 (01) : 148 - 154
  • [40] Enhancing manufacturing process by predicting component failures using machine learning
    Raihanus Saadat
    Sharifah Mashita Syed-Mohamad
    Athira Azmi
    Pantea Keikhosrokiani
    Neural Computing and Applications, 2022, 34 : 18155 - 18169