A deep learning based multi-model approach for predicting drug-like chemical compound's toxicity

被引:4
|
作者
Saravanan, Konda Mani [3 ]
Wan, Jiang-Fan [4 ]
Dai, Liujiang [5 ]
Zhang, Jiajun [1 ,2 ,6 ]
Zhang, John Z. H. [1 ,2 ]
Zhang, Haiping [1 ,2 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Fac Synthet Biol, Shenzhen 518055, Peoples R China
[2] Chinese Acad Sci, Inst Synthet Biol, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Bharath Inst Higher Educ & Res, Dept Biotechnol, Chennai 600073, Tamil Nadu, India
[4] Guangdong Hong Kong Macao Greater Bay Area Ctr Dru, Shenzhen 518000, Peoples R China
[5] Chinese Acad Sci, Inst Biomed & Biotechnol, Guangdong Immune Cell Therapy Engn & Technol Res C, Ctr Prot & Cell Based Drugs,Shenzhen Inst Adv Tech, Shenzhen 518055, Peoples R China
[6] Hunan Univ Technol & Business, Coll Sci, Changsha 410205, Peoples R China
基金
美国国家科学基金会;
关键词
Deep learning models; Multi-model approach; Toxicity prediction; Drug screening; Small molecules; ANIMAL-MODELS;
D O I
10.1016/j.ymeth.2024.04.020
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Ensuring the safety and efficacy of chemical compounds is crucial in small-molecule drug development. In the later stages of drug development, toxic compounds pose a significant challenge, losing valuable resources and time. Early and accurate prediction of compound toxicity using deep learning models offers a promising solution to mitigate these risks during drug discovery. In this study, we present the development of several deep-learning models aimed at evaluating different types of compound toxicity, including acute toxicity, carcinogenicity, hERG_cardiotoxicity (the human ether-a-go-go related gene caused cardiotoxicity), hepatotoxicity, and mutagenicity. To address the inherent variations in data size, label type, and distribution across different types of toxicity, we employed diverse training strategies. Our first approach involved utilizing a graph convolutional network (GCN) regression model to predict acute toxicity, which achieved notable performance with Pearson R 0.76, 0.74, and 0.65 for intraperitoneal, intravenous, and oral administration routes, respectively. Furthermore, we trained multiple GCN binary classification models, each tailored to a specific type of toxicity. These models exhibited high area under the curve (AUC) scores, with an impressive AUC of 0.69, 0.77, 0.88, and 0.79 for predicting carcinogenicity, hERG_cardiotoxicity, mutagenicity, and hepatotoxicity, respectively. Additionally, we have used the approved drug dataset to determine the appropriate threshold value for the prediction score in model usage. We integrated these models into a virtual screening pipeline to assess their effectiveness in identifying potential low-toxicity drug candidates. Our findings indicate that this deep learning approach has the potential to significantly reduce the cost and risk associated with drug development by expediting the selection of compounds with low toxicity profiles. Therefore, the models developed in this study hold promise as critical tools for early drug candidate screening and selection.
引用
收藏
页码:164 / 175
页数:12
相关论文
共 50 条
  • [31] MSDRP: a deep learning model based on multisource data for predicting drug response
    Zhao, Haochen
    Zhang, Xiaoyu
    Zhao, Qichang
    Li, Yaohang
    Wang, Jianxin
    BIOINFORMATICS, 2023, 39 (09)
  • [32] A joint multi-model machine learning prediction approach based on confidence for ship stability
    Jiang, Chaicheng
    Xiang, Xianbo
    Xiang, Gong
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3873 - 3890
  • [33] A novel approach based on deep residual learning to predict drug's anatomical therapeutic chemical code
    Zhao, Haochen
    Ni, Peng
    Yan, Cheng
    Li, Yaohang
    Wang, Jianxin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 921 - 926
  • [34] Temperature Prediction for Stored Grain: A Multi-model Fusion Approach Based on Machine Learning
    Chen, Donghao
    Liu, Binkun
    2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 661 - 665
  • [35] A hybrid deep learning technique based integrated multi-model data fusion for forensic investigation
    Senthil, P.
    Selvakumar, S.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6849 - 6862
  • [36] Automated Brain Tumor Identification in Biomedical Radiology Images: A Multi-Model Ensemble Deep Learning Approach
    Natha, Sarfaraz
    Laila, Umme
    Gashim, Ibrahim Ahmed
    Mahboob, Khalid
    Saeed, Muhammad Noman
    Noaman, Khaled Mohammed
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [37] A novel deep learning-based multi-model ensemble method for the prediction of neuromuscular disorders
    Khamparia, Aditya
    Singh, Aman
    Anand, Divya
    Gupta, Deepak
    Khanna, Ashish
    Kumar, N. Arun
    Tan, Joseph
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (15): : 11083 - 11095
  • [38] PlantDet: A Robust Multi-Model Ensemble Method Based on Deep Learning For Plant Disease Detection
    Shovon, Md. Sakib Hossain
    Mozumder, Shakrin Jahan
    Pal, Osim Kumar
    Mridha, M. F.
    Asai, Nobuyoshi
    Shin, Jungpil
    IEEE ACCESS, 2023, 11 : 34846 - 34859
  • [39] A hybrid deep learning technique based integrated multi-model data fusion for forensic investigation
    Senthil P.
    Selvakumar S.
    Journal of Intelligent and Fuzzy Systems, 2022, 43 (05): : 6849 - 6862
  • [40] Predicting drug-drug interactions based on multi-view and multichannel attention deep learning
    Huang, Liyu
    Chen, Qingfeng
    Lan, Wei
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2023, 11 (01)