DeepSA: a deep-learning driven predictor of compound synthesis accessibility

被引:0
|
作者
Shihang Wang
Lin Wang
Fenglei Li
Fang Bai
机构
[1] ShanghaiTech University,Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology
[2] ShanghaiTech University,School of Information Science and Technology
[3] Shanghai Clinical Research and Trial Center,undefined
来源
关键词
Synthetic accessibility; Drug design; Deep learning; Chemical language model;
D O I
暂无
中图分类号
学科分类号
摘要
With the continuous development of artificial intelligence technology, more and more computational models for generating new molecules are being developed. However, we are often confronted with the question of whether these compounds are easy or difficult to synthesize, which refers to synthetic accessibility of compounds. In this study, a deep learning based computational model called DeepSA, was proposed to predict the synthesis accessibility of compounds, which provides a useful tool to choose molecules. DeepSA is a chemical language model that was developed by training on a dataset of 3,593,053 molecules using various natural language processing (NLP) algorithms, offering advantages over state-of-the-art methods and having a much higher area under the receiver operating characteristic curve (AUROC), i.e., 89.6%, in discriminating those molecules that are difficult to synthesize. This helps users select less expensive molecules for synthesis, reducing the time and cost required for drug discovery and development. Interestingly, a comparison of DeepSA with a Graph Attention-based method shows that using SMILES alone can also efficiently visualize and extract compound’s informative features. DeepSA is available online on the below web server (https://bailab.siais.shanghaitech.edu.cn/services/deepsa/) of our group, and the code is available at https://github.com/Shihang-Wang-58/DeepSA.
引用
收藏
相关论文
共 50 条
  • [1] DeepSA: a deep-learning driven predictor of compound synthesis accessibility
    Wang, Shihang
    Wang, Lin
    Li, Fenglei
    Bai, Fang
    JOURNAL OF CHEMINFORMATICS, 2023, 15 (01)
  • [2] Model-driven deep-learning
    Xu, Zongben
    Sun, Jian
    NATIONAL SCIENCE REVIEW, 2018, 5 (01) : 22 - 24
  • [3] Model-driven deep-learning
    Zongben Xu
    Jian Sun
    National Science Review, 2018, 5 (01) : 22 - 24
  • [4] Is Monte Carlo uncertainty a good predictor of manual adjustments of deep-learning contours?
    Ionescu, G.
    Looney, P.
    Willaime, J. M. Y.
    Vaasen, F.
    van Elmpt, W.
    Gooding, M. J.
    RADIOTHERAPY AND ONCOLOGY, 2022, 170 : S169 - S170
  • [5] Deep-learning seismology
    Mousavi, S. Mostafa
    Beroza, Gregory C.
    SCIENCE, 2022, 377 (6607) : 725 - +
  • [6] Deep-learning boost
    Rachel Won
    Nature Photonics, 2018, 12 : 443 - 443
  • [7] DeepDriveMD: Deep-Learning Driven Adaptive Molecular Simulations for Protein Folding
    Lee, Hyungro
    Ma, Heng
    Turilli, Matteo
    Bhowmik, Debsindhu
    Jha, Shantenu
    Ramanathan, Arvind
    PROCEEDINGS OF 2019 IEEE/ACM THIRD WORKSHOP ON DEEP LEARNING ON SUPERCOMPUTERS (DLS), 2019, : 12 - 19
  • [8] Deep-learning boost
    Won, Rachel
    NATURE PHOTONICS, 2018, 12 (08) : 443 - 443
  • [9] Deep-learning for microscopy
    Daryl J. V. David
    Nature Cell Biology, 2022, 24 : 1321 - 1321
  • [10] Deep-learning for microscopy
    David, Daryl J. V.
    NATURE CELL BIOLOGY, 2022, 24 (09) : 1321 - 1321