DeepSA: a deep-learning driven predictor of compound synthesis accessibility

被引:0
|
作者
Shihang Wang
Lin Wang
Fenglei Li
Fang Bai
机构
[1] ShanghaiTech University,Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology
[2] ShanghaiTech University,School of Information Science and Technology
[3] Shanghai Clinical Research and Trial Center,undefined
来源
关键词
Synthetic accessibility; Drug design; Deep learning; Chemical language model;
D O I
暂无
中图分类号
学科分类号
摘要
With the continuous development of artificial intelligence technology, more and more computational models for generating new molecules are being developed. However, we are often confronted with the question of whether these compounds are easy or difficult to synthesize, which refers to synthetic accessibility of compounds. In this study, a deep learning based computational model called DeepSA, was proposed to predict the synthesis accessibility of compounds, which provides a useful tool to choose molecules. DeepSA is a chemical language model that was developed by training on a dataset of 3,593,053 molecules using various natural language processing (NLP) algorithms, offering advantages over state-of-the-art methods and having a much higher area under the receiver operating characteristic curve (AUROC), i.e., 89.6%, in discriminating those molecules that are difficult to synthesize. This helps users select less expensive molecules for synthesis, reducing the time and cost required for drug discovery and development. Interestingly, a comparison of DeepSA with a Graph Attention-based method shows that using SMILES alone can also efficiently visualize and extract compound’s informative features. DeepSA is available online on the below web server (https://bailab.siais.shanghaitech.edu.cn/services/deepsa/) of our group, and the code is available at https://github.com/Shihang-Wang-58/DeepSA.
引用
收藏
相关论文
共 50 条
  • [41] Partially Deep-Learning Encryption Technique
    Mousa, Hamdy M.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 4277 - 4291
  • [42] A Deep-Learning Model for Cancer Therapies
    不详
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2021, 109 (02) : 284 - 284
  • [43] Deep-learning based tractography for neonates
    Mukherjee, Sovanlal
    Paquette, Natacha
    Gajawelli, Niharika
    Wang, Yalin
    Wallace, Julia
    Nelson, Marvin D.
    Panigrahy, Ashok
    Lepore, Natasha
    16TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2020, 11583
  • [44] Barrett oesophagus: deep-learning diagnosis?
    Hindson J.
    Nature Reviews Gastroenterology & Hepatology, 2021, 18 (6) : 368 - 368
  • [45] Deep-learning continuous gravitational waves
    Dreissigacker, Christoph
    Sharma, Rahul
    Messenger, Chris
    Zhao, Ruining
    Prix, Reinhard
    PHYSICAL REVIEW D, 2019, 100 (04)
  • [46] A deep-learning approach to mining conditions
    Gallego, Fernando O.
    Corchuelo, Rafael
    KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [47] Deep-learning jets with uncertainties and more
    Bollweg, Sasha
    Haussmann, Manuel
    Kasieczka, Gregor
    Luchmann, Michel
    Plehn, Tilman
    Thompson, Jennifer
    SCIPOST PHYSICS, 2020, 8 (01):
  • [48] SNOTBOT: A Whale of a Deep-Learning Project
    Keller, Bryn
    Willke, Ted
    IEEE SPECTRUM, 2019, 56 (12) : 41 - +
  • [49] Deep-Learning Electron Diffractive Imaging
    Chang, Dillan J.
    O'Leary, Colum M.
    Su, Cong
    Jacobs, Daniel A.
    Kahn, Salman
    Zettl, Alex
    Ciston, Jim
    Ercius, Peter
    Miao, Jianwei
    PHYSICAL REVIEW LETTERS, 2023, 130 (01)
  • [50] Re: A Call for Deep-learning Healthcare
    Bian, Jiang
    Min, Jae S.
    Prosperi, Mattia
    Wang, Mo
    EPIDEMIOLOGY, 2020, 31 (02) : E22 - E22