DeepSA: a deep-learning driven predictor of compound synthesis accessibility

被引:0
|
作者
Shihang Wang
Lin Wang
Fenglei Li
Fang Bai
机构
[1] ShanghaiTech University,Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology
[2] ShanghaiTech University,School of Information Science and Technology
[3] Shanghai Clinical Research and Trial Center,undefined
来源
关键词
Synthetic accessibility; Drug design; Deep learning; Chemical language model;
D O I
暂无
中图分类号
学科分类号
摘要
With the continuous development of artificial intelligence technology, more and more computational models for generating new molecules are being developed. However, we are often confronted with the question of whether these compounds are easy or difficult to synthesize, which refers to synthetic accessibility of compounds. In this study, a deep learning based computational model called DeepSA, was proposed to predict the synthesis accessibility of compounds, which provides a useful tool to choose molecules. DeepSA is a chemical language model that was developed by training on a dataset of 3,593,053 molecules using various natural language processing (NLP) algorithms, offering advantages over state-of-the-art methods and having a much higher area under the receiver operating characteristic curve (AUROC), i.e., 89.6%, in discriminating those molecules that are difficult to synthesize. This helps users select less expensive molecules for synthesis, reducing the time and cost required for drug discovery and development. Interestingly, a comparison of DeepSA with a Graph Attention-based method shows that using SMILES alone can also efficiently visualize and extract compound’s informative features. DeepSA is available online on the below web server (https://bailab.siais.shanghaitech.edu.cn/services/deepsa/) of our group, and the code is available at https://github.com/Shihang-Wang-58/DeepSA.
引用
收藏
相关论文
共 50 条
  • [31] A Biology-Driven Deep-Learning Based Cell Survival Model for Proton Radiation
    Lai, Y.
    Shen, C.
    Chi, Y.
    Guan, F.
    Jia, X.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [32] Adaptive Data-Driven Deep-Learning Surrogate Model for Frontal Polymerization in Dicyclopentadiene
    Liu, Qibang
    Abueidda, Diab
    Vyas, Sagar
    Gao, Yuan
    Koric, Seid
    Geubelle, Philippe H.
    JOURNAL OF PHYSICAL CHEMISTRY B, 2024, 128 (05): : 1220 - 1230
  • [33] A trial of deep-learning detection in colonoscopy
    Dickson I.
    Nature Reviews Gastroenterology & Hepatology, 2020, 17 (4) : 194 - 194
  • [34] On the Implicit Bias in Deep-Learning Algorithms
    Vardi, Gal
    COMMUNICATIONS OF THE ACM, 2023, 66 (06) : 86 - 93
  • [35] Improving Testing of Deep-Learning Systems
    Deokuliar, Harsh
    Sangwan, Raghvinder S.
    Badr, Yoaukim
    Srinivasan, Satish M.
    COMMUNICATIONS OF THE ACM, 2024, 67 (03) : 44 - 48
  • [36] New Cognitive Deep-Learning CAPTCHA
    Trong, Nghia Dinh
    Huong, Thien Ho
    Hoang, Vinh Truong
    SENSORS, 2023, 23 (04)
  • [37] Deep-learning for ionogram automatic scaling
    Xiao, Zhuowei
    Wang, Jian
    Li, Juan
    Zhao, Biqiang
    Hu, Lianhuan
    Liu, Libo
    ADVANCES IN SPACE RESEARCH, 2020, 66 (04) : 942 - 950
  • [38] Deep-learning in Identification of Vocal Pathologies
    Teixeira, Felipe L.
    Teixeira, Joao P.
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 4: BIOSIGNALS, 2020, : 288 - 295
  • [39] Detection of diabetic blindness with Deep-Learning
    Singh, Anukriti
    Kim, Wooyoung
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2440 - 2447
  • [40] Deep-Learning Inversion of Seismic Data
    Li, Shucai
    Liu, Bin
    Ren, Yuxiao
    Chen, Yangkang
    Yang, Senlin
    Wang, Yunhai
    Jiang, Peng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (03): : 2135 - 2149