Molecular Property Prediction and Molecular Design Using a Supervised Grammar Variational Autoencoder

被引:13
|
作者
Oliveira, Andre F. [2 ]
Da Silva, Juarez L. F. [1 ]
Quiles, Marcos G. [3 ]
机构
[1] Univ Sao Paulo, Sao Carlos Inst Chem, BR-13560970 Sao Carlos, SP, Brazil
[2] Natl Inst Space Res, Associate Lab Comp & Appl Math, BR-12227010 Sao Jose Dos Campos, SP, Brazil
[3] Univ Fed Sao Paulo, Inst Sci & Technol, BR-12247014 Sao Jose Dos Campos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
DISCOVERY;
D O I
10.1021/acs.jcim.1c01573
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Some of the most common applications of machine learning (ML) algorithms dealing with small molecules usually fall within two distinct domains, namely, the prediction of molecular properties and the design of novel molecules with some desirable property. Here we unite these applications under a single molecular representation and ML algorithm by modifying the grammar variational autoencoder (GVAE) model with the incorporation of property information into its training procedure, thus creating a supervised GVAE (SGVAE). Results indicate that the biased latent space generated by this approach can successfully be used to predict the molecular properties of the input molecules, produce novel and unique molecules with some desired property and also estimate the properties of random sampled molecules. We illustrate these possibilities by sampling novel molecules from the latent space with specific values of the lowest unoccupied molecular orbital (LUMO) energy after training the model using the QM9 data set. Furthermore, the trained model is also used to predict the properties of a hold-out set and the resulting mean absolute error (MAE) shows values close to chemical accuracy for the dipole moment and atomization energies, even outperforming ML models designed to exclusive predict molecular properties using the SMILES as molecular representation. Therefore, these results show that the proposed approach is a viable way to provide generative ML models with molecular property information in a way that the generation of novel molecules is likely to achieve better results, with the benefit that these new molecules can also have their molecular properties accurately predicted.
引用
收藏
页码:817 / 828
页数:12
相关论文
共 50 条
  • [31] Geometry-Based Molecular Generation With Deep Constrained Variational Autoencoder
    Li, Chunyan
    Yao, Junfeng
    Wei, Wei
    Niu, Zhangming
    Zeng, Xiangxiang
    Li, Jin
    Wang, Jianmin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4852 - 4861
  • [32] Uncertainty Quantification Using Neural Networks for Molecular Property Prediction
    Hirschfeld, Lior
    Swanson, Kyle
    Yang, Kevin
    Barzilay, Regina
    Coley, Connor W.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (08) : 3770 - 3780
  • [33] Self-supervised learning with chemistry-aware fragmentation for effective molecular property prediction
    Xie, Ailin
    Zhang, Ziqiao
    Guan, Jihong
    Zhou, Shuigeng
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [34] GDMol: Generative Double-Masking Self-Supervised Learning for Molecular Property Prediction
    Liu, Yingxu
    Fan, Qing
    Xu, Chengcheng
    Ning, Xiangzhen
    Wang, Yu
    Liu, Yang
    Zhang, Yanmin
    Chen, Yadong
    Liu, Haichun
    MOLECULAR INFORMATICS, 2024,
  • [35] Molecular sharing and molecular-specific representations for multimodal molecular property prediction
    Tian, Xuecong
    Zhang, Sizhe
    Su, Ying
    Huang, Wanhua
    Zhang, Yongzheng
    Ma, Xuan
    Li, Keao
    Lv, Xiaoyi
    Chen, Chen
    Chen, Cheng
    APPLIED SOFT COMPUTING, 2024, 163
  • [36] A multiscale molecular structural neural network for molecular property prediction
    Shi, Zhiwei
    Ma, Miao
    Ning, Hanyang
    Yang, Bo
    Dang, Jingshuang
    MOLECULAR DIVERSITY, 2025,
  • [37] Assigning confidence to molecular property prediction
    Nigam, AkshatKumar
    Pollice, Robert
    Hurley, Matthew F. D.
    Hickman, Riley J.
    Aldeghi, Matteo
    Yoshikawa, Naruki
    Chithrananda, Seyone
    Voelz, Vincent A.
    Aspuru-Guzik, Alan
    EXPERT OPINION ON DRUG DISCOVERY, 2021, 16 (09) : 1009 - 1023
  • [38] Autonomous design of new chemical reactions using a variational autoencoder
    Tempke, Robert
    Musho, Terence
    COMMUNICATIONS CHEMISTRY, 2022, 5 (01)
  • [39] Inverse Design Method of Pressure Distribution Using Variational Autoencoder
    Song, Chao
    Luo, Xiao
    Liu, Hongyang
    Yu, Yonggang
    Li, Weibin
    2023 ASIA-PACIFIC INTERNATIONAL SYMPOSIUM ON AEROSPACE TECHNOLOGY, VOL II, APISAT 2023, 2024, 1051 : 1595 - 1610
  • [40] Autonomous design of new chemical reactions using a variational autoencoder
    Robert Tempke
    Terence Musho
    Communications Chemistry, 5