Molecular Property Prediction and Molecular Design Using a Supervised Grammar Variational Autoencoder

被引:13
|
作者
Oliveira, Andre F. [2 ]
Da Silva, Juarez L. F. [1 ]
Quiles, Marcos G. [3 ]
机构
[1] Univ Sao Paulo, Sao Carlos Inst Chem, BR-13560970 Sao Carlos, SP, Brazil
[2] Natl Inst Space Res, Associate Lab Comp & Appl Math, BR-12227010 Sao Jose Dos Campos, SP, Brazil
[3] Univ Fed Sao Paulo, Inst Sci & Technol, BR-12247014 Sao Jose Dos Campos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
DISCOVERY;
D O I
10.1021/acs.jcim.1c01573
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Some of the most common applications of machine learning (ML) algorithms dealing with small molecules usually fall within two distinct domains, namely, the prediction of molecular properties and the design of novel molecules with some desirable property. Here we unite these applications under a single molecular representation and ML algorithm by modifying the grammar variational autoencoder (GVAE) model with the incorporation of property information into its training procedure, thus creating a supervised GVAE (SGVAE). Results indicate that the biased latent space generated by this approach can successfully be used to predict the molecular properties of the input molecules, produce novel and unique molecules with some desired property and also estimate the properties of random sampled molecules. We illustrate these possibilities by sampling novel molecules from the latent space with specific values of the lowest unoccupied molecular orbital (LUMO) energy after training the model using the QM9 data set. Furthermore, the trained model is also used to predict the properties of a hold-out set and the resulting mean absolute error (MAE) shows values close to chemical accuracy for the dipole moment and atomization energies, even outperforming ML models designed to exclusive predict molecular properties using the SMILES as molecular representation. Therefore, these results show that the proposed approach is a viable way to provide generative ML models with molecular property information in a way that the generation of novel molecules is likely to achieve better results, with the benefit that these new molecules can also have their molecular properties accurately predicted.
引用
收藏
页码:817 / 828
页数:12
相关论文
共 50 条
  • [1] Semi-supervised Variational Autoencoder for Survival Prediction
    Palsson, Sveinn
    Cerri, Stefano
    Dittadi, Andrea
    Van Leemput, Koen
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT II, 2020, 11993 : 124 - 134
  • [2] Embedding of Molecular Structure Using Molecular Hypergraph Variational Autoencoder with Metric Learning
    Koge, Daiki
    Ono, Naoaki
    Huang, Ming
    Altaf-Ul-Amin, Md.
    Kanaya, Shigehiko
    MOLECULAR INFORMATICS, 2021, 40 (02)
  • [3] Molecular generative model based on conditional variational autoencoder for de novo molecular design
    Lim, Jaechang
    Ryu, Seongok
    Kim, Jin Woo
    Kim, Woo Youn
    JOURNAL OF CHEMINFORMATICS, 2018, 10
  • [4] Molecular generative model based on conditional variational autoencoder for de novo molecular design
    Jaechang Lim
    Seongok Ryu
    Jin Woo Kim
    Woo Youn Kim
    Journal of Cheminformatics, 10
  • [5] Supervised desertification classification using Siamese Variational Autoencoder
    Chouikhi, Farah
    Ben Abbes, Ali
    Farah, Imed Riadh
    INTERNATIONAL JOURNAL OF IMAGE AND DATA FUSION, 2025, 16 (01)
  • [6] MoVAE: A Variational AutoEncoder for Molecular Graph Generation
    Lin, Zerun
    Zhang, Yuhan
    Duan, Lixin
    Ou-Yang, Le
    Zhao, Peilin
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 514 - 522
  • [7] IceCoder: Identification of Ice Phases in Molecular Simulation Using Variational Autoencoder
    Maity, Dibyendu
    Chakrabarty, Suman
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2025, 21 (04) : 1916 - 1928
  • [8] Hierarchical Molecular Graph Self-Supervised Learning for property prediction
    Zang, Xuan
    Zhao, Xianbing
    Tang, Buzhou
    COMMUNICATIONS CHEMISTRY, 2023, 6 (01)
  • [9] Hierarchical Molecular Graph Self-Supervised Learning for property prediction
    Xuan Zang
    Xianbing Zhao
    Buzhou Tang
    Communications Chemistry, 6
  • [10] Structural Dynamics Feature Learning Using a Supervised Variational Autoencoder
    Bacsa, Kiran
    Liu, Wei
    Abdallah, Imad
    Chatzi, Eleni
    JOURNAL OF ENGINEERING MECHANICS, 2025, 151 (02)