Molecular Property Prediction and Molecular Design Using a Supervised Grammar Variational Autoencoder

被引:13
|
作者
Oliveira, Andre F. [2 ]
Da Silva, Juarez L. F. [1 ]
Quiles, Marcos G. [3 ]
机构
[1] Univ Sao Paulo, Sao Carlos Inst Chem, BR-13560970 Sao Carlos, SP, Brazil
[2] Natl Inst Space Res, Associate Lab Comp & Appl Math, BR-12227010 Sao Jose Dos Campos, SP, Brazil
[3] Univ Fed Sao Paulo, Inst Sci & Technol, BR-12247014 Sao Jose Dos Campos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
DISCOVERY;
D O I
10.1021/acs.jcim.1c01573
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Some of the most common applications of machine learning (ML) algorithms dealing with small molecules usually fall within two distinct domains, namely, the prediction of molecular properties and the design of novel molecules with some desirable property. Here we unite these applications under a single molecular representation and ML algorithm by modifying the grammar variational autoencoder (GVAE) model with the incorporation of property information into its training procedure, thus creating a supervised GVAE (SGVAE). Results indicate that the biased latent space generated by this approach can successfully be used to predict the molecular properties of the input molecules, produce novel and unique molecules with some desired property and also estimate the properties of random sampled molecules. We illustrate these possibilities by sampling novel molecules from the latent space with specific values of the lowest unoccupied molecular orbital (LUMO) energy after training the model using the QM9 data set. Furthermore, the trained model is also used to predict the properties of a hold-out set and the resulting mean absolute error (MAE) shows values close to chemical accuracy for the dipole moment and atomization energies, even outperforming ML models designed to exclusive predict molecular properties using the SMILES as molecular representation. Therefore, these results show that the proposed approach is a viable way to provide generative ML models with molecular property information in a way that the generation of novel molecules is likely to achieve better results, with the benefit that these new molecules can also have their molecular properties accurately predicted.
引用
收藏
页码:817 / 828
页数:12
相关论文
共 50 条
  • [41] An un-supervised approach for backorder prediction using deep autoencoder
    Saraogi G.
    Gupta D.
    Sharma L.
    Rana A.
    Recent Advances in Computer Science and Communications, 2021, 14 (02) : 500 - 511
  • [42] A SEMI-SUPERVISED APPROACH FOR IDENTIFYING ABNORMAL HEART SOUNDS USING VARIATIONAL AUTOENCODER
    Banerjee, Rohan
    Ghose, Avik
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1249 - 1253
  • [43] New Learning Models Based on the Combination of Variational Graph Autoencoder/Graph Autoencoder with Context Self-supervised Learning for Link Prediction
    Zhang, Jian
    Gao, Yun Hai
    Zhang, Gui Yun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 245 - 256
  • [44] Autonomous Vehicle Path Prediction Using Conditional Variational Autoencoder Networks
    Jagadish, D. N.
    Chauhan, Arun
    Mahto, Lakshman
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 129 - 139
  • [45] A Semi-Supervised Learning Method for MiRNA-Disease Association Prediction Based on Variational Autoencoder
    Ji, Cunmei
    Wang, Yutian
    Gao, Zhen
    Li, Lei
    Ni, Jiancheng
    Zheng, Chunhou
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (04) : 2049 - 2059
  • [46] Application of Generative Autoencoder in De Novo Molecular Design
    Blaschke, Thomas
    Olivecrona, Marcus
    Engkvist, Ola
    Bajorath, Jurgen
    Chen, Hongming
    MOLECULAR INFORMATICS, 2018, 37 (1-2)
  • [47] Molecular design using quantum chemical calculations for property estimation
    Lehmann, A
    Maranas, CD
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2004, 43 (13) : 3419 - 3432
  • [48] QSARtuna: An Automated QSAR Modeling Platform for Molecular Property Prediction in Drug Design
    Mervin, Lewis
    Voronov, Alexey
    Kabeshov, Mikhail
    Engkvist, Ola
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (14) : 5365 - 5374
  • [49] VAE-Sim: A Novel Molecular Similarity Measure Based on a Variational Autoencoder
    Samanta, Soumitra
    O'Hagan, Steve
    Swainston, Neil
    Roberts, Timothy J.
    Kell, Douglas B.
    MOLECULES, 2020, 25 (15):
  • [50] Molecular Descriptors Property Prediction Using Transformer-Based Approach
    Tran, Tuan
    Ekenna, Chinwe
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (15)