Molecular Property Prediction and Molecular Design Using a Supervised Grammar Variational Autoencoder

Citations: 13
Authors
Oliveira, Andre F. [2 ]
Da Silva, Juarez L. F. [1 ]
Quiles, Marcos G. [3 ]
Affiliations
[1] Univ Sao Paulo, Sao Carlos Inst Chem, BR-13560970 Sao Carlos, SP, Brazil
[2] Natl Inst Space Res, Associate Lab Comp & Appl Math, BR-12227010 Sao Jose Dos Campos, SP, Brazil
[3] Univ Fed Sao Paulo, Inst Sci & Technol, BR-12247014 Sao Jose Dos Campos, SP, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil;
Keywords
DISCOVERY;
DOI
10.1021/acs.jcim.1c01573
Chinese Library Classification number
R914 [Medicinal Chemistry];
Discipline classification code
100701;
Abstract
The most common applications of machine learning (ML) algorithms to small molecules fall within two distinct domains: the prediction of molecular properties and the design of novel molecules with some desirable property. Here we unite these applications under a single molecular representation and ML algorithm by modifying the grammar variational autoencoder (GVAE) model to incorporate property information into its training procedure, thus creating a supervised GVAE (SGVAE). Results indicate that the biased latent space generated by this approach can successfully be used to predict the molecular properties of the input molecules, produce novel and unique molecules with some desired property, and estimate the properties of randomly sampled molecules. We illustrate these possibilities by sampling novel molecules from the latent space with specific values of the lowest unoccupied molecular orbital (LUMO) energy after training the model on the QM9 data set. Furthermore, the trained model is used to predict the properties of a hold-out set, and the resulting mean absolute error (MAE) is close to chemical accuracy for the dipole moment and atomization energies, even outperforming ML models designed exclusively to predict molecular properties using SMILES as the molecular representation. These results show that the proposed approach is a viable way to provide generative ML models with molecular property information, so that the generation of novel molecules is likely to achieve better results, with the added benefit that the properties of these new molecules can also be accurately predicted.
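The abstract does not state the exact training objective, so the following is only a minimal sketch of one common way to add property supervision to a VAE: a reconstruction term over one-hot grammar production rules, a KL term against a standard normal prior, and a mean-squared-error term on a property predicted from the latent code. The function name `sgvae_loss`, the weights `kl_weight` and `prop_weight`, and the array shapes are assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def sgvae_loss(logits, targets, mu, logvar, prop_pred, prop_true,
               kl_weight=1.0, prop_weight=1.0):
    """Hypothetical supervised-VAE objective (illustrative, not the paper's code).

    logits, targets : (batch, seq_len, n_rules) decoder outputs and one-hot rules
    mu, logvar      : (batch, latent_dim) parameters of the approximate posterior
    prop_pred/true  : (batch,) predicted and reference property values
    """
    # Reconstruction: cross-entropy via a numerically stable log-softmax.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    recon = -(targets * log_probs).sum(axis=(-2, -1)).mean()

    # KL divergence between N(mu, exp(logvar)) and the N(0, I) prior.
    kl = (-0.5 * (1.0 + logvar - mu ** 2 - np.exp(logvar)).sum(axis=-1)).mean()

    # Property supervision: mean squared error on the predicted property.
    prop = ((prop_pred - prop_true) ** 2).mean()

    total = recon + kl_weight * kl + prop_weight * prop
    return total, {"recon": recon, "kl": kl, "prop": prop}
```

In a sketch like this, the property term is what would bias the latent space toward organizing molecules by the target property; how the terms are actually weighted and combined in the SGVAE is described in the paper itself.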
Pages: 817-828
Page count: 12