A deep learning framework for accurate reaction prediction and its application on high-throughput experimentation data

被引:15
|
作者
Li, Baiqing [1 ]
Su, Shimin [1 ]
Zhu, Chan [1 ]
Lin, Jie [1 ]
Hu, Xinyue [1 ]
Su, Lebin [1 ]
Yu, Zhunzhun [1 ]
Liao, Kuangbiao [1 ]
Chen, Hongming [1 ]
机构
[1] Guangzhou Lab, Guangzhou 510005, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
NEURAL-NETWORKS; CHEMISTRY; RETROSYNTHESIS; PLATFORM;
D O I
10.1186/s13321-023-00732-w
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In recent years, it has been seen that artificial intelligence (AI) starts to bring revolutionary changes to chemical synthesis. However, the lack of suitable ways of representing chemical reactions and the scarceness of reaction data has limited the wider application of AI to reaction prediction. Here, we introduce a novel reaction representation, GraphRXN, for reaction prediction. It utilizes a universal graph-based neural network framework to encode chemical reactions by directly taking two-dimension reaction structures as inputs. The GraphRXN model was evaluated by three publically available chemical reaction datasets and gave on-par or superior results compared with other baseline models. To further evaluate the effectiveness of GraphRXN, wet-lab experiments were carried out for the purpose of generating reaction data. GraphRXN model was then built on high-throughput experimentation data and a decent accuracy (R-2 of 0.712) was obtained on our in-house data. This highlights that the GraphRXN model can be deployed in an integrated workflow which combines robotics and AI technologies for forward reaction prediction.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Uncovering the key dimensions of high-throughput biomolecular data using deep learning
    Zhang, Shixiong
    Li, Xiangtao
    Lin, Qiuzhen
    Lin, Jiecong
    Wong, Ka-Chun
    NUCLEIC ACIDS RESEARCH, 2020, 48 (10)
  • [22] Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers
    Ryu, Jae Yong
    Kim, Hyun Uk
    Lee, Sang Yup
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (28) : 13996 - 14001
  • [23] Deep learning-based cytoskeleton segmentation for accurate high-throughput measurement of cytoskeleton density
    Horiuchi, Ryota
    Kamimura, Asuka
    Hanaki, Yuga
    Matsumoto, Hikari
    Ueda, Minako
    Higaki, Takumi
    PROTOPLASMA, 2024,
  • [24] A high-throughput experimentation platform for data-driven discovery in electrochemistry
    Lin, Dian-Zhao
    Pan, Kai-Jui
    Li, Yuyin
    Zhang, Lingyu
    Jayarapu, Krish N.
    Li, Tianchen
    Tran, Jasmine Vy
    Goddard, William A.
    Luo, Zhengtang
    Liu, Yayuan
    SCIENCE ADVANCES, 2025, 11 (14):
  • [25] High-Throughput Screening and Accurate Prediction of Ionic Liquid Viscosities Using Interpretable Machine Learning
    Mohan, Mood
    Jetti, Karuna Devi
    Guggilam, Sreelekha
    Smith, Micholas Dean
    Kidder, Michelle K.
    Smith, Jeremy C.
    ACS SUSTAINABLE CHEMISTRY & ENGINEERING, 2024, 12 (18): : 7040 - 7054
  • [26] Data management system for high-throughput experimentation in materials science.
    Tucker, J
    Loewenhauser, G
    Hill, JR
    Eichinger, BE
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2001, 222 : U278 - U278
  • [27] Kinetic data acquisition in high-throughput Fischer-Tropsch experimentation
    Hazemann, Paul
    Decottignies, Dominique
    Maury, Sylvie
    Humbert, Severine
    Berliet, Adrien
    Daniel, Cecile
    Schuurman, Yves
    CATALYSIS SCIENCE & TECHNOLOGY, 2020, 10 (21) : 7331 - 7343
  • [28] Application of data mining and evolutionary optimization in catalyst discovery and high-throughput experimentation techniques, strategies, and software
    Ohrenberg, A
    von Törne, C
    Schuppert, A
    Knab, B
    QSAR & COMBINATORIAL SCIENCE, 2005, 24 (01): : 29 - 37
  • [29] Spreeze: High-Throughput Parallel Reinforcement Learning Framework
    Hou, Jing
    Chen, Guang
    Zhang, Ruiqi
    Li, Zhijun
    Gu, Shangding
    Jiang, Changjun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (02) : 282 - 292
  • [30] High-throughput Sequencing Technology and Its Application
    Zhu Qiang-long
    Liu Shi
    Gao Peng
    Luan Fei-shi
    JournalofNortheastAgriculturalUniversity(EnglishEdition), 2014, 21 (03) : 84 - 96