A deep learning framework for accurate reaction prediction and its application on high-throughput experimentation data

被引:15
|
作者
Li, Baiqing [1 ]
Su, Shimin [1 ]
Zhu, Chan [1 ]
Lin, Jie [1 ]
Hu, Xinyue [1 ]
Su, Lebin [1 ]
Yu, Zhunzhun [1 ]
Liao, Kuangbiao [1 ]
Chen, Hongming [1 ]
机构
[1] Guangzhou Lab, Guangzhou 510005, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
NEURAL-NETWORKS; CHEMISTRY; RETROSYNTHESIS; PLATFORM;
D O I
10.1186/s13321-023-00732-w
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In recent years, it has been seen that artificial intelligence (AI) starts to bring revolutionary changes to chemical synthesis. However, the lack of suitable ways of representing chemical reactions and the scarceness of reaction data has limited the wider application of AI to reaction prediction. Here, we introduce a novel reaction representation, GraphRXN, for reaction prediction. It utilizes a universal graph-based neural network framework to encode chemical reactions by directly taking two-dimension reaction structures as inputs. The GraphRXN model was evaluated by three publically available chemical reaction datasets and gave on-par or superior results compared with other baseline models. To further evaluate the effectiveness of GraphRXN, wet-lab experiments were carried out for the purpose of generating reaction data. GraphRXN model was then built on high-throughput experimentation data and a decent accuracy (R-2 of 0.712) was obtained on our in-house data. This highlights that the GraphRXN model can be deployed in an integrated workflow which combines robotics and AI technologies for forward reaction prediction.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] BRAIN CANCER PREDICTION USING MACHINE LEARNING METHODS AND HIGH-THROUGHPUT MOLECULAR DATA
    Ma, B. S.
    Chang, Q.
    Geng, Y.
    Liu, G. H.
    Dong, H.
    Sun, Y. Q.
    JOURNAL OF INVESTIGATIVE MEDICINE, 2017, 65 (07) : A1 - A1
  • [42] An application of a relational database system for high-throughput prediction of elemental compositions from accurate mass values
    Sakurai, Nozomu
    Ara, Takeshi
    Kanaya, Shigehiko
    Nakamura, Yukiko
    Iijima, Yoko
    Enomoto, Mitsuo
    Motegi, Takeshi
    Aoki, Koh
    Suzuki, Hideyuki
    Shibata, Daisuke
    BIOINFORMATICS, 2013, 29 (02) : 290 - 291
  • [43] Data-Driven Design of Polymer-Based Biomaterials: High-throughput Simulation, Experimentation, and Machine Learning
    Webb, Michael A.
    Patel, Roshan A.
    ACS APPLIED BIO MATERIALS, 2023, 7 (02) : 510 - 527
  • [44] Complex-Solid-Solution Electrocatalyst Discovery by Computational Prediction and High-Throughput Experimentation**
    Batchelor, Thomas A. A.
    Loeffler, Tobias
    Xiao, Bin
    Krysiak, Olga A.
    Strotkoetter, Valerie
    Pedersen, Jack K.
    Clausen, Christian M.
    Savan, Alan
    Li, Yujiao
    Schuhmann, Wolfgang
    Rossmeisl, Jan
    Ludwig, Alfred
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2021, 60 (13) : 6932 - 6937
  • [45] An integrated framework for accurate trajectory prediction based on deep learning
    Zhao, Shuo
    Li, Zhaozhi
    Zhu, Zikun
    Chang, Charles
    Li, Xin
    Chen, Ying-Chi
    Yang, Bo
    APPLIED INTELLIGENCE, 2024, 54 (20) : 10161 - 10175
  • [46] Metric Learning for High-Throughput Combinatorial Data Sets
    Vaddi, Kiran
    Wodo, Olga
    ACS COMBINATORIAL SCIENCE, 2019, 21 (11) : 726 - 735
  • [47] The Application of Cheminformatics in the Analysis of High-Throughput Screening Data
    Walters, W. Patrick
    Aronov, Alexander
    Goldman, Brian
    McClain, Brian
    Perola, Emanuele
    Weiss, Jonathan
    FRONTIERS IN MOLECULAR DESIGN AND CHEMIAL INFORMATION SCIENCE - HERMAN SKOLNIK AWARD SYMPOSIUM 2015: JURGEN BAJORATH, 2016, 1222 : 269 - 282
  • [48] SoFIA: a data integration framework for annotating high-throughput datasets
    Childs, Liam Harold
    Mamlouk, Soulafa
    Brandt, Joergen
    Sers, Christine
    Leser, Ulf
    BIOINFORMATICS, 2016, 32 (17) : 2590 - 2597
  • [49] Rethomics: An R framework to analyse high-throughput behavioural data
    Geissmann, Quentin
    Rodriguez, Luis Garcia
    Beckwith, Esteban J.
    Gilestro, Giorgio F.
    PLOS ONE, 2019, 14 (01):
  • [50] Deep learning: as the new frontier in high-throughput plant phenotyping
    Sunny Arya
    Karansher Singh Sandhu
    Jagmohan Singh
    Sudhir kumar
    Euphytica, 2022, 218