Holistic Prediction of the pKa in Diverse Solvents Based on a Machine-Learning Approach

被引:175
|
作者
Yang, Qi [1 ]
Li, Yao [1 ]
Yang, Jin-Dong [1 ]
Liu, Yidi [1 ]
Zhang, Long [1 ]
Luo, Sanzhong [1 ]
Cheng, Jin-Pei [1 ]
机构
[1] Tsinghua Univ, Ctr Basic Mol Sci, Dept Chem, Beijing 100084, Peoples R China
关键词
iBond; machine learning; neural network; organocatalysts; pK(a)prediction; XGBoost; ACID DISSOCIATION-CONSTANTS; DENSITY-FUNCTIONAL THEORY; EQUILIBRIUM ACIDITIES; ACCURATE PREDICTION; DIMETHYL-SULFOXIDE; CARBOXYLIC-ACIDS; AQUEOUS-SOLUTION; BONDING DONORS; VALUES; DRUGS;
D O I
10.1002/anie.202008528
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
While many approaches to predict aqueous pK(a)values exist, the fast and accurate prediction of non-aqueous pK(a)values is still challenging. Based on the iBonD experimental pK(a)database (39 solvents), a holistic pK(a)prediction model was established using machine learning. Structural and physical-organic-parameter-based descriptors (SPOC) were introduced to represent the electronic and structural features of the molecules. The models trained with a neural network or the XGBoost algorithm showed the best prediction performance with a low MAE value of 0.87 pK(a)units. The approach allows a comprehensive mapping of all possible pK(a)correlations between different solvents and it was validated by predicting the aqueous pK(a)and micro-pK(a)of pharmaceutical molecules and pK(a)values of organocatalysts in DMSO and MeCN with high accuracy. An online prediction platform was constructed based on the current model, which can provide pK(a)prediction for different types of X-H acidity in the most commonly used solvents.
引用
收藏
页码:19282 / 19291
页数:10
相关论文
共 50 条
  • [21] Parametrization of Sunspot Groups Based on Machine-Learning Approach
    Illarionov, Egor
    Tlatov, Andrey
    SOLAR PHYSICS, 2022, 297 (02)
  • [22] Parametrization of Sunspot Groups Based on Machine-Learning Approach
    Egor Illarionov
    Andrey Tlatov
    Solar Physics, 2022, 297
  • [23] A new approach of clustering based machine-learning algorithm
    Al-Omary, Alauddin Yousif
    Jamil, Mohammad Shahid
    KNOWLEDGE-BASED SYSTEMS, 2006, 19 (04) : 248 - 258
  • [24] A Holistic Approach on Airfare Price Prediction Using Machine Learning Techniques
    Kalampokas, Theofanis
    Tziridis, Konstantinos
    Kalampokas, Nikolaos
    Nikolaou, Alexandros
    Vrochidou, Eleni
    Papakostas, George A.
    IEEE ACCESS, 2023, 11 : 46627 - 46643
  • [25] Machine-Learning Aided Peer Prediction
    Liu, Yang
    Chen, Yiling
    EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 63 - 80
  • [26] Prediction of cholinergic compounds by machine-learning
    Wijeyesakere S.J.
    Wilson D.M.
    Sue Marty M.
    Wilson, Daniel M. (MWilson3@dow.com), 1600, Elsevier B.V. (13):
  • [27] A Machine-Learning Approach for the Prediction of Fly-Ash Concrete Strength
    Shao, Shanqing
    Gong, Aimin
    Wang, Ran
    Chen, Xiaoshuang
    Xu, Jing
    Wang, Fulai
    Liu, Feipeng
    FDMP-FLUID DYNAMICS & MATERIALS PROCESSING, 2023, 19 (12): : 3007 - 3019
  • [28] A machine-learning fatigue life prediction approach of additively manufactured metals
    Bao, Hongyixi
    Wu, Shengchuan
    Wu, Zhengkai
    Kang, Guozheng
    Peng, Xin
    Withers, Philip J.
    ENGINEERING FRACTURE MECHANICS, 2021, 242
  • [29] A Machine-Learning Approach for the Prediction of Enzymatic Activity of Proteins in Metagenomic Samples
    Koutsandreas, Theodoros
    Pilalis, Eleftherios
    Chatziioannou, Aristotelis
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2013, 2013, 412 : 81 - 87
  • [30] Machine-learning based prediction of crash response of tubular structures
    Sakaridis, Emmanouil
    Karathanasopoulos, Nikolaos
    Mohr, Dirk
    INTERNATIONAL JOURNAL OF IMPACT ENGINEERING, 2022, 166