Holistic Prediction of the pKa in Diverse Solvents Based on a Machine-Learning Approach

被引:175
|
作者
Yang, Qi [1 ]
Li, Yao [1 ]
Yang, Jin-Dong [1 ]
Liu, Yidi [1 ]
Zhang, Long [1 ]
Luo, Sanzhong [1 ]
Cheng, Jin-Pei [1 ]
机构
[1] Tsinghua Univ, Ctr Basic Mol Sci, Dept Chem, Beijing 100084, Peoples R China
关键词
iBond; machine learning; neural network; organocatalysts; pK(a)prediction; XGBoost; ACID DISSOCIATION-CONSTANTS; DENSITY-FUNCTIONAL THEORY; EQUILIBRIUM ACIDITIES; ACCURATE PREDICTION; DIMETHYL-SULFOXIDE; CARBOXYLIC-ACIDS; AQUEOUS-SOLUTION; BONDING DONORS; VALUES; DRUGS;
D O I
10.1002/anie.202008528
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
While many approaches to predict aqueous pK(a)values exist, the fast and accurate prediction of non-aqueous pK(a)values is still challenging. Based on the iBonD experimental pK(a)database (39 solvents), a holistic pK(a)prediction model was established using machine learning. Structural and physical-organic-parameter-based descriptors (SPOC) were introduced to represent the electronic and structural features of the molecules. The models trained with a neural network or the XGBoost algorithm showed the best prediction performance with a low MAE value of 0.87 pK(a)units. The approach allows a comprehensive mapping of all possible pK(a)correlations between different solvents and it was validated by predicting the aqueous pK(a)and micro-pK(a)of pharmaceutical molecules and pK(a)values of organocatalysts in DMSO and MeCN with high accuracy. An online prediction platform was constructed based on the current model, which can provide pK(a)prediction for different types of X-H acidity in the most commonly used solvents.
引用
收藏
页码:19282 / 19291
页数:10
相关论文
共 50 条
  • [41] A Machine-Learning Approach to Time Discrimination
    Hansen, Peter
    2010 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD (NSS/MIC), 2010, : 2132 - 2133
  • [42] Image-based crystal detection: a machine-learning approach
    Liu, Roy
    Freund, Yoav
    Spraggon, Glen
    ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2008, 64 : 1187 - 1195
  • [43] Machine-Learning Approach for Design of Nanomagnetic-Based Antennas
    Gianfagna, Carmine
    Yu, Huan
    Swaminathan, Madhavan
    Pulugurtha, Raj
    Tummala, Rao
    Antonini, Giulio
    JOURNAL OF ELECTRONIC MATERIALS, 2017, 46 (08) : 4963 - 4975
  • [44] A novel machine-learning based approach to predict flares of psoriasis
    Ramelyte, E.
    Djamei, V.
    Maul, T. J.
    Anzengruber, F.
    Navarini, A.
    EXPERIMENTAL DERMATOLOGY, 2018, 27 (03) : E44 - E45
  • [45] Theory Identity: A Machine-Learning Approach
    Larsen, Kai R.
    Hovorka, Dirk
    West, Jevin
    Birt, James
    Pfaff, James R.
    Chambers, Trevor W.
    Sampedro, Zebula R.
    Zager, Nick
    Vanstone, Bruce
    2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, : 4639 - 4648
  • [46] Machine-Learning Approach for Design of Nanomagnetic-Based Antennas
    Carmine Gianfagna
    Huan Yu
    Madhavan Swaminathan
    Raj Pulugurtha
    Rao Tummala
    Giulio Antonini
    Journal of Electronic Materials, 2017, 46 : 4963 - 4975
  • [47] Rapid Prediction of Brain Injury Pattern in mTBI by Combining FE Analysis With a Machine-Learning Based Approach
    Shim, Vickie B.
    Holdsworth, Samantha
    Champagne, Allen A.
    Coverdale, Nicole S.
    Cook, Douglas J.
    Lee, Tae-Rin
    Wang, Alan D.
    Li, Shaofan
    Fernandez, Justin W.
    IEEE ACCESS, 2020, 8 (179457-179465) : 179457 - 179465
  • [48] Prediction of project activity delays caused by variation orders: a machine-learning approach
    Nishat, Mirza Muntasir
    Neraas, Sander Magnussen
    Marsov, Andrei
    Olsson, Nils O. E.
    12TH NORDIC CONFERENCE ON CONSTRUCTION ECONOMICS AND ORGANISATION, 2024, 2024, 1389
  • [49] Inpatient stroke rehabilitation: prediction of clinical outcomes using a machine-learning approach
    Harari, Yaar
    O'Brien, Megan K.
    Lieber, Richard L.
    Jayaraman, Arun
    JOURNAL OF NEUROENGINEERING AND REHABILITATION, 2020, 17 (01)
  • [50] Machine-Learning Based Prediction Model for Prognosis of IgA Nephropathy Patients
    Park, Sehoon
    Koh, Eun Sil
    Baek, Chung Hee
    Kim, Yong Chul
    Lee, Jung Pyo
    Kim, Dong Ki
    Han, Seung Hyeok
    Chin, Ho Jun
    Joo, Kwon Wook
    Kim, Yon Su
    Lee, Hajeong
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2022, 33 (11): : 800 - 801