Multifidelity Statistical Machine Learning for Molecular Crystal Structure Prediction

被引:49
|
作者
Egorova, Olga [1 ]
Hafizi, Roohollah [2 ]
Woods, David C. [1 ]
Day, Graeme M. [2 ]
机构
[1] Univ Southampton, Stat Sci Res Inst, Southampton SO17 1BJ, Hants, England
[2] Univ Southampton, Sch Chem, Computat Syst Chem, Southampton SO17 1BJ, Hants, England
来源
JOURNAL OF PHYSICAL CHEMISTRY A | 2020年 / 124卷 / 39期
基金
英国工程与自然科学研究理事会;
关键词
TOTAL-ENERGY CALCULATIONS; LATTICE ENERGIES; OXALIC-ACID; POLYMORPHISM; DESIGN; ALTERNATION; LANDSCAPES; ACCURACY; DYNAMICS; MODEL;
D O I
10.1021/acs.jpca.0c05006
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
The prediction of crystal structures from first-principles requires highly accurate energies for large numbers of putative crystal structures. High accuracy of solid state density functional theory (DFT) calculations is often required, but hundreds or more structures can be present in the low energy region of interest, so that the associated computational costs are prohibitive. Here, we apply statistical machine learning to predict expensive hybrid functional DFT (PBEO) calculations using a multifidelity approach to re-evaluate the energies of crystal structures predicted with an inexpensive force field. The method uses an autoregressive Gaussian process, making use of less expensive GGA DFT (PBE) calculations to bridge the gap between the force field and PBEO energies. The method is benchmarked on the crystal structure landscapes of three small, hydrogen-bonded organic molecules and shown to produce accurate predictions of energies and crystal structure ranking using small numbers of the most expensive calculations; the PBEO energies can be predicted with errors of less than 1 kJ mol(-1) with between 4.2 and 6.8% of the cost of the full calculations. As the model that we have developed is probabilistic, we discuss how the uncertainties in predicted energies impact the assessment of the energetic ranking of crystal structures.
引用
收藏
页码:8065 / 8078
页数:14
相关论文
共 50 条
  • [21] Learning structure-energy relationships for the prediction of molecular crystal structures
    Day, Graeme M.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2021, 77 : C475 - C475
  • [22] Integration of generative machine learning with the heuristic crystal structure prediction code FUSE
    Collins, Christopher M.
    Sayeed, Hasan M.
    Darling, George R.
    Claridge, John B.
    Sparks, Taylor D.
    Rosseinsky, Matthew J.
    FARADAY DISCUSSIONS, 2025, 256 (00) : 85 - 103
  • [23] Multifidelity aerodynamic flow field prediction using random forest-based machine learning
    Nagawkar, Jethro
    Leifsson, Leifur
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 123
  • [24] Crystal structure prediction based on statistical potentials
    Hofmann, DWM
    Lengauer, T
    JOURNAL OF MOLECULAR MODELING, 1998, 4 (03) : 132 - 144
  • [25] Crystal Structure Prediction based on Statistical Potentials
    Detlef W. M. Hofmann
    Thomas Lengauer
    Molecular modeling annual, 1998, 4 : 132 - 144
  • [26] Machine learning in protein structure prediction
    AlQuraishi, Mohammed
    CURRENT OPINION IN CHEMICAL BIOLOGY, 2021, 65 : 1 - 8
  • [27] Crop Type Prediction: A Statistical and Machine Learning Approach
    Bhuyan, Bikram Pratim
    Tomar, Ravi
    Singh, T. P.
    Cherif, Amar Ramdane
    SUSTAINABILITY, 2023, 15 (01)
  • [28] Statistical Machine Learning Approaches to Liver Disease Prediction
    Mostafa, Fahad
    Hasan, Easin
    Williamson, Morgan
    Khan, Hafiz
    LIVERS, 2021, 1 (04): : 294 - 312
  • [29] Statistical and Machine Learning Methods for Electricity Demand Prediction
    Kotillova, Alexandra
    Koprinska, Irena
    Rana, Mashud
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II, 2012, 7664 : 535 - 542
  • [30] A Review and Analysis of Machine Learning and Statistical Approaches for Prediction
    Nisha, K. G.
    Sreekumar, K.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 135 - 139