Automatic Feature Selection for Atom-Centered Neural Network Potentials Using a Gradient Boosting Decision Algorithm

被引:0
|
作者
Li, Renzhe [1 ]
Wang, Jiaqi [1 ]
Singh, Akksay [1 ,2 ,3 ]
Li, Bai [1 ]
Song, Zichen [1 ,4 ]
Zhou, Chuan [1 ]
Li, Lei [1 ]
机构
[1] Southern Univ Sci & Technol, Dept Mat Sci & Engn, Shenzhen 518055, Peoples R China
[2] Univ Texas Austin, Dept Chem, Austin, TX 78712 USA
[3] Univ Texas Austin, Inst Computat Engn & Sci, Austin, TX 78712 USA
[4] City Univ Hong Kong, Dept Mat Sci & Engn, Kowloon, Hong Kong 999077, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
LIQUID-METAL; FORCE-FIELD; APPROXIMATION; DYNAMICS; PERFORMANCE; SIMULATION; MODEL;
D O I
10.1021/acs.jctc.4c01176
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Atom-centered neural network (ANN) potentials have shown high accuracy and computational efficiency in modeling atomic systems. A crucial step in developing reliable ANN potentials is the proper selection of atom-centered symmetry functions (ACSFs), also known as atomic features, to describe atomic environments. Inappropriate selection of ACSFs can lead to poor-quality ANN potentials. Here, we propose a gradient boosting decision tree (GBDT)-based framework for the automatic selection of optimal ACSFs. This framework takes uniformly distributed sets of ACSFs as input and evaluates their relative importance. The ACSFs with high average importance scores are selected and used to train an ANN potential. We applied this method to the Ge system, resulting in an ANN potential with root-mean-square errors (RMSE) of 10.2 meV/atom for energy and 84.8 meV/& Aring; for force predictions, utilizing only 18 ACSFs to achieve a balance between accuracy and computational efficiency. The framework is validated using the grid searching method, demonstrating that ACSFs selected with our framework are in the optimal region. Furthermore, we also compared our method with commonly used feature selection algorithms. The results show that our algorithm outperforms the others in terms of effectiveness and accuracy. This study highlights the significance of the ACSF parameter effect on the ANN performance and presents a promising method for automatic ACSF selection, facilitating the development of machine learning potentials.
引用
收藏
页码:10564 / 10573
页数:10
相关论文
共 50 条
  • [1] Systematic Identification of Atom-Centered Symmetry Functions for the Development of Neural Network Potentials
    Mudassir, Mohammed Wasay
    Srinivasan, Sriram Goverapet
    Mynam, Mahesh
    Rai, Beena
    JOURNAL OF PHYSICAL CHEMISTRY A, 2022, 126 (44): : 8337 - 8347
  • [2] Atom-centered symmetry functions for constructing high-dimensional neural network potentials
    Behler, Joerg
    JOURNAL OF CHEMICAL PHYSICS, 2011, 134 (07):
  • [3] Charge-Optimized Electrostatic Interaction Atom-Centered Neural Network Algorithm
    Song, Zichen
    Han, Jian
    Henkelman, Graeme
    Li, Lei
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2024, 20 (05) : 2088 - 2097
  • [4] Pair-distribution-function guided optimization of fingerprints for atom-centered neural network potentials
    Li, Lei
    Li, Hao
    Seymour, Ieuan D.
    Koziol, Lucas
    Henkelman, Graeme
    JOURNAL OF CHEMICAL PHYSICS, 2020, 152 (22): : 224102
  • [5] Feature selection algorithm recommendation for gene expression data through gradient boosting and neural network metamodels
    Aduviri, Robert
    Matos, Daniel
    Villanueva, Edwin
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2726 - 2728
  • [6] High-dimensional neural network potentials for magnetic systems using spin-dependent atom-centered symmetry functions
    Eckhoff, Marco
    Behler, Joerg
    NPJ COMPUTATIONAL MATERIALS, 2021, 7 (01)
  • [7] High-dimensional neural network potentials for magnetic systems using spin-dependent atom-centered symmetry functions
    Marco Eckhoff
    Jörg Behler
    npj Computational Materials, 7
  • [8] Boosting feature selection for Neural Network based regression
    Bailly, Kevin
    Milgram, Maurice
    NEURAL NETWORKS, 2009, 22 (5-6) : 748 - 756
  • [9] Performance of optimized atom-centered potentials for weakly bonded systems using density functional theory
    von Lilienfeld, OA
    Tavernelli, I
    Rothlisberger, U
    Sebastiani, D
    PHYSICAL REVIEW B, 2005, 71 (19)
  • [10] Accurate Modeling of Water Clusters with Density-Functional Theory Using Atom-Centered Potentials
    Holmes, Jake D.
    Otero-de-la-Roza, Alberto
    DiLabio, Gino A.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2017, 13 (09) : 4205 - 4215