SemiH: DFT Hamiltonian neural network training with semi-supervised learning

Cited by: 0

Authors:

Cho, Yucheol [1]

Choi, Guenseok [1]

Ham, Gyeongdo [1]

Shin, Mincheol [1]

Kim, Daeshik [1]

Affiliations:

[1] Korea Adv Inst Sci & Technol KAIST, Sch Elect Engn, Daejeon 34141, South Korea

Source:

MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024 / Vol. 5 / Issue 03

Keywords:

density functional theory; neural network Hamiltonian; message-passing neural network; semi-supervised learning; unlabeled data; pseudo Hamiltonian; graph neural network; DENSITY-FUNCTIONAL THEORY; ELECTRONIC-STRUCTURE;

DOI:

10.1088/2632-2153/ad7227

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Over the past decades, density functional theory (DFT) calculations have been used in fields such as materials science and semiconductor devices. However, because DFT calculations rigorously account for interactions between atoms, they carry a significant computational cost. To address this, extensive recent research has focused on training neural networks to replace DFT calculations. Previous training methods, however, required a large number of DFT simulations to obtain the ground truth (Hamiltonians). When training data are limited, deep learning models often show increased errors in the Hamiltonians and band structures they predict for test data. This risks inaccurate physical interpretations, including the emergence of unphysical branches within band structures. To tackle this challenge, we propose a novel deep learning-based method for calculating DFT Hamiltonians, specifically tailored to produce accurate results with limited training data. Our framework not only employs supervised learning with the calculated Hamiltonians but also generates pseudo Hamiltonians (targets for unlabeled data) and trains the neural networks on the unlabeled data. This use of unlabeled data marks the first such attempt in the field of neural network Hamiltonians. Our framework outperforms the state-of-the-art approach across various datasets, such as MoS2, Bi2Te3, HfO2, and InGaAs. Moreover, by effectively utilizing unlabeled data, it demonstrates enhanced generalization, achieving noteworthy results when evaluated on data more complex than the training set, such as configurations with more atoms and temperatures outside the training range.
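
The abstract describes two training signals: a supervised loss on calculated Hamiltonians and a loss on pseudo Hamiltonians generated for unlabeled data. The record gives no implementation details, so the following is only a minimal sketch of generic pseudo-label training, not the SemiH procedure. A scalar linear model stands in for the Hamiltonian-predicting network, and every name here (`fit_linear`, `train_with_pseudo_labels`, `unlabeled_weight`) is a hypothetical chosen for illustration.

```python
# Generic pseudo-labeling sketch. This is NOT the SemiH implementation:
# the scalar model y = w * x is an assumed stand-in for a neural network,
# and all function and parameter names are hypothetical.

def fit_linear(xs, ys, w=0.0, lr=0.05, steps=500, weights=None):
    """Minimise a weighted mean squared error for y = w * x by gradient descent."""
    if weights is None:
        weights = [1.0] * len(xs)
    total = sum(weights)
    for _ in range(steps):
        # Gradient of sum_i c_i * (w * x_i - y_i)^2 / sum_i c_i with respect to w.
        grad = sum(2.0 * c * (w * x - y) * x
                   for x, y, c in zip(xs, ys, weights)) / total
        w -= lr * grad
    return w


def train_with_pseudo_labels(labeled_x, labeled_y, unlabeled_x, unlabeled_weight=0.5):
    # Step 1: supervised training on the labeled pairs only.
    teacher_w = fit_linear(labeled_x, labeled_y)
    # Step 2: the trained model produces pseudo targets for the unlabeled inputs.
    pseudo_y = [teacher_w * x for x in unlabeled_x]
    # Step 3: continue training on labeled plus pseudo-labeled data,
    # down-weighting the pseudo-labeled terms in the loss.
    xs = labeled_x + unlabeled_x
    ys = labeled_y + pseudo_y
    weights = [1.0] * len(labeled_x) + [unlabeled_weight] * len(unlabeled_x)
    student_w = fit_linear(xs, ys, w=teacher_w, weights=weights)
    return teacher_w, pseudo_y, student_w


labeled_x = [1.0, 2.0]
labeled_y = [2.0, 4.0]          # toy labels generated by y = 2x
unlabeled_x = [0.5, 1.5, 3.0]   # inputs without labels
teacher_w, pseudo_y, student_w = train_with_pseudo_labels(
    labeled_x, labeled_y, unlabeled_x
)
```

In this toy the pseudo targets are exactly consistent with the model that produced them, so the final weight matches the supervised one; the example only shows the data flow. Practical semi-supervised schemes typically add something that makes the unlabeled term informative, such as input perturbation or a separate teacher model, and whether SemiH does so is not stated in this record.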


Pages: 16
