Crystal synthesizability prediction using contrastive positive unlabeled learning

被引:0
|
作者
Sun, Tao [1 ,3 ]
Yuan, Jianmei [1 ,2 ,3 ]
机构
[1] Xiangtan Univ, Sch Math & Computat Sci, Xiangtan 411105, Hunan, Peoples R China
[2] Xiangtan Univ, Minist Educ, Key Lab Intelligent Comp & Informat Proc, Xiangtan 411105, Hunan, Peoples R China
[3] Natl Ctr Appl Math Hunan, Xiangtan 411105, Hunan, Peoples R China
关键词
Perovskite materials; Photovoltaic applications; Contrastive learning; Positive unlabeled learning; PEROVSKITE; ALGORITHM;
D O I
10.1016/j.cpc.2024.109465
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High-throughput screening or generative models rapidly identify crystal structures with the desired properties, but the synthesizable ratio is generally low. Experimentally verifying the synthesizability of individual virtual crystals would entail significant time and resources. Therefore, a method for automatically assessing the synthesizability of virtual crystals is urgently needed. This paper describes an approach that combines contrastive learning and positive unlabeled learning. The resulting contrastive positive unlabeled learning (CPUL) model predicts the crystal-likeness score (CLscore) of virtual materials. The model achieves a true positive (CLscore > 0.5) prediction accuracy of 93.95% on a test set containing 10,000 materials taken from the Materials Project (MP) database. We further validate the model by using all Fe-containing materials from the MP database as the test set, obtaining a true positive rate of 88.89%. This indicates that the CPUL model performs well, even with limited knowledge of the interactions between Fe and the atoms in the crystals. The CPUL model is then used to assess the CLscore of virtual crystals in the MP database and analyze their synthesizability by combining the energy above the hull. Finally, the synthesizability of perovskite materials is predicted using the proposed CPUL model, resulting in seven candidate halide perovskite materials for photovoltaic applications.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Positive-unlabeled learning for the prediction of conformational B-cell epitopes
    Ren, Jing
    Liu, Qian
    Ellis, John
    Li, Jinyan
    BMC BIOINFORMATICS, 2015, 16
  • [22] Landslide Susceptibility Prediction Based on Positive Unlabeled Learning Coupled With Adaptive Sampling
    Fang, Zhice
    Wang, Yi
    Niu, Ruiqing
    Peng, Ling
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 11581 - 11592
  • [23] Positive Unlabeled Link Prediction via Transfer Learning for Gene Network Reconstruction
    Mignone, Paolo
    Pio, Gianvito
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 13 - 23
  • [24] Disease candidate genes prediction using positive labeled and unlabeled instances
    Sepideh Molaei
    Saeed Jalili
    BMC Medical Genomics, 18 (1)
  • [25] Conditional generative positive and unlabeled learning
    Papic, Ales
    Kononenko, Igor
    Bosnic, Zoran
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
  • [26] Efficient Training for Positive Unlabeled Learning
    Sansone, Emanuele
    De Natale, Francesco G. B.
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2584 - 2598
  • [27] Bayesian Classifiers for Positive Unlabeled Learning
    He, Jiazhen
    Zhang, Yang
    Li, Xue
    Wang, Yong
    WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 81 - +
  • [28] On Positive and Unlabeled Learning for Text Classification
    Nagy, Istvan T.
    Farkas, Richard
    Csirik, Janos
    TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 219 - 226
  • [29] Federated Learning with Positive and Unlabeled Data
    Lin, Xinyang
    Chen, Hanting
    Xu, Yixing
    Xu, Chao
    Gui, Xiaolin
    Deng, Yiping
    Wang, Yunhe
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [30] Positive and unlabeled examples help learning
    De Comité, F
    Denis, F
    Gilleron, R
    Letouzey, F
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 1999, 1720 : 219 - 230