Discovery of Intermetallic Compounds from Traditional to Machine-Learning Approaches

被引:103
|
作者
Oliynyk, Anton O. [1 ]
Mar, Arthur [1 ]
机构
[1] Univ Alberta, Dept Chem, Edmonton, AB T6G 2G2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
QUATERNARY ORDERED VARIANTS; DENSITY-FUNCTIONAL THEORY; LA-ND; THERMOELECTRIC PROPERTIES; CRYSTAL-STRUCTURES; HOMOLOGOUS SERIES; HEUSLER COMPOUNDS; GD-TM; RE; PHASES;
D O I
10.1021/acs.accounts.7b00490
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Intermetallic compounds are bestowed by diverse compositions, complex structures, and useful properties for many materials applications. How metallic elements react to form these compounds and what structures they adopt remain challenging questions that defy predictability. Traditional approaches offer some rational strategies to prepare specific classes of intermetallics,, such as targeting members within a modular homologous series, manipulating building blocks to assemble new structures, and filling interstitial sites to create stuffed variants. Because these strategies rely on precedent, they cannot foresee surprising results, by definition. Exploratory synthesis, whether through systematic phase diagram investigations or serendipity, is still essential for expanding our knowledge base. Eventually, the relationships may become too complex for the pattern recognition skills to be reliably or practically performed by humans. Complementing these traditional approaches, new machine-learning approaches may be a viable alternative for materials discovery, not only among intermetallics but also more generally to other chemical compounds. In this Account, we survey our own efforts to discover new intermetallic compounds, encompassing gallides, germanides, phosphides, arsenides, and others. We apply various machine-learning methods (such as support vector machine and random forest algorithms) to confront two significant questions in solid state chemistry. First, what crystal structures are adopted by a compound given an arbitrary composition? Initial efforts have focused on binary equiatomic phases AB, ternary equiatomic phases ABC, and full Heusler phases AB(2)C. Our analysis emphasizes the use of real experimental data and places special value on confirming predictions through experiment. Chemical descriptors are carefully chosen through a rigorous procedure called cluster resolution feature selection. Predictions for crystal structures are quantified by evaluating probabilities. Major results include the discovery of RhCd, the first new binary AB compound to be found in over 15 years, with a CsCl-type structure; the connection between "ambiguous" prediction probabilities and the phenomenon of polymorphism, as illustrated in the case of TiFeP (with TiNiSi- and ZrNiAI-type structures); and the preparation of new predicted Heusler phases MRu2Ga and RuM2Ga (M = first-row transition metal) that are not obvious candidates. Second, how can the search for materials with desired properties be accelerated? One particular application of strong current interest is thermoelectric materials, which present a particular challenge because their optimum performance depends on achieving a balance of many interrelated physical properties. Making use of a recommendation engine developed by Citrine Informatics, we have identified new candidates for thermoelectric materials, including previously unknown compounds (e.g., TiRu2Ga with Heusler structure; Mn(Ru0.4Ge0.6) with CsCl-type structure) and previously reported compounds but counterintuitive candidates (e.g., Gcl(12)Co(5)Bi). An important lesson in these investigations is that the machine-learning model are only as good as the experimental data used to develop them. Thus, experimental work will continue to be necessary to improve the predictions made by machine learning.
引用
收藏
页码:59 / 68
页数:10
相关论文
共 50 条
  • [21] Predicting COPD readmissions: a novel 2e index with traditional regression and machine-learning approaches
    Liew, Chiat Qiao
    Chen, Yen-Pin
    Gao, Jun-Wan
    Ko, Chia-Hsin
    Tsai, Chu-Lin
    INTERNAL AND EMERGENCY MEDICINE, 2024,
  • [22] Discriminating chert origins using machine-learning approaches
    Wei, Zhen
    Li, Xianghui
    Sun, Minjia
    Guo, Ruiqing
    Liu, Guiping
    Xu, Zheting
    Cheng, Yuanfeng
    GEOLOGICAL JOURNAL, 2023, 58 (06) : 2403 - 2417
  • [23] Crop Contamination Forecasting Based on Machine-Learning Approaches
    V. K. Kalichkin
    O. K. Alsova
    K. Yu. Maksimovich
    N. V. Vasilyeva
    Russian Agricultural Sciences, 2022, 48 (2) : 115 - 122
  • [24] Machine-learning approaches for classifying haplogroup from Y chromosome STR data
    Schlecht, Joseph
    Kaplan, Matthew E.
    Barnard, Kobus
    Karafet, Tatiana
    Hammer, Michael F.
    Merchant, Nirav C.
    PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (06)
  • [25] Representation of compounds for machine-learning prediction of physical properties
    Seko, Atsuto
    Hayashi, Hiroyuki
    Nakayama, Keita
    Takahashi, Akira
    Tanaka, Isao
    PHYSICAL REVIEW B, 2017, 95 (14)
  • [26] Machine-learning models for high-throughput materials discovery
    Landrum, GA
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2003, 225 : U560 - U560
  • [27] Reliable and explainable machine-learning methods for accelerated material discovery
    Kailkhura, Bhavya
    Gallagher, Brian
    Kim, Sookyung
    Hiszpanski, Anna
    Han, T. Yong-Jin
    NPJ COMPUTATIONAL MATERIALS, 2019, 5 (1)
  • [28] Reliable and explainable machine-learning methods for accelerated material discovery
    Bhavya Kailkhura
    Brian Gallagher
    Sookyung Kim
    Anna Hiszpanski
    T. Yong-Jin Han
    npj Computational Materials, 5
  • [29] Medical Data Assessment with Traditional, Machine-learning and Deep-learning Techniques
    Lin, Hong
    Satapathy, Suresh Chandra
    Rajinikanth, V.
    CURRENT MEDICAL IMAGING, 2020, 16 (10) : 1185 - 1186
  • [30] Molecular Docking for Drug Discovery: Machine-Learning Approaches for Native Pose Prediction of Protein-Ligand Complexes
    Ashtawy, Hossam M.
    Mahapatra, Nihar R.
    COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS: 10TH INTERNATIONAL MEETING, 2014, 8452 : 15 - 32