Data flow modeling, data mining and QSAR in high-throughput discovery of functional nanomaterials

Cited by: 25
Authors
Yang, Yang [1]
Lin, Tian [1]
Weng, Xiao L. [2]
Darr, Jawwad A. [2]
Wang, Xue Z. [1]
Affiliations
[1] Univ Leeds, Inst Particle Sci & Engn, Sch Proc Environm & Mat Engn, Leeds LS2 9JT, W Yorkshire, England
[2] UCL, Dept Chem, London WC1H 0AJ, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Data mining; QSAR; Design of experiments; Genetic algorithm; Nanoparticle; High-throughput; PROCESS OPERATIONAL DATA; CONTINUOUS HYDROTHERMAL SYNTHESIS; CEO2-ZRO2 MIXED OXIDES; SOLID-SOLUTIONS; DECISION TREES; ECOTOXICITY DATA; CERIA; NANOPARTICLES; CATALYSTS; COMBINATORIAL;
DOI
10.1016/j.compchemeng.2010.04.018
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Metal oxide nanoparticles are promising materials for fuel cells, gas sensors and fine chemical catalysis. Their functionality depends strongly on composition and structure as well as on synthesis and processing conditions. Continuous hydrothermal flow synthesis (CHFS) reactors are an effective technology for making nanoceramics. To increase the sample throughput of CHFS, a manual high-throughput continuous hydrothermal (HiTCH) flow synthesis process capable of formulating scores of samples per day was developed. More recently, a fully automated nanoceramics synthesis platform called RAMSI (rapid automated synthesis instrument), based on the HiTCH synthesis technology, was developed. When large numbers of nanoceramics are made and formulated into appropriate libraries, automated analytical instruments can be used to collect a large amount of useful data. This paper describes the information flow management system of RAMSI (as well as CHFS) and the data mining system for supporting discovery, QSAR (quantitative structure-activity relationship) modeling and DoE (design of experiments). Case studies demonstrating the use of the high-throughput data mining system are presented. These include clustering of Raman spectra, interpretation of X-ray diffraction (XRD) measurements, and QSAR model building linking XRD data and photocatalytic properties. A genetic algorithm method for DoE is also presented that can guide the experiments to search for optimal XRD patterns. (C) 2010 Elsevier Ltd. All rights reserved.
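The abstract mentions a genetic algorithm that guides experiments toward optimal XRD patterns but does not give its details. The following is only a minimal sketch of the general idea, not the authors' method. It assumes that each candidate experiment is a vector of synthesis settings normalised to [0, 1], and that fitness is the negative squared distance to a hypothetical target vector; in a real workflow the fitness would instead come from comparing a measured XRD pattern with the desired one. The names `fitness` and `run_ga` and all parameter values are illustrative assumptions.

```python
import random


def fitness(candidate, target):
    # Hypothetical stand-in for an XRD-based score: negative squared
    # distance to the target vector (higher is better, 0 is a perfect match).
    return -sum((c - t) ** 2 for c, t in zip(candidate, target))


def run_ga(target, pop_size=30, generations=60, mutation_sd=0.05, seed=0):
    """Elitist genetic algorithm over vectors in [0, 1]; needs len(target) >= 2."""
    rng = random.Random(seed)
    n = len(target)
    # Random initial population of candidate experiments.
    pop = [[rng.random() for _ in range(n)] for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        pop.sort(key=lambda c: fitness(c, target), reverse=True)
        history.append(fitness(pop[0], target))
        elite = pop[: pop_size // 2]  # the better half survives unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)   # pick two distinct parents
            cut = rng.randrange(1, n)     # one-point crossover position
            child = a[:cut] + b[cut:]
            # Gaussian mutation, clipped back into the allowed range.
            child = [min(1.0, max(0.0, g + rng.gauss(0.0, mutation_sd)))
                     for g in child]
            children.append(child)
        pop = elite + children
    pop.sort(key=lambda c: fitness(c, target), reverse=True)
    history.append(fitness(pop[0], target))
    return pop[0], history
```

Calling `run_ga([0.2, 0.5, 0.8])` returns the best candidate found and the best fitness per generation. Because the better half of the population is carried over unchanged, the best fitness never decreases from one generation to the next.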
Pages: 671-678
Number of pages: 8