Data flow modeling, data mining and QSAR in high-throughput discovery of functional nanomaterials

Cited by: 25
Authors
Yang, Yang [1 ]
Lin, Tian [1 ]
Weng, Xiao L. [2 ]
Darr, Jawwad A. [2 ]
Wang, Xue Z. [1 ]
Affiliations
[1] Univ Leeds, Inst Particle Sci & Engn, Sch Proc Environm & Mat Engn, Leeds LS2 9JT, W Yorkshire, England
[2] UCL, Dept Chem, London WC1H 0AJ, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
Data mining; QSAR; Design of experiments; Genetic algorithm; Nanoparticle; High-throughput; PROCESS OPERATIONAL DATA; CONTINUOUS HYDROTHERMAL SYNTHESIS; CEO2-ZRO2 MIXED OXIDES; SOLID-SOLUTIONS; DECISION TREES; ECOTOXICITY DATA; CERIA; NANOPARTICLES; CATALYSTS; COMBINATORIAL;
DOI
10.1016/j.compchemeng.2010.04.018
Chinese Library Classification
TP39 [Computer applications];
Subject classification codes
081203 ; 0835 ;
Abstract
Metal oxide nanoparticles are promising materials for applications in fuel cells, gas sensors and fine chemical catalysis. Their functionality depends strongly on composition and structure, as well as on synthesis and processing conditions. Continuous hydrothermal flow synthesis (CHFS) reactors are an effective technology for making nanoceramics. To increase the sample throughput of CHFS, a manual high-throughput continuous hydrothermal (HiTCH) flow synthesis process capable of formulating scores of samples per day was developed. More recently, a fully automated nanoceramics synthesis platform called RAMSI (rapid automated synthesis instrument), based on the HiTCH synthesis technology, was developed. When large numbers of nanoceramics are made and formulated into appropriate libraries, automated analytical instruments can be used to collect a large amount of useful data. This paper describes the information flow management system of RAMSI (as well as CHFS) and the data mining system for supporting discovery, QSAR (quantitative structure-activity relationship) modeling and DoE (design of experiments). Case studies demonstrating the use of the high-throughput data mining system are presented. These include clustering of Raman spectra, interpretation of X-ray diffraction (XRD) measurements, and QSAR model building linking XRD data and photocatalytic properties. A genetic algorithm method for DoE is also presented that can guide the experiments in searching for optimal XRD patterns. (C) 2010 Elsevier Ltd. All rights reserved.
Pages: 671 - 678
Page count: 8
Related papers
50 in total
  • [21] Data Mining Approaches to High-Throughput Crystal Structure and Compound Prediction
    Hautier, Geoffroy
    PREDICTION AND CALCULATION OF CRYSTAL STRUCTURES: METHODS AND APPLICATIONS, 2014, 345 : 139 - 179
  • [22] A high-throughput SNP discovery strategy for RNA-seq data
    Zhao, Yun
    Wang, Ke
    Wang, Wen-li
    Yin, Ting-ting
    Dong, Wei-qi
    Xu, Chang-jie
    BMC GENOMICS, 2019, 20 (1)
  • [23] Learning from the data: Mining of large high-throughput screening databases
    Yan, S. Frank
    King, Frederick J.
    He, Yun
    Caldwell, Jeremy S.
    Zhou, Yingyao
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (06) : 2381 - 2395
  • [24] Discovery of microRNA Regulatory Networks by Integrating Multidimensional High-Throughput Data
    Yang, Jian-Hua
    Qu, Liang-Hu
    MICRORNA CANCER REGULATION: ADVANCED CONCEPTS, BIOINFORMATICS AND SYSTEMS BIOLOGY TOOLS, 2013, 774 : 251 - 266
  • [25] Optimizing depth and type of high-throughput sequencing data for microsatellite discovery
    Chapman, Mark A.
    APPLICATIONS IN PLANT SCIENCES, 2019, 7 (11):
  • [26] A high-throughput SNP discovery strategy for RNA-seq data
    Yun Zhao
    Ke Wang
    Wen-li Wang
    Ting-ting Yin
    Wei-qi Dong
    Chang-jie Xu
    BMC Genomics, 20
  • [27] A high-throughput experimentation platform for data-driven discovery in electrochemistry
    Lin, Dian-Zhao
    Pan, Kai-Jui
    Li, Yuyin
    Zhang, Lingyu
    Jayarapu, Krish N.
    Li, Tianchen
    Tran, Jasmine Vy
    Goddard, William A.
    Luo, Zhengtang
    Liu, Yayuan
    SCIENCE ADVANCES, 2025, 11 (14):
  • [28] A probabilistic approach for SNP discovery in high-throughput human resequencing data
    Hoberman, Rose
    Dias, Joana
    Ge, Bing
    Harmsen, Eef
    Mayhew, Michael
    Verlaan, Dominique J.
    Kwan, Tony
    Dewar, Ken
    Blanchette, Mathieu
    Pastinen, Tomi
    GENOME RESEARCH, 2009, 19 (09) : 1542 - 1552
  • [29] NCBI GEO: archive for high-throughput functional genomic data
    Barrett, Tanya
    Troup, Dennis B.
    Wilhite, Stephen E.
    Ledoux, Pierre
    Rudnev, Dmitry
    Evangelista, Carlos
    Kim, Irene F.
    Soboleva, Alexandra
    Tomashevsky, Maxim
    Marshall, Kimberly A.
    Phillippy, Katherine H.
    Sherman, Patti M.
    Muertter, Rolf N.
    Edgar, Ron
    NUCLEIC ACIDS RESEARCH, 2009, 37 : D885 - D890
  • [30] High-Throughput Flow Cytometry in Drug Discovery
    Ding, Mei
    Edwards, Bruce S.
    SLAS DISCOVERY, 2018, 23 (07) : 599 - 602