Dealing with small sample size problems in process industry using virtual sample generation: a Kriging-based approach

被引:29
|
作者
Zhu, Qun-Xiong [1 ,2 ]
Chen, Zhong-Sheng [1 ,2 ,3 ]
Zhang, Xiao-Han [1 ,2 ]
Rajabifard, Abbas [4 ]
Xu, Yuan [1 ,2 ,4 ]
Chen, Yi-Qun [4 ]
机构
[1] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China
[2] Minist Educ China, Engn Res Ctr Intelligent PSE, Beijing 100029, Peoples R China
[3] Purdue Univ, Davidson Sch Chem Engn, W Lafayette, IN 47907 USA
[4] Univ Melbourne, Ctr SDI & Land Adm, Dept Infrastruct Engn, Melbourne, Vic 3010, Australia
基金
中国国家自然科学基金;
关键词
Small sample size problems; Virtual sample generation; Kriging interpolation; Soft sensing modeling; High-density polyethylene; ENERGY PREDICTION; TREND-DIFFUSION; INTERPOLATION;
D O I
10.1007/s00500-019-04326-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The operational data of advanced process systems have met with explosive growth, but its fluctuations are so slight that the number of the extracted representative samples is quite limited, making it difficult to reflect the nature of the process and to establish prediction models. In this study, inspired by the process of fisherman repairing nets, a Kriging-based virtual sample generation (VSG) named Kriging-VSG is proposed to generate feasible virtual samples in data sparse regions. Then, the accuracy of prediction models is further enhanced by applying the generated virtual samples. In order to reasonably find data sparse regions, a distance-based criterion is imposed on each dimension to identify important samples with large information gaps. Similar to the process of fisherman repairing nets, a certain dimension is initially fixed at different quantiles. A dimension-wise interpolation process using Kriging is then performed on the center between important samples with large information gaps. To validate the performance of the proposed Kriging-VSG, two numerical simulations and a real-world application from a cascade reaction process for high-density polyethylene are carried out. The results indicate that the proposed Kriging-VSG outperforms other methods.
引用
收藏
页码:6889 / 6902
页数:14
相关论文
共 50 条
  • [41] MetSizeR: selecting the optimal sample size for metabolomic studies using an analysis based approach
    Nyamundanda, Gift
    Gormley, Isobel Claire
    Fan, Yue
    Gallagher, William M.
    Brennan, Lorraine
    BMC BIOINFORMATICS, 2013, 14
  • [42] MetSizeR: selecting the optimal sample size for metabolomic studies using an analysis based approach
    Gift Nyamundanda
    Isobel Claire Gormley
    Yue Fan
    William M Gallagher
    Lorraine Brennan
    BMC Bioinformatics, 14
  • [43] Novel virtual sample generation using Gibbs Sampling integrated with GRNN for handling small data in soft sensing
    Zhu, Qun-Xiong
    Zhao, Qi-Qian
    Xu, Yuan
    He, Yan-Lin
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 89 - 94
  • [44] Two-phase reverse neural network approach for modeling a complicate manufacturing process with small sample size
    Wang, GN
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS, AND INFORMATICS, VOL XVI, PROCEEDINGS, 2004, : 280 - 285
  • [45] Nonlinear probabilistic virtual sample generation using Gaussian process latent variable model and fitting for rubber material
    Chen, Wenlong
    Chen, Kai
    COMPUTATIONAL MATERIALS SCIENCE, 2023, 230
  • [46] Novel virtual sample generation method based on data augmentation and weighted interpolation for soft sensing with small data
    Song, Xiao-Lu
    He, Yan-Lin
    Li, Xing-Yuan
    Zhu, Qun-Xiong
    Xu, Yuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 225
  • [47] A Monte Carlo and Kernel Density Estimation based virtual sample generation method for small data modeling problem
    Zhu, Qun-Xiong
    Wang, Zhi-Hui
    He, Yan-Lin
    Xu, Yuan
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1123 - 1128
  • [48] A maximum uncertainty LDA-based approach for limited sample size problems — with application to face recognition
    Department of Electrical Engineering, Centro Universitário da FEI, São Paulo, Brazil
    不详
    J. Braz. Comput. Soc., 2006, 2 (7-18):
  • [49] A maximum uncertainty LDA-based approach for limited sample size problems - with application to face recognition
    Thomaz, CE
    Gillies, DF
    SIBGRAPI 2005: XVIII BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, CONFERENCE PROCEEDINGS, 2005, : 89 - 96
  • [50] A Virtual Sample Generation Method Based on Differential Evolution Algorithm for Overall Trend of Small Sample Data: Used for Lithium-ion Battery Capacity Degradation Data
    Kang, Guoqing
    Wu, Lifeng
    Guan, Yong
    Peng, Zhen
    IEEE ACCESS, 2019, 7 : 123255 - 123267