Dealing with small sample size problems in process industry using virtual sample generation: a Kriging-based approach

Cited by: 29
Authors
Zhu, Qun-Xiong [1, 2]
Chen, Zhong-Sheng [1, 2, 3]
Zhang, Xiao-Han [1, 2]
Rajabifard, Abbas [4]
Xu, Yuan [1, 2, 4]
Chen, Yi-Qun [4]
Affiliations
[1] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China
[2] Minist Educ China, Engn Res Ctr Intelligent PSE, Beijing 100029, Peoples R China
[3] Purdue Univ, Davidson Sch Chem Engn, W Lafayette, IN 47907 USA
[4] Univ Melbourne, Ctr SDI & Land Adm, Dept Infrastruct Engn, Melbourne, Vic 3010, Australia
Funding
National Natural Science Foundation of China
Keywords
Small sample size problems; Virtual sample generation; Kriging interpolation; Soft sensing modeling; High-density polyethylene; Energy prediction; Trend-diffusion; Interpolation
DOI
10.1007/s00500-019-04326-3
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Operational data from advanced process systems have grown explosively, but their fluctuations are so slight that only a limited number of representative samples can be extracted, which makes it difficult to capture the nature of the process and to build prediction models. In this study, inspired by the way fishermen repair nets, a Kriging-based virtual sample generation (VSG) method named Kriging-VSG is proposed to generate feasible virtual samples in data-sparse regions. The generated virtual samples are then used to further improve the accuracy of prediction models. To locate data-sparse regions reasonably, a distance-based criterion is imposed on each dimension to identify important samples separated by large information gaps. As in net repair, a given dimension is first fixed at different quantiles. A dimension-wise interpolation using Kriging is then performed at the center between important samples separated by large information gaps. To validate the proposed Kriging-VSG, two numerical simulations and a real-world application to a cascade reaction process for high-density polyethylene are carried out. The results indicate that the proposed Kriging-VSG outperforms other methods.
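The gap-finding and interpolation idea in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the Gaussian covariance, the `length_scale` and `nugget` values, the `top_k` parameter, and all function names are assumptions made for illustration, and the quantile-fixing step is omitted because the abstract does not specify it. The sketch sorts the samples along each dimension, picks the adjacent pair with the largest gap, and uses ordinary Kriging to predict the output at the center between the two samples.

```python
import math

def gaussian_cov(a, b, length_scale):
    """Gaussian covariance between two points (assumed covariance model)."""
    d2 = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return math.exp(-d2 / (2.0 * length_scale ** 2))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def ordinary_kriging(X, y, x_new, length_scale=1.0, nugget=1e-10):
    """Ordinary Kriging prediction at x_new; the weights sum to one."""
    n = len(X)
    # Kriging system: [[K, 1], [1^T, 0]] [w; mu] = [k; 1]
    A = [[gaussian_cov(X[i], X[j], length_scale) + (nugget if i == j else 0.0)
          for j in range(n)] + [1.0] for i in range(n)]
    A.append([1.0] * n + [0.0])
    rhs = [gaussian_cov(xi, x_new, length_scale) for xi in X] + [1.0]
    w = solve(A, rhs)[:n]  # drop the Lagrange multiplier
    return sum(wi * yi for wi, yi in zip(w, y))

def largest_gap_pairs(X, top_k=1):
    """Per dimension, return index pairs of neighbours with the largest gaps."""
    pairs = []
    for d in range(len(X[0])):
        order = sorted(range(len(X)), key=lambda i: X[i][d])
        gaps = [(X[b][d] - X[a][d], a, b) for a, b in zip(order, order[1:])]
        gaps.sort(reverse=True)
        pairs.extend((a, b) for _, a, b in gaps[:top_k])
    return pairs

def generate_virtual_samples(X, y, top_k=1, length_scale=1.0):
    """Place one virtual sample at the center of each selected gap."""
    virtual = []
    for a, b in largest_gap_pairs(X, top_k):
        center = [(xa + xb) / 2.0 for xa, xb in zip(X[a], X[b])]
        virtual.append((center, ordinary_kriging(X, y, center, length_scale)))
    return virtual

if __name__ == "__main__":
    # Toy data (made up): a sparse region lies between x = 1 and x = 4.
    X = [[0.0], [1.0], [4.0], [5.0]]
    y = [0.0, 1.0, 2.0, 1.5]
    for center, value in generate_virtual_samples(X, y):
        print(center, value)
```

In this simplified form the same pair can be selected in more than one dimension, so a real pipeline would likely deduplicate the centers and check that the virtual samples are feasible before adding them to the training set.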
Pages: 6889-6902
Page count: 14