Dealing with small sample size problems in process industry using virtual sample generation: a Kriging-based approach

被引:29
|
作者
Zhu, Qun-Xiong [1 ,2 ]
Chen, Zhong-Sheng [1 ,2 ,3 ]
Zhang, Xiao-Han [1 ,2 ]
Rajabifard, Abbas [4 ]
Xu, Yuan [1 ,2 ,4 ]
Chen, Yi-Qun [4 ]
机构
[1] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China
[2] Minist Educ China, Engn Res Ctr Intelligent PSE, Beijing 100029, Peoples R China
[3] Purdue Univ, Davidson Sch Chem Engn, W Lafayette, IN 47907 USA
[4] Univ Melbourne, Ctr SDI & Land Adm, Dept Infrastruct Engn, Melbourne, Vic 3010, Australia
基金
中国国家自然科学基金;
关键词
Small sample size problems; Virtual sample generation; Kriging interpolation; Soft sensing modeling; High-density polyethylene; ENERGY PREDICTION; TREND-DIFFUSION; INTERPOLATION;
D O I
10.1007/s00500-019-04326-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The operational data of advanced process systems have met with explosive growth, but its fluctuations are so slight that the number of the extracted representative samples is quite limited, making it difficult to reflect the nature of the process and to establish prediction models. In this study, inspired by the process of fisherman repairing nets, a Kriging-based virtual sample generation (VSG) named Kriging-VSG is proposed to generate feasible virtual samples in data sparse regions. Then, the accuracy of prediction models is further enhanced by applying the generated virtual samples. In order to reasonably find data sparse regions, a distance-based criterion is imposed on each dimension to identify important samples with large information gaps. Similar to the process of fisherman repairing nets, a certain dimension is initially fixed at different quantiles. A dimension-wise interpolation process using Kriging is then performed on the center between important samples with large information gaps. To validate the performance of the proposed Kriging-VSG, two numerical simulations and a real-world application from a cascade reaction process for high-density polyethylene are carried out. The results indicate that the proposed Kriging-VSG outperforms other methods.
引用
收藏
页码:6889 / 6902
页数:14
相关论文
共 50 条
  • [21] Enhanced virtual sample generation based on manifold features: Applications to developing soft sensor using small data
    He, Yan-Lin
    Hua, Qiang
    Zhu, Qun-Xiong
    Lu, Shan
    ISA Transactions, 2022, 126 : 398 - 406
  • [22] Novel Virtual Sample Generation Based on Locally Linear Embedding for Optimizing the Small Sample Problem: Case of Soft Sensor Applications
    Zhu, Qun-Xiong
    Zhang, Xiao-Han
    He, Yan-Lin
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2020, 59 (40) : 17977 - 17986
  • [23] Novel virtual sample generation using conditional GAN for developing soft sensor with small data
    Zhu, Qun-Xiong
    Hou, Kun-Rui
    Chen, Zhong-Sheng
    Gao, Zi-Shu
    Xu, Yuan
    He, Yan-Lin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [24] Novel manifold learning based virtual sample generation for optimizing soft sensor with small data
    Zhang, Xiao-Han
    Xu, Yuan
    He, Yan-Lin
    Zhu, Qun-Xiong
    ISA TRANSACTIONS, 2021, 109 : 229 - 241
  • [25] Virtual Sample Generation and Ensemble Learning Based Image Source Identification With Small Training Samples
    Wu, Shiqi
    Wang, Bo
    Zhao, Jianxiang
    Zhao, Mengnan
    Zhong, Kun
    Guo, Yanqing
    INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2021, 13 (03) : 34 - 46
  • [26] Novel Virtual Sample Generation Using Target-Relevant Autoencoder for Small Data-Based Soft Sensor
    Tian, Ye
    Xu, Yuan
    Zhu, Qun-Xiong
    He, Yan-Lin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [27] A Bootstrap based Virtual Sample Generation Method for Improving the Accuracy of Modeling Complex Chemical Processes using Small Datasets
    Zhu, Qun-Xiong
    Gong, Hong-Fei
    Xu, Yuan
    He, Yan-Lin
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 84 - 88
  • [28] A Virtual Sample Generation Approach for Speculative Multithreading Using Feature Sets and Abstract Syntax Trees
    Liu, Bin
    Zhao, Yinliang
    Li, Meirong
    Liu, Yanzhao
    Feng, Boqin
    2012 13TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS, AND TECHNOLOGIES (PDCAT 2012), 2012, : 39 - 44
  • [29] Using the information embedded in the testing sample to break the limits caused by the small sample size in microarray-based classification
    Manli Zhu
    Aleix M Martinez
    BMC Bioinformatics, 9
  • [30] Using the information embedded in the testing sample to break the limits caused by the small sample size in microarray-based classification
    Zhu, Manli
    Martinez, Aleix M.
    BMC BIOINFORMATICS, 2008, 9 (1)