Co-training based virtual sample generation for solving the small sample size problem in process industry

被引:14
|
作者
Zhu, Qun-Xiong
Zhang, Hong-Tao
Tian, Ye
Zhang, Ning
Xu, Yuan [1 ]
He, Yan-Lin [1 ]
机构
[1] Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China
关键词
Virtual sample generation; Small sample size; Industrial process; Soft sensor; SOFT SENSOR; FORECASTING-MODEL; TREND-DIFFUSION; PREDICTION; SETS;
D O I
10.1016/j.isatra.2022.08.021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of industrialization, the production scale and complexity of process industries are getting larger and larger. But, limited by the small amounts of samples and the uneven sample distribution in the process industry, it is difficult to establish accurate and efficient data-driven soft sensor models to predict some variables. To further develop the application of soft sensor models, generating new virtual samples based on the original sample distribution to extend the sample set is an ideal approach to solve this problem. In this paper, a novel virtual sample generation method based on the co-training of two K-Nearest Neighbor (KNN) models is proposed. First, according to the sparse parameter, sparse regions in each dimension of the feature space are identified. Second, the input features of virtual samples are generated in these sparse regions by performing interpolation operations. Third, the outputs of virtual samples are predicted by double KNN regressors based on co-training. The qualified virtual samples are screened and the model is updated using these virtual samples to improve the prediction accuracy of the double KNN models. To verify the effectiveness and superiority of the proposed virtual sample generation method based on the co-training (CTVSG), case studies are conducted using two standard functions and a Purified Terephthalic Acid (PTA) industrial dataset, where the effectiveness of CTVSG is confirmed.(c) 2022 ISA. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:290 / 301
页数:12
相关论文
共 50 条
  • [1] Dealing with small sample size problems in process industry using virtual sample generation: a Kriging-based approach
    Zhu, Qun-Xiong
    Chen, Zhong-Sheng
    Zhang, Xiao-Han
    Rajabifard, Abbas
    Xu, Yuan
    Chen, Yi-Qun
    SOFT COMPUTING, 2020, 24 (09) : 6889 - 6902
  • [2] Dealing with small sample size problems in process industry using virtual sample generation: a Kriging-based approach
    Qun-Xiong Zhu
    Zhong-Sheng Chen
    Xiao-Han Zhang
    Abbas Rajabifard
    Yuan Xu
    Yi-Qun Chen
    Soft Computing, 2020, 24 : 6889 - 6902
  • [3] Solving the small sample size problem of LDA
    Huang, R
    Liu, QS
    Lu, HQ
    Ma, SD
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 29 - 32
  • [4] Co-training method based on margin sample addition
    Liu Z.
    Gao Z.
    Li X.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2018, 39 (03): : 45 - 53
  • [5] A Novel Virtual Sample Generation Method to Overcome the Small Sample Size Problem in Computer Aided Medical Diagnosing
    Wedyan, Mohammad
    Crippa, Alessandro
    Al-Jumaily, Adel
    ALGORITHMS, 2019, 12 (08)
  • [6] Enhancing co-training algorithms by sample representativeness
    Zou, Xitao
    Zou, Xianchun
    Yu, Guoxian
    Fu, Xinghu
    Journal of Computational Information Systems, 2014, 10 (16): : 6883 - 6890
  • [7] A novel virtual sample generation method based on a modified conditional Wasserstein GAN to address the small sample size problem in soft sensing
    He, Yan-Lin
    Li, Xing-Yuan
    Ma, Jia-Hui
    Lu, Shan
    Zhu, Qun-Xiong
    JOURNAL OF PROCESS CONTROL, 2022, 113 : 18 - 28
  • [8] Integrating virtual sample generation with input-training neural network for solving small sample size problems: application to purified terephthalic acid solvent system
    Zhong-Sheng Chen
    Qun-Xiong Zhu
    Yuan Xu
    Yan-Lin He
    Qing-Lin Su
    Yiqing C. Liu
    Zoltan K. Nagy
    Soft Computing, 2021, 25 : 6489 - 6504
  • [9] Integrating virtual sample generation with input-training neural network for solving small sample size problems: application to purified terephthalic acid solvent system
    Chen, Zhong-Sheng
    Zhu, Qun-Xiong
    Xu, Yuan
    He, Yan-Lin
    Su, Qing-Lin
    Liu, Yiqing C.
    Nagy, Zoltan K.
    SOFT COMPUTING, 2021, 25 (08) : 6489 - 6504
  • [10] On Solving the Small Sample Size Problem for Marginal Fisher Analysis
    Dornaika, Fadi
    Bosagzadeh, Alireza
    IMAGE ANALYSIS AND RECOGNITION, 2013, 7950 : 116 - 123