A feature-based soft sensor for spectroscopic data analysis

被引:23
|
作者
Shah, Devarshi [1 ]
Wang, Jin [1 ]
He, Q. Peter [1 ]
机构
[1] Auburn Univ, Auburn, AL 36849 USA
基金
美国国家科学基金会;
关键词
Soft sensor; Variable selection; Multivariate regression; Partial least squares; Kernel partial least squares; Statistics pattern analysis; NIR; UV/Vis; Chemometrics; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; LATENT STRUCTURES; PLS; REGRESSION; PROJECTION;
D O I
10.1016/j.jprocont.2019.03.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the last few decades, spectroscopic techniques such as near-infrared (NIR) and UV/vis spectroscopies have gained wide applications. As a result, various soft sensors have been developed to predict sample properties from its spectroscopic readings. Because the readings at different wavelengths are highly correlated, it has been shown that variable selection could significantly improve a soft sensor's prediction performance and reduce the model complexity. Currently, almost all variable selection methods focus on how to select the variables (i.e., wavelengths or wavelength segments) that are strongly correlated with the dependent variable to improve the prediction performance. Although many successful applications have been reported, such variable selection methods do have their limitations, such as high sensitivity to the choice of training data, and deteriorated performance when testing on new samples. One possible reason is the removal of useful wavelengths or segments of wavelengths during the calibration process, which could be "tilted" to overfit or capture the noise or unknown disturbances contained in the calibration data. As a result, the model prediction performance may deteriorate significantly when the model is extrapolated or applied to new samples. To address this limitation, we propose a feature-based soft sensor approach utilizing statistics pattern analysis (SPA). Instead of selecting certain wavelengths or wavelength segments, the SPA-based method considers the whole spectrum which is divided into segments, and extracts different features over each spectrum segment to build the soft sensor. In other words, the SPA model contains the complete information from the full spectrum without any selection or removal, which we believe is the main reason for the high robustness of the SPA-based method. In addition, we propose a Monte Carlo validation and testing (MCVT) procedure and three MCVT-based performance indices for consistent and fair comparison of different soft sensor methods across different datasets. The MCVT procedure and indices are generally applicable for model comparison in other applications. Four case studies are presented to demonstrate the performance of the feature-based soft sensor and to compare it with a full partial least squares (PLS), a least absolute shrinkage and selection operator (Lasso), and a synergy interval PLS (SiPLS) based models following the proposed MCVT procedure. In addition, we examine the potential of kernel PLS (KPLS) based soft sensor approaches, examine their performances, and discuss their pros and cons. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:98 / 107
页数:10
相关论文
共 50 条
  • [1] Feature-based biological sensor fusion
    Blasch, EP
    Gainey, JC
    FUSION'98: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MULTISOURCE-MULTISENSOR INFORMATION FUSION, VOLS 1 AND 2, 1998, : 702 - 709
  • [2] Feature-Based Statistical Analysis of Combustion Simulation Data
    Bennett, Janine C.
    Krishnamoorthy, Vaidyanathan
    Liu, Shusen
    Grout, Ray W.
    Hawkes, Evatt R.
    Chen, Jacqueline H.
    Shepherd, Jason
    Pascucci, Valerio
    Bremer, Peer-Timo
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) : 1822 - 1831
  • [3] Feature-based data management
    Bhat, Srinivasa K., 1600, (111):
  • [4] Feature-based analysis of large-scale spatio-temporal sensor data on hybrid architectures
    Saltz, Joel H.
    Teodoro, George
    Pan, Tony
    Cooper, Lee A. D.
    Kong, Jun
    Klasky, Scott
    Kurc, Tahsin M.
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2013, 27 (03): : 263 - 272
  • [5] Feature-Based Analysis of Plasma-Based Particle Acceleration Data
    Ruebel, Oliver
    Geddes, Cameron G. R.
    Chen, Min
    Cormier-Michel, Estelle
    Bethel, E. Wes
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (02) : 196 - 210
  • [6] Adaptive Sensor Management for Feature-Based Classification
    Jenkins, Karen
    Castanon, David A.
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 522 - 527
  • [7] Feature-based image analysis
    Lillholm, M
    Nielsen, M
    Griffin, LD
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 52 (2-3) : 73 - 95
  • [8] Feature-Based Image Analysis
    Martin Lillholm
    Mads Nielsen
    Lewis D. Griffin
    International Journal of Computer Vision, 2003, 52 : 73 - 95
  • [9] Feature-based data assimilation in geophysics
    Morzfeld, Matthias
    Adams, Jesse
    Lunderman, Spencer
    Orozco, Rafael
    NONLINEAR PROCESSES IN GEOPHYSICS, 2018, 25 (02) : 355 - 374
  • [10] Feature-Based Data Stream Clustering
    Asbagh, Mohsen Jafari
    Abolhassani, Hassan
    PROCEEDINGS OF THE 8TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, 2009, : 363 - 368