A feature-based soft sensor for spectroscopic data analysis

被引:23
|
作者
Shah, Devarshi [1 ]
Wang, Jin [1 ]
He, Q. Peter [1 ]
机构
[1] Auburn Univ, Auburn, AL 36849 USA
基金
美国国家科学基金会;
关键词
Soft sensor; Variable selection; Multivariate regression; Partial least squares; Kernel partial least squares; Statistics pattern analysis; NIR; UV/Vis; Chemometrics; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; LATENT STRUCTURES; PLS; REGRESSION; PROJECTION;
D O I
10.1016/j.jprocont.2019.03.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the last few decades, spectroscopic techniques such as near-infrared (NIR) and UV/vis spectroscopies have gained wide applications. As a result, various soft sensors have been developed to predict sample properties from its spectroscopic readings. Because the readings at different wavelengths are highly correlated, it has been shown that variable selection could significantly improve a soft sensor's prediction performance and reduce the model complexity. Currently, almost all variable selection methods focus on how to select the variables (i.e., wavelengths or wavelength segments) that are strongly correlated with the dependent variable to improve the prediction performance. Although many successful applications have been reported, such variable selection methods do have their limitations, such as high sensitivity to the choice of training data, and deteriorated performance when testing on new samples. One possible reason is the removal of useful wavelengths or segments of wavelengths during the calibration process, which could be "tilted" to overfit or capture the noise or unknown disturbances contained in the calibration data. As a result, the model prediction performance may deteriorate significantly when the model is extrapolated or applied to new samples. To address this limitation, we propose a feature-based soft sensor approach utilizing statistics pattern analysis (SPA). Instead of selecting certain wavelengths or wavelength segments, the SPA-based method considers the whole spectrum which is divided into segments, and extracts different features over each spectrum segment to build the soft sensor. In other words, the SPA model contains the complete information from the full spectrum without any selection or removal, which we believe is the main reason for the high robustness of the SPA-based method. In addition, we propose a Monte Carlo validation and testing (MCVT) procedure and three MCVT-based performance indices for consistent and fair comparison of different soft sensor methods across different datasets. The MCVT procedure and indices are generally applicable for model comparison in other applications. Four case studies are presented to demonstrate the performance of the feature-based soft sensor and to compare it with a full partial least squares (PLS), a least absolute shrinkage and selection operator (Lasso), and a synergy interval PLS (SiPLS) based models following the proposed MCVT procedure. In addition, we examine the potential of kernel PLS (KPLS) based soft sensor approaches, examine their performances, and discuss their pros and cons. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:98 / 107
页数:10
相关论文
共 50 条
  • [41] AUTOMATIC FEATURE-BASED POINT CLOUD REGISTRATION FOR A MOVING SENSOR PLATFORM
    Weinmann, Martin
    Dittrich, Andre
    Hinz, Stefan
    Jutzi, Boris
    ISPRS HANNOVER WORKSHOP 2013, 2013, 40-1 (W-1): : 373 - 378
  • [42] DISTRIBUTED FEATURE-BASED MODULATION CLASSIFICATION USING WIRELESS SENSOR NETWORKS
    Forero, Pedro A.
    Cano, Alfonso
    Giannakis, Georgios B.
    2008 IEEE MILITARY COMMUNICATIONS CONFERENCE: MILCOM 2008, VOLS 1-7, 2008, : 1467 - 1473
  • [43] Feature-based detection and classification of moving objects using LiDAR sensor
    Guo, Ziming
    Cai, Baigen
    Jiang, Wei
    Wang, Jian
    IET INTELLIGENT TRANSPORT SYSTEMS, 2019, 13 (07) : 1088 - 1096
  • [44] A Dynamic Feature-Based Security Detector in Wireless Sensor Network Transceiver
    Tong, Xin
    Xia, Jikang
    Chen, Lan
    Li, Ying
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, PT II, 2014, 8710 : 344 - 352
  • [45] Quantitative optimization of interoperability during feature-based data exchange
    Zhang, D. J.
    He, F. Z.
    Han, S. H.
    Li, X. X.
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2016, 23 (01) : 31 - 50
  • [46] Towards Feature-Based Performance Regression Using Trajectory Data
    Jankovic, Anja
    Eftimov, Tome
    Doerr, Carola
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2021, 2021, 12694 : 601 - 617
  • [47] Feature-based scalable management mechanism of business process data
    Sun, Jun-Yi
    Li, Hou-Fu
    Han, Yan-Bo
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2011, 17 (08): : 1856 - 1863
  • [48] A Method for Data Exchange between Feature-based CAD Models
    Liu Jing
    Wei Ming
    Zhang Jun
    Wen Kun
    Chen Zheng-ming
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 5287 - 5290
  • [49] Feature-Based Researcher Identification Framework Using Timeline Data
    Jangwon Gim
    Yunji Jang
    Hanmin Jung
    Do-Heon Jeong
    Wireless Personal Communications, 2016, 91 : 1653 - 1667
  • [50] Feature-Based Researcher Identification Framework Using Timeline Data
    Gim, Jangwon
    Jang, Yunji
    Jung, Hanmin
    Jeong, Do-Heon
    WIRELESS PERSONAL COMMUNICATIONS, 2016, 91 (04) : 1653 - 1667