Variable selection in partial least squares with the weighted variable contribution to the first singular value of the covariance matrix

被引:4
|
作者
Lin, Weilu [1 ]
Hang, Haifeng [1 ]
Zhuang, Yingping [1 ]
Zhang, Siliang [1 ]
机构
[1] East China Univ Sci & Technol, State Key Lab Bioreactor Engn, 130 Meilong Rd, Shanghai 200237, Peoples R China
关键词
Informative variables; Interval variable selection; Partial least squares; Variable contribution; Maximal singular value; Spectroscopy; LATENT STRUCTURES; REGRESSION; PROJECTIONS; REGIONS; PLS;
D O I
10.1016/j.chemolab.2018.11.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The selection of informative variables in partial least squares (PLS) is important in process analytical technology (PAT) applications in the pharmaceutical industry, for example, the calibration of spectrometers. In the past, numerous approaches have been proposed to select the variables in partial least squares. In this work, a new variable selection method for PLS with the weighted variable contribution (PLS-WVC) to the first singular value of the covariance matrix for each PLS component is proposed. Several variants of PLS-WVC with different weighting factors are proposed. One variant of PLS-WVC is equivalent to the PLS with variable importance in projection (PIS-VIP). However, the variants with the correlation between X(gamma)w(gamma), and Y(gamma)q(gamma) as the weighting factor are preferred based on the results of the simulation cases studies. The proposed PLS-WVCs are integrated with interval PLS (iPLS) further to select the informative wavelength intervals for spectroscopic modelling. The utility of the proposed WVC based variable selection methods in PIS is demonstrated with the real spectral data sets.
引用
收藏
页码:113 / 121
页数:9
相关论文
共 50 条
  • [31] Comparison of Sparse and Jack-knife partial least squares regression methods for variable selection
    Karaman, Ibrahim
    Qannari, El Mostafa
    Martens, Harald
    Hedemann, Mette Skou
    Knudsen, Knud Erik Bach
    Kohler, Achim
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2013, 122 : 65 - 77
  • [32] Prediction of Placental Barrier Permeability: A Model Based on Partial Least Squares Variable Selection Procedure
    Zhang, Yong-Hong
    Xia, Zhi-Ning
    Yan, Li
    Liu, Shu-Shen
    MOLECULES, 2015, 20 (05): : 8270 - 8286
  • [33] Multivariate calibration for machine health monitoring: kernel partial least squares combined with variable selection
    Hyun-Woo Cho
    The International Journal of Advanced Manufacturing Technology, 2010, 48 : 691 - 699
  • [34] Variable selection in near infrared spectroscopy based on significance testing in partial least squares regression
    Westad, F
    Martens, H
    JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2000, 8 (02) : 117 - 124
  • [35] Multivariate calibration for machine health monitoring: kernel partial least squares combined with variable selection
    Cho, Hyun-Woo
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2010, 48 (5-8): : 691 - 699
  • [36] Sparse partial least squares regression for on-line variable selection with multivariate data streams
    McWilliams B.
    Montana G.
    Statistical Analysis and Data Mining, 2010, 3 (03): : 170 - 193
  • [37] Influence of variable selection on partial least squares discriminant analysis models for explosive residue classification
    De Lucia, Frank C., Jr.
    Gottfried, Jennifer L.
    SPECTROCHIMICA ACTA PART B-ATOMIC SPECTROSCOPY, 2011, 66 (02) : 122 - 128
  • [38] Partial Least Squares Regression Model with Single Dependent Variable
    Li Yong
    RECENT ADVANCE IN STATISTICS APPLICATION AND RELATED AREAS, VOLS I AND II, 2009, : 1965 - 1968
  • [39] Weighted partial least squares regression by variable grouping strategy for multivariate calibration of near infrared spectra
    Xu, Heng
    Cai, Wensheng
    Shao, Xueguang
    ANALYTICAL METHODS, 2010, 2 (03) : 289 - 294
  • [40] SPARSE VARIABLE SELECTION VIA ITERATIVELY REWEIGHTED LEAST SQUARES
    Deng, Haisong
    Ma, Yizhong
    Shao, Wenze
    PROCEEDINGS OF THE 38TH INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2008, : 467 - 472