Comparison of Sparse and Jack-knife partial least squares regression methods for variable selection

被引:31
|
作者
Karaman, Ibrahim [1 ]
Qannari, El Mostafa [2 ,3 ]
Martens, Harald [4 ,5 ]
Hedemann, Mette Skou [1 ]
Knudsen, Knud Erik Bach [1 ]
Kohler, Achim [4 ,5 ]
机构
[1] Aarhus Univ, Dept Anim Sci, DK-8830 Tjele, Denmark
[2] LUNAM Univ, ONIRIS, USC Sensometr & Chemometr Lab, F-44322 Nantes, France
[3] INRA, F-44316 Nantes, France
[4] Nofima Norwegian Inst Food Fisheries & Aquacultur, N-1431 As, Norway
[5] Norwegian Univ Life Sci, Dept Math Sci & Technol IMT, Ctr Integrat Genet CIGENE, N-1432 As, Norway
关键词
Sparse PLSR; Jack-knife PLSR; Cross model validation; Perturbation parameter; PRINCIPAL COMPONENT ANALYSIS; PLS REGRESSION; SPECTROSCOPY; VALIDATION; REDUCTION;
D O I
10.1016/j.chemolab.2012.12.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of this study was to compare two different techniques of variable selection, Sparse PLSR and Jack-knife PLSR, with respect to their predictive ability and their ability to identify relevant variables. Sparse PLSR is a method that is frequently used in genomics, whereas Jack-knife PLSR is often used by chemometricians. In order to evaluate the predictive ability of both methods, cross model validation was implemented. The performance of both methods was assessed using FTIR spectroscopic data, on the one hand, and a set of simulated data. The stability of the variable selection procedures was highlighted by the frequency of the selection of each variable in the cross model validation segments. Computationally, Jack-knife PLSR was much faster than Sparse PLSR. But while it was found that both methods have more or less the same predictive ability, Sparse PLSR turned out to be generally very stable in selecting the relevant variables, whereas Jack-knife PLSR was very prone to selecting also uninformative variables. To remedy this drawback, a strategy of analysis consisting in adding a perturbation parameter to the uncertainty variances obtained by means of Jack-knife PLSR is demonstrated. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:65 / 77
页数:13
相关论文
共 50 条
  • [31] A new model selection criterion for partial least squares regression
    Martinez, Jose L.
    Saulo, Helton
    Barrios Escobar, Humberto
    Leao, Jeremias
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 169 : 64 - 78
  • [32] Partial least squares regression
    deJong, S
    Phatak, A
    RECENT ADVANCES IN TOTAL LEAST SQUARES TECHNIQUES AND ERRORS-IN-VARIABLES MODELING, 1997, : 25 - 36
  • [33] A Comparison of Sparse Partial Least Squares and Elastic Net in Wavelength Selection on NIR Spectroscopy Data
    Fu, Guang-Hui
    Zong, Min-Jie
    Wang, Feng-Hua
    Yi, Lun-Zhao
    INTERNATIONAL JOURNAL OF ANALYTICAL CHEMISTRY, 2019, 2019
  • [34] A Partial Least Squares based algorithm for parsimonious variable selection
    Mehmood, Tahir
    Martens, Harald
    Saebo, Solve
    Warringer, Jonas
    Snipen, Lars
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6
  • [35] Variable selection for partial least squares modeling by genetic algorithms
    Chu, XL
    Yuan, HF
    Wang, YB
    Lu, WZ
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2001, 29 (04) : 437 - 442
  • [36] Variable selection in discriminant partial least-squares analysis
    Alsberg, BK
    Kell, DB
    Goodacre, R
    ANALYTICAL CHEMISTRY, 1998, 70 (19) : 4126 - 4133
  • [37] A Partial Least Squares based algorithm for parsimonious variable selection
    Tahir Mehmood
    Harald Martens
    Solve Sæbø
    Jonas Warringer
    Lars Snipen
    Algorithms for Molecular Biology, 6
  • [38] Comparison of principal components regression, partial least squares regression, multi-block partial least squares regression, and serial partial least squares regression algorithms for the analysis of Fe in iron ore using LIBS
    Yaroshchyk, P.
    Death, D. L.
    Spencer, S. J.
    JOURNAL OF ANALYTICAL ATOMIC SPECTROMETRY, 2012, 27 (01) : 92 - 98
  • [39] Variable selection in random calibration of near-infrared instruments: ridge regression and partial least squares regression settings
    Gusnanto, A
    Pawitan, Y
    Huang, J
    Lane, B
    JOURNAL OF CHEMOMETRICS, 2003, 17 (03) : 174 - 185
  • [40] Variable selection using axis-aligned random projections for partial least-squares regression
    Lin, Youwu
    Zeng, Xin
    Wang, Pei
    Huang, Shuai
    Teo, Kok Lay
    STATISTICS AND COMPUTING, 2024, 34 (03)