Comparison of Sparse and Jack-knife partial least squares regression methods for variable selection

被引:31
|
作者
Karaman, Ibrahim [1 ]
Qannari, El Mostafa [2 ,3 ]
Martens, Harald [4 ,5 ]
Hedemann, Mette Skou [1 ]
Knudsen, Knud Erik Bach [1 ]
Kohler, Achim [4 ,5 ]
机构
[1] Aarhus Univ, Dept Anim Sci, DK-8830 Tjele, Denmark
[2] LUNAM Univ, ONIRIS, USC Sensometr & Chemometr Lab, F-44322 Nantes, France
[3] INRA, F-44316 Nantes, France
[4] Nofima Norwegian Inst Food Fisheries & Aquacultur, N-1431 As, Norway
[5] Norwegian Univ Life Sci, Dept Math Sci & Technol IMT, Ctr Integrat Genet CIGENE, N-1432 As, Norway
关键词
Sparse PLSR; Jack-knife PLSR; Cross model validation; Perturbation parameter; PRINCIPAL COMPONENT ANALYSIS; PLS REGRESSION; SPECTROSCOPY; VALIDATION; REDUCTION;
D O I
10.1016/j.chemolab.2012.12.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of this study was to compare two different techniques of variable selection, Sparse PLSR and Jack-knife PLSR, with respect to their predictive ability and their ability to identify relevant variables. Sparse PLSR is a method that is frequently used in genomics, whereas Jack-knife PLSR is often used by chemometricians. In order to evaluate the predictive ability of both methods, cross model validation was implemented. The performance of both methods was assessed using FTIR spectroscopic data, on the one hand, and a set of simulated data. The stability of the variable selection procedures was highlighted by the frequency of the selection of each variable in the cross model validation segments. Computationally, Jack-knife PLSR was much faster than Sparse PLSR. But while it was found that both methods have more or less the same predictive ability, Sparse PLSR turned out to be generally very stable in selecting the relevant variables, whereas Jack-knife PLSR was very prone to selecting also uninformative variables. To remedy this drawback, a strategy of analysis consisting in adding a perturbation parameter to the uncertainty variances obtained by means of Jack-knife PLSR is demonstrated. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:65 / 77
页数:13
相关论文
共 50 条
  • [1] Comparison of variable selection methods in partial least squares regression
    Mehmood, Tahir
    Saebo, Solve
    Liland, Kristian Hovde
    JOURNAL OF CHEMOMETRICS, 2020, 34 (06)
  • [2] Modified Jack-knife estimation of parameter uncertainty in bilinear modelling by partial least squares regression (PLSR)
    Martens, H
    Martens, M
    FOOD QUALITY AND PREFERENCE, 2000, 11 (1-2) : 5 - 16
  • [3] A review of variable selection methods in Partial Least Squares Regression
    Mehmood, Tahir
    Liland, Kristian Hovde
    Snipen, Lars
    Saebo, Solve
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 118 : 62 - 69
  • [4] Sparse partial least squares regression for simultaneous dimension reduction and variable selection
    Chun, Hyonho
    Keles, Suenduez
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2010, 72 : 3 - 25
  • [5] Sparse partial least squares regression for on-line variable selection with multivariate data streams
    McWilliams B.
    Montana G.
    Statistical Analysis and Data Mining, 2010, 3 (03): : 170 - 193
  • [6] A variant of sparse partial least squares for variable selection and data exploration
    Hunt, Megan J. Olson
    Weissfeld, Lisa
    Boudreau, Robert M.
    Aizenstein, Howard
    Newman, Anne B.
    Simonsick, Eleanor M.
    Van Domelen, Dane R.
    Thomas, Fridtjof
    Yaffe, Kristine
    Rosano, Caterina
    FRONTIERS IN NEUROINFORMATICS, 2014, 8
  • [7] Sparse kernel partial least squares regression
    Momma, M
    Bennett, KP
    LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 216 - 230
  • [8] Comparison of Different Variable Selection Methods for Partial Least Squares Soft Sensor Development
    Wang, Zi Xiu
    He, Qinghua
    Wang, Jin
    2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 3116 - 3121
  • [9] A partition-based variable selection in partial least squares regression
    Li, Chuan-Quan
    Fang, Zhaoyu
    Xu, Qing-Song
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 198
  • [10] On some aspects of variable selection for partial least squares regression models
    Roy, Partha Pratim
    Roy, Kunal
    QSAR & COMBINATORIAL SCIENCE, 2008, 27 (03): : 302 - 313