Comparison of Sparse and Jack-knife partial least squares regression methods for variable selection

被引:31
|
作者
Karaman, Ibrahim [1 ]
Qannari, El Mostafa [2 ,3 ]
Martens, Harald [4 ,5 ]
Hedemann, Mette Skou [1 ]
Knudsen, Knud Erik Bach [1 ]
Kohler, Achim [4 ,5 ]
机构
[1] Aarhus Univ, Dept Anim Sci, DK-8830 Tjele, Denmark
[2] LUNAM Univ, ONIRIS, USC Sensometr & Chemometr Lab, F-44322 Nantes, France
[3] INRA, F-44316 Nantes, France
[4] Nofima Norwegian Inst Food Fisheries & Aquacultur, N-1431 As, Norway
[5] Norwegian Univ Life Sci, Dept Math Sci & Technol IMT, Ctr Integrat Genet CIGENE, N-1432 As, Norway
关键词
Sparse PLSR; Jack-knife PLSR; Cross model validation; Perturbation parameter; PRINCIPAL COMPONENT ANALYSIS; PLS REGRESSION; SPECTROSCOPY; VALIDATION; REDUCTION;
D O I
10.1016/j.chemolab.2012.12.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of this study was to compare two different techniques of variable selection, Sparse PLSR and Jack-knife PLSR, with respect to their predictive ability and their ability to identify relevant variables. Sparse PLSR is a method that is frequently used in genomics, whereas Jack-knife PLSR is often used by chemometricians. In order to evaluate the predictive ability of both methods, cross model validation was implemented. The performance of both methods was assessed using FTIR spectroscopic data, on the one hand, and a set of simulated data. The stability of the variable selection procedures was highlighted by the frequency of the selection of each variable in the cross model validation segments. Computationally, Jack-knife PLSR was much faster than Sparse PLSR. But while it was found that both methods have more or less the same predictive ability, Sparse PLSR turned out to be generally very stable in selecting the relevant variables, whereas Jack-knife PLSR was very prone to selecting also uninformative variables. To remedy this drawback, a strategy of analysis consisting in adding a perturbation parameter to the uncertainty variances obtained by means of Jack-knife PLSR is demonstrated. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:65 / 77
页数:13
相关论文
共 50 条
  • [41] Variable selection in partial linear regression using the least angle regression
    Seo, Han Son
    Yoon, Min
    Lee, Hakbae
    KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (06) : 937 - 944
  • [42] Identifying Early Indicators of Tail Biting in Pigs by Variable Selection Using Partial Least Squares Regression
    Drexl, Veronika
    Dittrich, Imme
    Wilder, Thore
    Diers, Sophie
    Krieter, Joachim
    ANIMALS, 2023, 13 (01):
  • [43] Integrative sparse partial least squares
    Liang, Weijuan
    Ma, Shuangge
    Zhang, Qingzhao
    Zhu, Tingyu
    STATISTICS IN MEDICINE, 2021, 40 (09) : 2239 - 2256
  • [44] Global Sparse Partial Least Squares
    Mou, Yi
    You, Xinge
    Jiang, Xiubao
    Xu, Duanquan
    Yu, Shujian
    2014 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2014, : 349 - 352
  • [45] Bayesian Sparse Partial Least Squares
    Vidaurre, Diego
    van Gerven, Marcel A. J.
    Bielza, Concha
    Larranaga, Pedro
    Heskes, Tom
    NEURAL COMPUTATION, 2013, 25 (12) : 3318 - 3339
  • [46] Partial least squares regression as an alternative to current regression methods used in ecology
    Carrascal, Luis M.
    Galvan, Ismael
    Gordo, Oscar
    OIKOS, 2009, 118 (05) : 681 - 690
  • [47] Sparse matrix transform based weight updating in partial least squares regression
    Jiangtao Peng
    Journal of Mathematical Chemistry, 2014, 52 : 2197 - 2209
  • [48] The application of sparse partial least squares regression in electricity consumption of Yunnan province
    Peng Shuting
    Dai Lin
    Guo Tingting
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ELECTRONIC TECHNOLOGY, 2016, 48 : 299 - 303
  • [49] Sparse matrix transform based weight updating in partial least squares regression
    Peng, Jiangtao
    JOURNAL OF MATHEMATICAL CHEMISTRY, 2014, 52 (08) : 2197 - 2209
  • [50] Locally sparse and robust partial least squares in scalar-on-function regression
    Gurer, Sude
    Shang, Han Lin
    Mandal, Abhijit
    Beyaztas, Ufuk
    STATISTICS AND COMPUTING, 2024, 34 (05)