An efficient variable selection method based on variable permutation and model population analysis for multivariate calibration of NIR spectra

被引:29
|
作者
Bin, Jun [1 ]
Ai, Fangfang [2 ]
Fan, Wei [1 ]
Zhou, Jiheng [1 ]
Li, Xin [1 ]
Tang, Wenxian [3 ]
Liang, Yizeng [3 ]
机构
[1] Hunan Agr Univ, Coll Biosci & Biotechnol, Changsha, Hunan, Peoples R China
[2] Shanghai Tobacco Grp Co Ltd, Shanghai, Peoples R China
[3] Cent S Univ, Coll Chem & Chem Engn, Changsha, Hunan, Peoples R China
关键词
Variable selection; Partial least squares; Variable permutation population analysis; Model population analysis; Exponentially decreasing function; Multivariate spectral calibration; WAVELENGTH INTERVAL SELECTION; LEAST-SQUARES REGRESSION; GENETIC ALGORITHMS; RANDOM FROG; SPECTROSCOPY; ELIMINATION; PLS;
D O I
10.1016/j.chemolab.2016.08.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Variable selection plays a pivotal role in the quantitative analysis of near-infrared (NIR) spectra with large number of variables and relatively few samples. In this study, a novel algorithm, namely variable permutation population analysis (VPPA) which combines variable permutation, model population analysis (MPA) and exponentially decreasing function (EDF), was proposed for variable selection to improve the prediction performance in multivariate spectral calibration. This method builds a large number of sub-datasets by Monte Carlo sampling (MCS) strategy in both sample space and variable space firstly, and the importance of each variable is subsequently evaluated using the difference value order of the corresponding partial least squares (PLS) model prediction error before and after the variable permutation. Next, EDF is applied to eliminate the relatively uninformative variables by force. Ultimately, cross validation is utilized to choose the optimal variable subset. A complete methodology for variable selection is constructed through the above four procedures. Three near infrared (NIR) datasets were presented to illustrate the proposed method and evaluate its performance. While PLS is used as the modeling method, the results reveal that VPPA is a potential variable selection method which shows better prediction performance when compared with conventional PLS, subwindow permutation analysis PIS (SPA-PLS), Monte Carlo uninformative variable elimination by PLS (MC-UVE-PLS), competitive adaptive reweighted sampling PLS (CARS-PLS) and genetic algorithm PLS (GA- PIS). Moreover, the proposed approach employs fewer variables than these variable optimization methods mentioned above. Therefore, the VPPA technique can be recommended for practical implementation in multivariate calibration of NIR spectra. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [1] An efficient variable selection method based on random frog for the multivariate calibration of NIR spectra
    Sun, Jingjing
    Yang, Wude
    Feng, Meichen
    Liu, Qifang
    Kubar, Muhammad Saleem
    RSC ADVANCES, 2020, 10 (28) : 16245 - 16253
  • [2] A novel variable selection method based on stability and variable permutation for multivariate calibration
    Chen, Junming
    Yang, Chunhua
    Zhu, Hongqiu
    Li, Yonggang
    Gui, Weihua
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 182 : 188 - 201
  • [3] Using variable combination population analysis for variable selection in multivariate calibration
    Yun, Yong-Huan
    Wang, Wei-Ting
    Deng, Bai-Chuan
    Lai, Guang-Bi
    Liu, Xin-bo
    Ren, Da-Bing
    Liang, Yi-Zeng
    Fan, Wei
    Xu, Qing-Song
    ANALYTICA CHIMICA ACTA, 2015, 862 : 14 - 23
  • [4] A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra
    Cai, Wensheng
    Li, Yankun
    Shao, Xueguang
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2008, 90 (02) : 188 - 194
  • [5] Variable interaction network based variable selection for multivariate calibration
    Rao, Raghuraj
    Lakshminarayanan, S.
    ANALYTICA CHIMICA ACTA, 2007, 599 (01) : 24 - 35
  • [6] Variable selection in multivariate calibration based on clustering of variable concept
    Farrokhnia, Maryam
    Karimi, Sadegh
    ANALYTICA CHIMICA ACTA, 2016, 902 : 70 - 81
  • [7] A model population analysis method for variable selection based on mutual information
    Long, Xu-Xia
    Li, Hong-Dong
    Fan, Wei
    Xu, Qing-Song
    Liang, Yi-Zeng
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2013, 121 : 75 - 81
  • [8] Efficient Input Variable Selection for Calibration Model Design
    Fujiwara, Koichi
    Kano, Manabu
    2013 9TH ASIAN CONTROL CONFERENCE (ASCC), 2013,
  • [9] A hybrid variable selection strategy based on continuous shrinkage of variable space in multivariate calibration
    Yun, Yong-Huan
    Bin, Jun
    Liu, Dong-Li
    Xu, Lin
    Yan, Ting-Liang
    Cao, Dong-Sheng
    Xu, Qing-Song
    ANALYTICA CHIMICA ACTA, 2019, 1058 : 58 - 69
  • [10] Model population analysis for variable selection
    Li, Hong-Dong
    Liang, Yi-Zeng
    Xu, Qing-Song
    Cao, Dong-Sheng
    JOURNAL OF CHEMOMETRICS, 2010, 24 (7-8) : 418 - 423