Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration

被引:1565
|
作者
Li, Hongdong [1 ]
Liang, Yizeng [1 ]
Xu, Qingsong [2 ]
Cao, Dongsheng [1 ]
机构
[1] Cent S Univ, Coll Chem & Chem Engn, Res Ctr Modernizat Tradit Chinese Med, Changsha 410083, Peoples R China
[2] Cent S Univ, Sch Math Sci, Changsha 410083, Peoples R China
关键词
Wavelength selection; Monte Carlo; Adaptive reweighted sampling; Model sampling; Near infrared; Multivariate calibration; PARTIAL LEAST-SQUARES; UNINFORMATIVE VARIABLE ELIMINATION; SUCCESSIVE PROJECTIONS ALGORITHM; GENETIC-ALGORITHM; REGRESSION APPLICATION; SELECTION; OPTIMIZATION; COMPONENTS; MODELS; TOOL;
D O I
10.1016/j.aca.2009.06.046
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
By employing the simple but effective principle 'survival of the fittest' on which Darwin's Evolution Theory is based, a novel strategy for selecting an optimal combination of key wavelengths of multi-component spectral data, named competitive adaptive reweighted sampling (CARS), is developed. Key wavelengths are defined as the wavelengths with large absolute coefficients in a multivariate linear regression model, such as partial least squares (PLS). In the present work, the absolute values of regression coefficients of PLS model are used as an index for evaluating the importance of each wavelength. Then, based on the importance level of each wavelength, CARS sequentially selects N subsets of wavelengths from N Monte Carlo (MC) sampling runs in an iterative and competitive manner. In each sampling run, a fixed ratio (e.g. 80%) of samples is first randomly selected to establish a calibration model. Next, based on the regression coefficients, a two-step procedure including exponentially decreasing function (EDF) based enforced wavelength selection and adaptive reweighted sampling (ARS) based competitive wavelength selection is adopted to select the key wavelengths. Finally, cross validation (CV) is applied to choose the subset with the lowest root mean square error of CV (RMSECV). The performance of the proposed procedure is evaluated using one simulated dataset together with one near infrared dataset of two properties. The results reveal an outstanding characteristic of CARS that it can usually locate an optimal combination of some key wavelengths which are interpretable to the chemical property of interest. Additionally, our study shows that better prediction is obtained by CARS when compared to full spectrum PLS modeling, Monte Carlo uninformative variable elimination (MC-UVE) and moving window partial least squares regression (MWPLSR). (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:77 / 84
页数:8
相关论文
共 50 条
  • [21] Characterization of Tobacco with Near-Infrared Spectroscopy with Competitive Adaptive Reweighted Sampling and Partial Least Squares Discrimination
    Wu, Lijun
    Wang, Baoxing
    Yin, Yanfei
    Duan, Rumin
    Xie, Zhiqiang
    Liu, En-Fen
    Bai, Xiaoli
    ANALYTICAL LETTERS, 2016, 49 (14) : 2290 - 2300
  • [22] Correction to: Rapid spectral analysis of agro-products using an optimal strategy: dynamic backward interval PLS–competitive adaptive reweighted sampling
    Xiangzhong Song
    Guorong Du
    Qianqian Li
    Guo Tang
    Yue Huang
    Analytical and Bioanalytical Chemistry, 2020, 412 : 8453 - 8453
  • [23] Improvement of near infrared spectroscopic (NIRS) analysis of caffeine in roasted Arabica coffee by variable selection method of stability competitive adaptive reweighted sampling (SCARS)
    Zhang, Xuan
    Li, Wei
    Yin, Bin
    Chen, Weizhong
    Kelly, Declan P.
    Wang, Xiaoxin
    Zheng, Kaiyi
    Du, Yiping
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2013, 114 : 350 - 356
  • [24] Rapid spectral analysis of agro-products using an optimal strategy: dynamic backward interval PLS-competitive adaptive reweighted sampling
    Song, Xiangzhong
    Du, Guorong
    Li, Qianqian
    Tang, Guo
    Huang, Yue
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2020, 412 (12) : 2795 - 2804
  • [25] Quantitative analysis of near infrared spectroscopic data based on dual-band transformation and competitive adaptive reweighted sampling
    Li, Yiming
    Yang, Xinwu
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2023, 285
  • [26] Spectroscopic diagnosis of zinc contaminated soils based on competitive adaptive reweighted sampling algorithm and an improved support vector machine
    Xu, Xibo
    Ren, Mengyan
    Cao, Jianfei
    Wu, Quanyuan
    Liu, Peiyuan
    Lv, Jianshu
    SPECTROSCOPY LETTERS, 2020, 53 (02) : 86 - 99
  • [27] Optimizing Rice Near-Infrared Models Using Fractional Order Savitzky-Golay Derivation (FOSGD) Combined with Competitive Adaptive Reweighted Sampling (CARS)
    Xia, Zhenzhen
    Yang, Jie
    Wang, Jing
    Wang, Shengpeng
    Liu, Yan
    APPLIED SPECTROSCOPY, 2020, 74 (04) : 417 - 426
  • [28] AN ADAPTIVE SYNTHESIS CALIBRATION METHOD FOR TIME-INTERLEAVED SAMPLING SYSTEMS
    Pan, Huiqing
    Tian, Shulin
    Ye, Peng
    METROLOGY AND MEASUREMENT SYSTEMS, 2010, 17 (03) : 405 - 414
  • [29] Rapid fatty acids detection of vegetable oils by Raman spectroscopy based on competitive adaptive reweighted sampling coupled with support vector regression
    Pang, Linjiang
    Chen, Hui
    Yin, Liqing
    Cheng, Jiyu
    Jin, Jiande
    Zhao, Honghui
    Liu, Zhihao
    Dong, Longlong
    Yu, Huichun
    Lu, Xinghua
    FOOD QUALITY AND SAFETY, 2022, 6
  • [30] Optimization of Quantitative Modeling of Starch in Huangshui Based on Near-Infrared Spectral Feature Extraction Using Competitive Adaptive Reweighted Sampling Combined with Successive Projections Algorithm
    Mu, Wenzhu
    Zhang, Guiyu
    Zhang, Wei
    Yao, Rui
    Fu, Ni
    Shipin Kexue/Food Science, 2024, 45 (19): : 8 - 14