Variable Selection in Visible and Near-Infrared Spectral Analysis for Noninvasive Determination of Soluble Solids Content of ‘Ya’ Pear

被引:0
|
作者
Jiangbo Li
Wenqian Huang
Liping Chen
Shuxiang Fan
Baohua Zhang
Zhiming Guo
Chunjiang Zhao
机构
[1] Beijing Academy of Agriculture and Forestry Sciences,Beijing Research Center of Intelligent Equipment for Agriculture
[2] China Agricultural University,College of Engineering
来源
Food Analytical Methods | 2014年 / 7卷
关键词
Near infrared spectroscopy; Monte Carlo–uninformative variable elimination; Successive projections algorithm; Variable selection; Soluble solids content; ‘Ya’ pear;
D O I
暂无
中图分类号
学科分类号
摘要
Informative variable selection or wavelength selection plays an important role in the quantitative analysis of near-infrared (NIR) spectra because the modern spectroscopy instrumentations usually have a high resolution and the obtained spectral data sets may have thousands of variables and hundreds or thousands of samples. In this study, a new combination of Monte Carlo–uninformative variable elimination (MC-UVE) and successive projections algorithm (SPA; MC-UVE-SPA) was proposed to select the most effective variables. MC-UVE was firstly used to eliminate the uninformative variables in the raw spectra data. Then, SPA was applied to determine the variables with the least collinearity. A case study was done based on the NIR spectroscopy for the non-destructive determination of soluble solids content (SSC) in ‘Ya’ pear. A total of 160 samples were prepared for the calibration (n = 120) and prediction (n = 40) sets. Three calibration algorithms including linear regressions of partial least square regression (PLS) and multiple linear regression (MLR), and nonlinear regression of least-square support vector machine (LS-SVM) were used for model establishment by using the selected variables by SPA, UVE, MC-UVE, UVE-SPA, and MC-UVE-SPA, respectively. The results indicated that linear models such as PLS and MLR were more effective than nonlinear model such as LS-SVM in the prediction of SSC of ‘Ya’ pear. In terms of linear models, different variable selection methods can obtain a similar result with the RMSEP values range from 0.2437 to 0.2830. However, combination of MC-UVE and SPA was helpful for obtaining a more parsimonious and efficient model for predicting the SSC values in ‘Ya’ pear. Twenty-two effective variables selected by MC-UVE-SPA achieved the optimal linear MC-UVE-SPA-MLR model compared with other all developed models by balancing between model accuracy and model complexity. The coefficients of determination (r2), root mean square error of prediction, and residual predictive deviation by MC-UVE-SPA-MLR were 0.9271, 0.2522, and 3.7037, respectively.
引用
收藏
页码:1891 / 1902
页数:11
相关论文
共 50 条
  • [11] Variable selection for partial least squares analysis of soluble solids content in watermelon using near-infrared diffuse transmission technique
    Jie, Dengfei
    Xie, Lijuan
    Fu, Xiaping
    Rao, Xiuqin
    Ying, Yibin
    JOURNAL OF FOOD ENGINEERING, 2013, 118 (04) : 387 - 392
  • [12] Improving accuracy of prediction model for soluble solids content of watermelon by variable selection based on near-infrared spectroscopy
    Land Use and Technology Department, China University of Geosciences , Beijing 100083, China
    不详
    Nongye Gongcheng Xuebao, 2013, 12 (264-270):
  • [13] Hybrid variable selection in visible and near-infrared spectral analysis for non-invasive quality determination of grape juice
    Wu, Di
    He, Yong
    Nie, Pengcheng
    Cao, Fang
    Bao, Yidan
    ANALYTICA CHIMICA ACTA, 2010, 659 (1-2) : 229 - 237
  • [14] Near-infrared spectrometric method for nondestructive determination of soluble solids content of peaches
    Peiris, KHS
    Dull, GG
    Leffler, RG
    Kays, SJ
    JOURNAL OF THE AMERICAN SOCIETY FOR HORTICULTURAL SCIENCE, 1998, 123 (05) : 898 - 905
  • [15] Interpretable Perturbator for Variable Selection in near-Infrared Spectral Analysis
    Duan, Chaoshu
    Liu, Xuyang
    Cai, Wensheng
    Shao, Xueguang
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 64 (07) : 2508 - 2514
  • [16] Online quantitative analysis of soluble solids content in navel oranges using visible-near infrared spectroscopy and variable selection methods
    Liu, Yande
    Zhou, Yanrui
    Pan, Yuanyuan
    JOURNAL OF INNOVATIVE OPTICAL HEALTH SCIENCES, 2014, 7 (06)
  • [17] Variable selection in visible and near-infrared spectra: Application to on-line determination of sugar content in pears
    Xu, Huirong
    Qi, Bing
    Sun, Tong
    Fu, Xiaping
    Ying, Yibin
    JOURNAL OF FOOD ENGINEERING, 2012, 109 (01) : 142 - 147
  • [18] Measurement of soluble solids content in pear by FTNIR spectroscopy and variable selection
    Zhu W.
    Jiang H.
    Chen Q.
    Guo J.
    Nongye Jixie Xuebao/Transactions of the Chinese Society of Agricultural Machinery, 2010, 41 (10): : 129 - 133
  • [19] Research on the soluble solids content of pear internal quality index by near-infrared diffuse reflectance spectroscopy
    Liu Yan-de
    Sun Xu-dong
    Chen Xing-miao
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2008, 28 (04) : 797 - 800
  • [20] Study on the Influence of Light Intensity on Near-Infrared Diffuse Reflectance Spectra of Pear Soluble Solids Content
    Wu Fang-long
    Shen Huang-tong
    Wu Chen-kai
    Yu Yong-hua
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2013, 33 (10) : 2671 - 2674