Novel comprehensive variable selection algorithm based on multi-weight vector optimal selection and bootstrapping soft shrinkage

被引:4
|
作者
Zhang, Pengfei [1 ]
Xu, Zhuopin [1 ,2 ]
Ma, Huimin [3 ]
Cheng, Weimin [1 ,2 ]
Li, Xiaohong [1 ,2 ]
Tang, Liwen [1 ]
Zhao, Guangxia [1 ,2 ]
Wu, Yuejin [1 ,4 ]
Liu, Zan [1 ]
Wang, Qi [1 ,4 ]
机构
[1] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Hefei 230026, Peoples R China
[3] Anhui Agr Univ, Hefei 230036, Peoples R China
[4] CAS Innovat Acad Seed Design, Hainan Branch, Sanya 572025, Peoples R China
基金
中国国家自然科学基金;
关键词
Chemometrics; Variable selection; Near-infrared spectroscopy; PARTIAL LEAST-SQUARES; NEAR-INFRARED SPECTROSCOPY; WAVELENGTH INTERVAL SELECTION; POPULATION ANALYSIS; REGRESSION; CHEMOMETRICS; NIR;
D O I
10.1016/j.infrared.2023.104800
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
The dimensionality of spectral data is increasing with the advancements in spectral technology. Therefore, there is an urgent need to develop high-performance variable selection algorithms for chemometrics applications. This study proposes a novel multi-weight optimal-bootstrap soft shrinkage (MWO-BOSS) method for variable selection based on the bootstrap soft shrinkage (BOSS) algorithm, comprising three effective improvement strategies. First, the optimal weight vector of six weight vectors are used as weights of the selection variables, rather than the absolute value of the regression coefficients based only on a single weight vector. Second, in each loop, a step-by-step strategy is implemented to determine the optimal set of variables. Finally, a smoothing operation is added to the weight vector to improve the anti-noise performance of the algorithm. The performance of the MWO-BOSS algorithm was tested on the four spectral datasets corn protein, corn oil, soil, and beer and compared with six high-performance algorithms, namely interval partial least squares (iPLS), Moving Window Partial LeastSquares(MWPLS), competitive adaptive reweighted sampling (CARS), variable combinatorial population analysis (VCPA), VCPA-IRIV and BOSS. The results show that the MWO-BOSS algorithm effectively improves the predictive ability of the model, with MWO-BOSS-Step-S providing the best results among the four tested datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A bootstrapping soft shrinkage approach for variable selection in chemical modeling
    Deng, Bai-Chuan
    Yun, Yong-Huan
    Cao, Dong-Sheng
    Yin, Yu-Long
    Wang, Wei-Ting
    Lu, Hong-Mei
    Luo, Qian-Yi
    Liang, Yi-Zeng
    ANALYTICA CHIMICA ACTA, 2016, 908 : 63 - 74
  • [2] Bootstrapping soft shrinkage variable selection method based on the combination of frequency and regression coefficient
    Zhang F.
    Tang X.
    Tong A.
    Wang B.
    Wang J.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2020, 41 (01): : 64 - 70
  • [3] A modification of the bootstrapping soft shrinkage approach for spectral variable selection in the issue of over-fitting, model accuracy and variable selection credibility
    Yan, Hong
    Song, Xiangzhong
    Tian, Kuangda
    Gao, Jingxian
    Li, Qianqian
    Xiong, Yanmei
    Min, Shungeng
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2019, 210 : 362 - 371
  • [4] A Bootstrapping Soft Shrinkage Approach and Interval Random Variables Selection Hybrid Model for Variable Selection in Near-Infrared Spectroscopy
    Gamal Al-Kaf, Hasan Ali
    Mohammed Alduais, Nayef Abdulwahab
    Saad, Abdul-Malik H. Y.
    Chia, Kim Seng
    Mohsen, Abdulqader M.
    Alhussian, Hitham
    Haidar Mahdi, Ammar Abdo Mohammed
    Wan Salam, Wan Saiful-Islam
    IEEE ACCESS, 2020, 8 : 168036 - 168052
  • [6] A heterogenous network selection algorithm for internet of vehicles based on comprehensive weight
    Jiang, Fuchun
    Feng, Chenwei
    Zhang, Hongyi
    ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (05) : 4677 - 4688
  • [7] A Novel Input Variable Selection and Structure Optimization Algorithm for Multilayer Perceptron-Based Soft Sensors
    Wang, Hongxun
    Sui, Lin
    Zhang, Mengyan
    Zhang, Fangfang
    Ma, Fengying
    Sun, Kai
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [8] Genetic algorithm based incremental learning for optimal weight and classifier selection
    Hulley, Gregory
    Marwala, Tshilidzi
    COMPUTATIONAL MODELS FOR LIFE SCIENCES (CMLS 07), 2007, 952 : 258 - 267
  • [9] Variable Selection Based on Random Vector Functional-link in Soft Sensor Modeling
    Wen, Xiaohong
    Ding, Jie
    Yan, Gaowei
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 1339 - 1343
  • [10] A novel variable selection algorithm for multi-layer perceptron with elastic net
    Zhang, Fangfang
    Sun, Kai
    Wu, Xiuliang
    NEUROCOMPUTING, 2019, 361 : 110 - 118