Novel comprehensive variable selection algorithm based on multi-weight vector optimal selection and bootstrapping soft shrinkage

被引:4
|
作者
Zhang, Pengfei [1 ]
Xu, Zhuopin [1 ,2 ]
Ma, Huimin [3 ]
Cheng, Weimin [1 ,2 ]
Li, Xiaohong [1 ,2 ]
Tang, Liwen [1 ]
Zhao, Guangxia [1 ,2 ]
Wu, Yuejin [1 ,4 ]
Liu, Zan [1 ]
Wang, Qi [1 ,4 ]
机构
[1] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Hefei 230026, Peoples R China
[3] Anhui Agr Univ, Hefei 230036, Peoples R China
[4] CAS Innovat Acad Seed Design, Hainan Branch, Sanya 572025, Peoples R China
基金
中国国家自然科学基金;
关键词
Chemometrics; Variable selection; Near-infrared spectroscopy; PARTIAL LEAST-SQUARES; NEAR-INFRARED SPECTROSCOPY; WAVELENGTH INTERVAL SELECTION; POPULATION ANALYSIS; REGRESSION; CHEMOMETRICS; NIR;
D O I
10.1016/j.infrared.2023.104800
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
The dimensionality of spectral data is increasing with the advancements in spectral technology. Therefore, there is an urgent need to develop high-performance variable selection algorithms for chemometrics applications. This study proposes a novel multi-weight optimal-bootstrap soft shrinkage (MWO-BOSS) method for variable selection based on the bootstrap soft shrinkage (BOSS) algorithm, comprising three effective improvement strategies. First, the optimal weight vector of six weight vectors are used as weights of the selection variables, rather than the absolute value of the regression coefficients based only on a single weight vector. Second, in each loop, a step-by-step strategy is implemented to determine the optimal set of variables. Finally, a smoothing operation is added to the weight vector to improve the anti-noise performance of the algorithm. The performance of the MWO-BOSS algorithm was tested on the four spectral datasets corn protein, corn oil, soil, and beer and compared with six high-performance algorithms, namely interval partial least squares (iPLS), Moving Window Partial LeastSquares(MWPLS), competitive adaptive reweighted sampling (CARS), variable combinatorial population analysis (VCPA), VCPA-IRIV and BOSS. The results show that the MWO-BOSS algorithm effectively improves the predictive ability of the model, with MWO-BOSS-Step-S providing the best results among the four tested datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] A niching behaviour-based algorithm for multi-level manufacturing service composition optimal-selection
    Ding, Tao
    Yan, Guangrong
    Lei, Yi
    Xu, Xiangyu
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (03) : 1177 - 1189
  • [42] A niching behaviour-based algorithm for multi-level manufacturing service composition optimal-selection
    Tao Ding
    Guangrong Yan
    Yi Lei
    Xiangyu Xu
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 1177 - 1189
  • [43] An optimal weight heterogeneous integrated carbon price prediction model based on temporal information extraction and specific comprehensive feature selection
    Wang, Jujie
    Xu, Shulian
    Shu, Shuqin
    ENERGY, 2024, 312
  • [44] Solving service selection problem based on a novel multi-objective artificial bees colony algorithm
    Huang L.
    Zhang B.
    Yuan X.
    Zhang C.
    Gao Y.
    Huang, Liping (huanglp@swc.neu.edu.cn), 1600, Shanghai Jiaotong University (22): : 474 - 480
  • [45] Solving Service Selection Problem Based on a Novel Multi-Objective Artificial Bees Colony Algorithm
    黄利萍
    张斌
    苑勋
    张长胜
    高岩
    Journal of Shanghai Jiaotong University(Science), 2017, 22 (04) : 474 - 480
  • [46] A NOVEL MULTI-CHANNEL SPARSE RECOVERY STAP ALGORITHM FOR SAMPLE SELECTION BASED ON PRIOR KNOWLEDGE
    Kang, Niezipeng
    Zhang, Yun
    Li, Gaopeng
    Ren, Hang
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 4407 - 4410
  • [47] A novel multi-objective evolutionary algorithm with fuzzy logic based adaptive selection of operators: FAME
    Santiago, Alejandro
    Dorronsoro, Bernabe
    Nebro, Antonio J.
    Durillo, Juan J.
    Castillo, Oscar
    Fraire, Hector J.
    INFORMATION SCIENCES, 2019, 471 : 233 - 251
  • [48] Application of Multi-objective and Multiple Fuzzy Comprehensive Optimization Model Based on Entropy Weight in Hydropower Planning Schemes Selection
    Tian Ling
    Lu Jinxi
    CIVIL ENGINEERING IN CHINA - CURRENT PRACTICE AND RESEARCH REPORT, 2010, : 982 - 986
  • [49] A Novel Model for Landslide Displacement Prediction Based on EDR Selection and Multi-Swarm Intelligence Optimization Algorithm
    Zhang, Junrong
    Tang, Huiming
    Tannant, Dwayne D.
    Lin, Chengyuan
    Xia, Ding
    Wang, Yankun
    Wang, Qianyun
    SENSORS, 2021, 21 (24)
  • [50] Integrated parameter inversion analysis method of a CFRD based on multi-output support vector machines and the clonal selection algorithm
    Zheng, Dongjian
    Cheng, Lin
    Bao, Tengfei
    Lv, Beibei
    COMPUTERS AND GEOTECHNICS, 2013, 47 : 68 - 77