Model averaging in calibration of near-infrared instruments with correlated high-dimensional data

被引:1
|
作者
Salaki, Deiby Tineke [1 ]
Kurnia, Anang [2 ]
Sartono, Bagus [2 ]
Mangku, I. Wayan [3 ]
Gusnanto, Arief [4 ]
机构
[1] Sam Ratulangi Univ, Dept Math, Manado, Indonesia
[2] Bogor Agr Univ, Dept Stat, Bogor, Indonesia
[3] Bogor Agr Univ, Dept Math, Bogor, Indonesia
[4] Univ Leeds, Dept Stat, Leeds LS2 9JT, W Yorkshire, England
关键词
Model averaging; high-dimensional data; multicollinearity; calibration; near-infrared spectroscopy; VARIABLE SELECTION; RIDGE-REGRESSION; LASSO;
D O I
10.1080/02664763.2022.2122947
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Model averaging (MA) is a modelling strategy where the uncertainty in the configuration of selected variables is taken into account by weight-combining each estimate of the so-called 'candidate model'. Some studies have shown that MA enables better prediction, even in high-dimensional cases. However, little is known about the model prediction performance at different types of multicollinearity in high-dimensional data. Motivated by calibration of near-infrared (NIR) instruments,we focus on MA prediction performance in such data. The weighting schemes that we consider are based on the Akaike's information criterion (AIC), Mallows' C-p, and cross-validation. For estimating the model parameters, we consider the standard least squares and the ridge regression methods. The results indicate that MA outperforms model selection methods such as LASSO and SCAD in high-correlation data. The use of Mallows' C-p and cross-validation for the weights tends to yield similar results in all structures of correlation, although the former is generally preferred. We also find that the ridge model averaging outperforms the least-squares model averaging. This research suggests ridge model averaging to build a relatively better prediction of the NIR calibration model.
引用
收藏
页码:279 / 297
页数:19
相关论文
共 50 条
  • [31] Supervised Classification of High-Dimensional Correlated Data: Application to Genomic Data
    Aboubacry Gaye
    Abdou Ka Diongue
    Seydou Nourou Sylla
    Maryam Diarra
    Amadou Diallo
    Cheikh Talla
    Cheikh Loucoubar
    Journal of Classification, 2024, 41 : 158 - 169
  • [32] Two-group classification with high-dimensional correlated data: A factor model approach
    Pedro Duarte Silva, A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (11) : 2975 - 2990
  • [33] Computer model calibration using high-dimensional output
    Higdon, Dave
    Gattiker, James
    Williams, Brian
    Rightley, Maria
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (482) : 570 - 583
  • [34] Model Selection for High-Dimensional Data
    Owrang, Arash
    Jansson, Magnus
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 606 - 609
  • [35] Redundancy Analysis to Reduce the High-Dimensional Near-Infrared Spectral Information to Improve the Authentication of Olive Oil
    Sanchez-Rodriguez, Maria Isabel
    Sanchez-Lopez, Elena
    Marinas, Alberto
    Caridad, Jose Maria
    Urbano, Francisco Jose
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (19) : 4620 - 4628
  • [36] The Euclid Near-Infrared Calibration Source
    Holmes, Rory
    Bizenberger, Peter
    Krause, Oliver
    Schweitzer, Mario
    Glauser, Adrian M.
    SPACE TELESCOPES AND INSTRUMENTATION 2010: OPTICAL, INFRARED, AND MILLIMETER WAVE, 2010, 7731
  • [37] Wavelength calibration of near-infrared spectra
    Hinkle, KH
    Joyce, RR
    Hedden, A
    Wallace, L
    Engleman, R
    PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF THE PACIFIC, 2001, 113 (783) : 548 - 566
  • [38] Nonlinear calibration for near-infrared spectroscopy
    Dadhe, K
    CHEMICAL ENGINEERING & TECHNOLOGY, 2004, 27 (09) : 946 - 950
  • [39] Adaptive and reversed penalty for analysis of high-dimensional correlated data
    Yang, Yuehan
    Yang, Hu
    APPLIED MATHEMATICAL MODELLING, 2021, 92 : 63 - 77
  • [40] Online Variational Bayes Inference for High-Dimensional Correlated Data
    Kabisa, Sylvie
    Dunson, David B.
    Morris, Jeffrey S.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2016, 25 (02) : 426 - 444