Model averaging in calibration of near-infrared instruments with correlated high-dimensional data

Cited by: 1
Authors
Salaki, Deiby Tineke [1]
Kurnia, Anang [2]
Sartono, Bagus [2]
Mangku, I. Wayan [3]
Gusnanto, Arief [4]
Affiliations
[1] Sam Ratulangi Univ, Dept Math, Manado, Indonesia
[2] Bogor Agr Univ, Dept Stat, Bogor, Indonesia
[3] Bogor Agr Univ, Dept Math, Bogor, Indonesia
[4] Univ Leeds, Dept Stat, Leeds LS2 9JT, W Yorkshire, England
Keywords
Model averaging; high-dimensional data; multicollinearity; calibration; near-infrared spectroscopy; variable selection; ridge regression; LASSO
DOI
10.1080/02664763.2022.2122947
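Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Subject classification code
020208; 070103; 0714
Abstract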
Model averaging (MA) is a modelling strategy in which the uncertainty in the configuration of selected variables is taken into account by combining, with weights, the estimates from a set of so-called 'candidate models'. Some studies have shown that MA enables better prediction, even in high-dimensional cases. However, little is known about its prediction performance under different types of multicollinearity in high-dimensional data. Motivated by the calibration of near-infrared (NIR) instruments, we focus on MA prediction performance in such data. The weighting schemes that we consider are based on Akaike's information criterion (AIC), Mallows' C_p, and cross-validation. For estimating the model parameters, we consider the standard least squares and the ridge regression methods. The results indicate that MA outperforms model selection methods such as LASSO and SCAD in high-correlation data. The use of Mallows' C_p and cross-validation for the weights tends to yield similar results in all structures of correlation, although the former is generally preferred. We also find that ridge model averaging outperforms least-squares model averaging. These results suggest using ridge model averaging to build NIR calibration models with better predictive performance.
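As an illustration of the idea described in the abstract, the following is a minimal sketch of AIC-weighted model averaging over ridge-fitted candidate models. It is not the authors' implementation: the function names, the candidate subsets (consecutive blocks of five predictors), the ridge penalty `lam=1.0`, the synthetic data, and the use of the raw number of predictors as the AIC parameter count are all assumptions made for this example.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge coefficients; lam = 0 reduces to least squares."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

def aic_weights(aic):
    """Weights proportional to exp(-AIC_k / 2), normalised to sum to one."""
    delta = aic - aic.min()  # shift by the minimum for numerical stability
    w = np.exp(-0.5 * delta)
    return w / w.sum()

def model_average_predict(X, y, X_new, subsets, lam=1.0):
    """Fit one ridge model per candidate subset and average the predictions."""
    n = len(y)
    preds, aic = [], []
    for cols in subsets:
        beta = ridge_fit(X[:, cols], y, lam)
        rss = np.sum((y - X[:, cols] @ beta) ** 2)
        # Gaussian AIC up to an additive constant. Assumption: the parameter
        # count is len(cols), ignoring the effective degrees of freedom of ridge.
        aic.append(n * np.log(rss / n) + 2 * len(cols))
        preds.append(X_new[:, cols] @ beta)
    w = aic_weights(np.array(aic))
    return w @ np.array(preds), w

# Synthetic high-dimensional data (p > n) with strongly correlated predictors,
# generated from one shared latent factor. Purely illustrative.
rng = np.random.default_rng(0)
n, p = 40, 100
z = rng.normal(size=(n, 1))
X = 0.9 * z + 0.1 * rng.normal(size=(n, p))
X -= X.mean(axis=0)  # centre so that no intercept is needed
y = X[:, :5].sum(axis=1) + 0.1 * rng.normal(size=n)
y -= y.mean()

# Hypothetical candidate models: consecutive blocks of five predictors.
subsets = [list(range(k, k + 5)) for k in range(0, 50, 5)]
y_hat, w = model_average_predict(X, y, X, subsets)
print("weights:", np.round(w, 3))
```

Replacing `aic_weights` with weights derived from Mallows' C_p or cross-validation would give the other schemes the abstract mentions; setting `lam=0` gives least-squares model averaging for comparison.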
Pages: 279-297
Page count: 19
Related papers
50 in total
  • [11] Calibration of the empirical likelihood for high-dimensional data
    Liu, Yukun
    Zou, Changliang
    Wang, Zhaojun
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2013, 65 (03) : 529 - 550
  • [13] Jackknife model averaging for high-dimensional quantile regression
    Wang, Miaomiao
    Zhang, Xinyu
    Wan, Alan T. K.
    You, Kang
    Zou, Guohua
    BIOMETRICS, 2023, 79 (01) : 178 - 189
  • [14] A Model-Averaging Approach for High-Dimensional Regression
    Ando, Tomohiro
    Li, Ker-Chau
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (505) : 254 - 265
  • [15] Towards Correlated Data Trading for High-Dimensional Private Data
    Cai, Hui
    Yang, Yuanyuan
    Fan, Weibei
    Xiao, Fu
    Zhu, Yanmin
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (03) : 1047 - 1059
  • [16] Standardization of near-infrared spectrometric instruments
    Bouveresse, E
    Hartmann, C
    Massart, DL
    Last, IR
    Prebble, KA
    ANALYTICAL CHEMISTRY, 1996, 68 (06) : 982 - 990
  • [17] A COMPOSITE LIKELIHOOD APPROACH TO COMPUTER MODEL CALIBRATION WITH HIGH-DIMENSIONAL SPATIAL DATA
    Chang, Won
    Haran, Murali
    Olson, Roman
    Keller, Klaus
    STATISTICA SINICA, 2015, 25 (01) : 243 - 259
  • [19] Application of Fourier transform to multivariate calibration of near-infrared data
    Pasti, L
    Jouan-Rimbaud, D
    Massart, DL
    de Noord, OE
    ANALYTICA CHIMICA ACTA, 1998, 364 (1-3) : 253 - 263
  • [20] Optimal model averaging forecasting in high-dimensional survival analysis
    Yan, Xiaodong
    Wang, Hongni
    Wang, Wei
    Xie, Jinhan
    Ren, Yanyan
    Wang, Xinjun
    INTERNATIONAL JOURNAL OF FORECASTING, 2021, 37 (03) : 1147 - 1155