Model averaging in calibration of near-infrared instruments with correlated high-dimensional data

被引:1
|
作者
Salaki, Deiby Tineke [1 ]
Kurnia, Anang [2 ]
Sartono, Bagus [2 ]
Mangku, I. Wayan [3 ]
Gusnanto, Arief [4 ]
机构
[1] Sam Ratulangi Univ, Dept Math, Manado, Indonesia
[2] Bogor Agr Univ, Dept Stat, Bogor, Indonesia
[3] Bogor Agr Univ, Dept Math, Bogor, Indonesia
[4] Univ Leeds, Dept Stat, Leeds LS2 9JT, W Yorkshire, England
关键词
Model averaging; high-dimensional data; multicollinearity; calibration; near-infrared spectroscopy; VARIABLE SELECTION; RIDGE-REGRESSION; LASSO;
D O I
10.1080/02664763.2022.2122947
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Model averaging (MA) is a modelling strategy where the uncertainty in the configuration of selected variables is taken into account by weight-combining each estimate of the so-called 'candidate model'. Some studies have shown that MA enables better prediction, even in high-dimensional cases. However, little is known about the model prediction performance at different types of multicollinearity in high-dimensional data. Motivated by calibration of near-infrared (NIR) instruments,we focus on MA prediction performance in such data. The weighting schemes that we consider are based on the Akaike's information criterion (AIC), Mallows' C-p, and cross-validation. For estimating the model parameters, we consider the standard least squares and the ridge regression methods. The results indicate that MA outperforms model selection methods such as LASSO and SCAD in high-correlation data. The use of Mallows' C-p and cross-validation for the weights tends to yield similar results in all structures of correlation, although the former is generally preferred. We also find that the ridge model averaging outperforms the least-squares model averaging. This research suggests ridge model averaging to build a relatively better prediction of the NIR calibration model.
引用
收藏
页码:279 / 297
页数:19
相关论文
共 50 条
  • [21] STOCHASTIC GAUSSIAN PROCESS MODEL AVERAGING FOR HIGH-DIMENSIONAL INPUTS
    Xuereb, Maxime
    Ng, Szu Hui
    Pedrielli, Giulia
    2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 373 - 384
  • [22] Calibration of high resolution remote sensing instruments in the visible and near infrared
    Schuller, L
    Fischer, J
    Armbruster, W
    Bartsch, B
    CALIBRATION AND INTERCALIBRATION OF SATELLITE SENSORS AND EARLY RESULTS OF RADARSAT, 1997, 19 (09): : 1325 - 1334
  • [23] High-dimensional inference for linear model with correlated errors
    Panxu Yuan
    Xiao Guo
    Metrika, 2022, 85 : 21 - 52
  • [24] High-dimensional inference for linear model with correlated errors
    Yuan, Panxu
    Guo, Xiao
    METRIKA, 2022, 85 (01) : 21 - 52
  • [25] An interpretable deep learning approach for calibration transfer among multiple near-infrared instruments
    Yang, Jie
    Li, Juntao
    Hu, Jie
    Yang, Wenjun
    Zhang, Xiaolei
    Xu, Jinfan
    Zhang, Youchao
    Luo, Xuan
    Ting, K.C.
    Lin, Tao
    Ying, Yibin
    Computers and Electronics in Agriculture, 2022, 192
  • [26] An interpretable deep learning approach for calibration transfer among multiple near-infrared instruments
    Yang, Jie
    Li, Juntao
    Hu, Jie
    Yang, Wenjun
    Zhang, Xiaolei
    Xu, Jinfan
    Zhang, Youchao
    Luo, Xuan
    Ting, K. C.
    Lin, Tao
    Ying, Yibin
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 192
  • [27] Martingale-residual-based greedy model averaging for high-dimensional current status data
    Wang, Chang
    Du, Mingyue
    STATISTICS IN MEDICINE, 2024, 43 (09) : 1726 - 1742
  • [28] Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
    Amalia Annest
    Roger E Bumgarner
    Adrian E Raftery
    Ka Yee Yeung
    BMC Bioinformatics, 10
  • [29] Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
    Annest, Amalia
    Bumgarner, Roger E.
    Raftery, Adrian E.
    Yeung, Ka Yee
    BMC BIOINFORMATICS, 2009, 10
  • [30] Supervised Classification of High-Dimensional Correlated Data: Application to Genomic Data
    Gaye, Aboubacry
    Diongue, Abdou Ka
    Sylla, Seydou Nourou
    Diarra, Maryam
    Diallo, Amadou
    Talla, Cheikh
    Loucoubar, Cheikh
    JOURNAL OF CLASSIFICATION, 2024, 41 (01) : 158 - 169