Automatic cross-validation in structured models: Is it time to leave out leave-one-out?

Cited by: 7
Authors
Adin, Aritz [1, 2]
Krainski, Elias Teixeira [1, 3]
Lenzi, Amanda [1, 4]
Liu, Zhedong [1, 5]
Martinez-Minaya, Joaquin [1, 6]
Rue, Havard [1, 3]
Affiliations
[1] Univ Publ Navarra, Campus Arrosadia, Pamplona 31006, Spain
[2] Univ Publ Navarra, Inst Adv Mat & Math InaMat2, Dept Stat Comp Sci & Math, Pamplona, Spain
[3] King Abdullah Univ Sci & Technol KAUST, Stat Program, Comp Elect & Math Sci & Engn Div, Thuwal, Saudi Arabia
[4] Univ Edinburgh, Sch Math, Edinburgh, Scotland
[5] RIKEN Ctr AI Project, Tokyo, Japan
[6] Univ Politecn Valencia, Dept Appl Stat Operat Res & Qual, Valencia, Spain
Keywords
Cross-validation; Hierarchical models; INLA; Spatial statistics; Compositional data analysis; Evolution; Joint
DOI
10.1016/j.spasta.2024.100843
Chinese Library Classification number
P [Astronomy, Earth Sciences]
Discipline classification code
07
Abstract
Standard techniques such as leave-one-out cross-validation (LOOCV) might not be suitable for evaluating the predictive performance of models incorporating structured random effects. In such cases, the correlation between the training and test sets could have a notable impact on the model's prediction error. To overcome this issue, an automatic group construction procedure for leave-group-out cross-validation (LGOCV) has recently emerged as a valuable tool for enhancing predictive performance measurement in structured models. The purpose of this paper is (i) to compare LOOCV and LGOCV within structured models, emphasizing model selection and predictive performance, and (ii) to provide real data applications in spatial statistics using complex structured models fitted with INLA, showcasing the utility of the automatic LGOCV method. First, we briefly review the key aspects of the recently proposed LGOCV method for automatic group construction in latent Gaussian models. We also demonstrate the effectiveness of this method for selecting the model with the highest predictive performance by simulating extrapolation tasks in both temporal and spatial data analyses. Finally, we provide insights into the effectiveness of the LGOCV method in modeling complex structured data, encompassing spatio-temporal multivariate count data, spatial compositional data, and spatio-temporal geospatial data.
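The contrast the abstract draws between LOOCV and LGOCV can be illustrated with a small sketch. This is not the INLA implementation or the authors' group construction procedure; it is a toy example under stated assumptions: a zero-mean Gaussian process with a known AR(1) correlation matrix, and a hypothetical rule that places in the group of point i every point whose correlation with i is at least a chosen threshold. The function names (ar1_correlation, build_groups, predictive_variance) and the values of n, rho, and threshold are all illustrative. The sketch compares the predictive variance at one point when only that point is left out with the variance when its whole correlated group is left out.

```python
import numpy as np


def ar1_correlation(n, rho):
    """Correlation matrix of a stationary AR(1) process: rho^|i-j|."""
    idx = np.arange(n)
    return rho ** np.abs(idx[:, None] - idx[None, :])


def build_groups(corr, threshold):
    """Hypothetical grouping rule: the group of i holds every j whose
    absolute correlation with i is at least `threshold` (i is always in it)."""
    return [np.flatnonzero(np.abs(row) >= threshold) for row in corr]


def predictive_variance(cov, i, left_out):
    """Variance of point i conditional on all points not in `left_out`,
    for a zero-mean Gaussian vector with covariance `cov`."""
    keep = np.setdiff1d(np.arange(cov.shape[0]), left_out)
    c = cov[i, keep]
    return cov[i, i] - c @ np.linalg.solve(cov[np.ix_(keep, keep)], c)


n, rho = 50, 0.9          # illustrative values, not from the paper
cov = ar1_correlation(n, rho)
groups = build_groups(cov, threshold=0.7)

i = n // 2                # an interior test point
var_loo = predictive_variance(cov, i, np.array([i]))   # leave-one-out
var_lgo = predictive_variance(cov, i, groups[i])       # leave-group-out

print("group of point", i, ":", groups[i])
print("LOO predictive variance:", var_loo)
print("LGO predictive variance:", var_lgo)
```

In this toy setting the leave-one-out variance is small because the immediate neighbours of the test point remain in the training set, while leaving out the correlated group forces a prediction from more distant points, which is closer to an extrapolation task.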
Pages: 17