Automatic cross-validation in structured models: Is it time to leave out leave-one-out?

被引:7
|
作者
Adin, Aritz [1 ,2 ]
Krainski, Elias Teixeira [1 ,3 ]
Lenzi, Amanda [1 ,4 ]
Liu, Zhedong [1 ,5 ]
Martinez-Minaya, Joaquin [1 ,6 ]
Rue, Havard [1 ,3 ]
机构
[1] Univ Publ Navarra, Campus Arrosadia, Pamplona 31006, Spain
[2] Univ Publ Navarra, Inst Adv Mat & Math InaMat2, Dept Stat Comp Sci & Math, Pamplona, Spain
[3] King Abdullah Univ Sci & Technol KAUST, Stat Program, Comp Elect & Math Sci & Engn Div, Thuwal, Saudi Arabia
[4] Univ Edinburgh, Sch Math, Edinburgh, Scotland
[5] RIKEN Ctr AI Project, Tokyo, Japan
[6] Univ Politecn Valencia, Dept Appl Stat Operat Res & Qual, Valencia, Spain
关键词
Cross-validation; Hierarchical models; INLA; Spatial statistics; COMPOSITIONAL DATA-ANALYSIS; EVOLUTION; JOINT;
D O I
10.1016/j.spasta.2024.100843
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Standard techniques such as leave-one-out cross-validation (LOOCV) might not be suitable for evaluating the predictive performance of models incorporating structured random effects. In such cases, the correlation between the training and test sets could have a notable impact on the model's prediction error. To overcome this issue, an automatic group construction procedure for leave-group-out cross validation (LGOCV) has recently emerged as a valuable tool for enhancing predictive performance measurement in structured models. The purpose of this paper is (i) to compare LOOCV and LGOCV within structured models, emphasizing model selection and predictive performance, and (ii) to provide real data applications in spatial statistics using complex structured models fitted with INLA, showcasing the utility of the automatic LGOCV method. First, we briefly review the key aspects of the recently proposed LGOCV method for automatic group construction in latent Gaussian models. We also demonstrate the effectiveness of this method for selecting the model with the highest predictive performance by simulating extrapolation tasks in both temporal and spatial data analyses. Finally, we provide insights into the effectiveness of the LGOCV method in modeling complex structured data, encompassing spatio-temporal multivariate count data, spatial compositional data, and spatio-temporal geospatial data.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Efficient approximate leave-one-out cross-validation for kernel logistic regression
    Gavin C. Cawley
    Nicola L. C. Talbot
    Machine Learning, 2008, 71 : 243 - 264
  • [22] Ensemble Kalman Filter Regularization Using Leave-One-Out Data Cross-Validation
    Rayo, Lautaro
    Hoteit, Ibrahim
    NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM 2012), VOLS A AND B, 2012, 1479 : 1247 - 1250
  • [23] Design of Incremental Echo State Network Using Leave-One-Out Cross-Validation
    Yang, Cuili
    Zhu, Xinxin
    Ahmad, Zohaib
    Wang, Lei
    Qiao, Junfei
    IEEE ACCESS, 2018, 6 : 74874 - 74884
  • [24] Algorithmic stability and sanity-check bounds for leave-one-out cross-validation
    Kearns, M
    Ron, D
    NEURAL COMPUTATION, 1999, 11 (06) : 1427 - 1453
  • [25] A leave-one-out cross-validation SAS macro for the identification of markers associated with survival
    Rushing, Christel
    Bulusu, Anuradha
    Hurwitz, Herbert I.
    Nixon, Andrew B.
    Pang, Herbert
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 57 : 123 - 129
  • [26] Enhanced Kriging leave-one-out cross-validation in improving model estimation and optimization
    Pang, Yong
    Wang, Yitang
    Lai, Xiaonan
    Zhang, Shuai
    Liang, Pengwei
    Song, Xueguan
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2023, 414
  • [27] Spatial leave-one-out cross-validation for variable selection in the presence of spatial autocorrelation
    Le Rest, Kevin
    Pinaud, David
    Monestiez, Pascal
    Chadoeuf, Joel
    Bretagnolle, Vincent
    GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2014, 23 (07): : 811 - 820
  • [28] EBM PEARL: LEAVE-ONE-OUT (LOO) CROSS VALIDATION
    Hupert, Jordan
    JOURNAL OF PEDIATRICS, 2020, 220 : 264 - 264
  • [29] A scalable estimate of the out-of-sample prediction error via approximate leave-one-out cross-validation
    Rad, Kamiar Rahnama
    Maleki, Arian
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020, 82 (04) : 965 - 996
  • [30] Honest leave-one-out cross-validation for estimating post-tuning generalization error
    Wang, Boxiang
    Zou, Hui
    STAT, 2021, 10 (01):