Model selection with bootstrap validation

被引:1
|
作者
Savvides, Rafael [1 ]
Makela, Jarmo [1 ]
Puolamaki, Kai [1 ,2 ]
机构
[1] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
[2] Univ Helsinki, Inst Atmospher & Earth Syst Res, Helsinki, Finland
基金
芬兰科学院;
关键词
bootstrap; model selection; CROSS-VALIDATION;
D O I
10.1002/sam.11606
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model selection is one of the most central tasks in supervised learning. Validation set methods are the standard way to accomplish this task: models are trained on training data, and the model with the smallest loss on the validation data is selected. However, it is generally not obvious how much validation data is required to make a reliable selection, which is essential when labeled data are scarce or expensive. We propose a bootstrap-based algorithm, bootstrap validation (BSV), that uses the bootstrap to adjust the validation set size and to find the best-performing model within a tolerance parameter specified by the user. We find that BSV works well in practice and can be used as a drop-in replacement for validation set methods or k-fold cross-validation. The main advantage of BSV is that less validation data is typically needed, so more data can be used to train the model, resulting in better approximations and efficient use of validation data.
引用
收藏
页码:162 / 186
页数:25
相关论文
共 50 条
  • [21] Bootstrap for model selection: Linear approximation of the optimism
    Simon, G
    Lendasse, A
    Verleysen, M
    COMPUTATIONAL METHODS IN NEURAL MODELING, PT 1, 2003, 2686 : 182 - 189
  • [22] SETAR model selection-A bootstrap approach
    John Öhrvik
    Gabriella Schoier
    Computational Statistics, 2005, 20 : 559 - 573
  • [23] A bootstrap method for NARMAX model order selection
    Kukreja, SL
    Kearney, RE
    Galiana, HL
    MODELLING AND CONTROL IN BIOMEDICAL SYSTEMS 2000, 2000, : 329 - 332
  • [24] Fast bootstrap methodology for regression model selection
    Lendasse, A
    Simon, G
    Wertz, V
    Verleysen, M
    NEUROCOMPUTING, 2005, 64 : 161 - 181
  • [25] Bootstrap model selection for polynomial phase signals
    Zoubir, AM
    Iskander, DR
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 2229 - 2232
  • [26] Bootstrap-after-Bootstrap Model Averaging for Reducing Model Uncertainty in Model Selection for Air Pollution Mortality Studies
    Roberts, Steven
    Martin, Michael A.
    ENVIRONMENTAL HEALTH PERSPECTIVES, 2010, 118 (01) : 131 - 136
  • [27] Bootstrap and backward elimination based approaches for model selection
    el-sallam, AAA
    Kayhan, S
    Zoubir, AM
    ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 152 - 157
  • [28] Model selection in linear regression using paired bootstrap
    Rabbi, Fazli
    Khan, Salahuddin
    Khalil, Alamgir
    Mashwani, Wali Khan
    Shafiq, Muhammad
    Goktas, Pinar
    Unvan, Yuksel Akay
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (07) : 1629 - 1639
  • [29] Bootstrap-based Selection for Instrumental Variables Model
    Wang, Wenjie
    Liu, Qingfeng
    ECONOMICS BULLETIN, 2015, 35 (03): : 1886 - +
  • [30] Balanced bootstrap resampling method for neural model selection
    Hung, Wen-Liang
    Lee, E. Stanley
    Chuang, Shun-Chin
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2011, 62 (12) : 4576 - 4581