Inference for the generalization error

Cited by: 603
Authors
Nadeau, C
Bengio, Y
Affiliations
[1] Hlth Canada, Ottawa, ON K1A 0L2, Canada
[2] Univ Montreal, CIRANO, Montreal, PQ H3C 3J7, Canada
[3] Univ Montreal, Dept IRO, Montreal, PQ H3C 3J7, Canada
Keywords
generalization error; cross-validation; variance estimation; hypothesis tests; size; power
DOI
10.1023/A:1024068626366
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In order to compare learning algorithms, experimental results reported in the machine learning literature often use statistical tests of significance to support the claim that a new learning algorithm generalizes better. Such tests should take into account the variability due to the choice of training set and not only that due to the test examples, as is often the case. This could lead to gross underestimation of the variance of the cross-validation estimator, and to the wrong conclusion that the new algorithm is significantly better when it is not. We perform a theoretical investigation of the variance of a variant of the cross-validation estimator of the generalization error that takes into account the variability due to the randomness of the training set as well as test examples. Our analysis shows that all the variance estimators that are based only on the results of the cross-validation experiment must be biased. This analysis allows us to propose new estimators of this variance. We show, via simulations, that tests of hypothesis about the generalization error using those new variance estimators have better properties than tests involving variance estimators currently in use and listed in Dietterich (1998). In particular, the new tests have correct size and good power. That is, the new tests do not reject the null hypothesis too often when the hypothesis is true, but they tend to frequently reject the null hypothesis when the latter is false.
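To make the abstract's point about underestimated variance concrete, the following is a minimal Python sketch contrasting a naive resampled t statistic with a variance-corrected one. The correction factor `(1/J + n_test/n_train)` does not appear in this record; it is the form commonly cited as the "corrected resampled t-test" associated with this paper and should be treated as an assumption here. The function name, argument names, and sample numbers are hypothetical.

```python
import math
import statistics


def resampled_t_statistics(diffs, n_train, n_test):
    """Return (naive_t, corrected_t) for per-split error differences.

    diffs   : differences in test error between two algorithms, one per
              random train/test split (hypothetical input).
    n_train : number of training examples in each split.
    n_test  : number of test examples in each split.
    """
    J = len(diffs)
    mean_d = statistics.fmean(diffs)
    var_d = statistics.variance(diffs)  # sample variance, J - 1 denominator

    # Naive statistic: treats the J splits as independent, which ignores
    # the overlap between training sets and understates the variance.
    naive_t = mean_d / math.sqrt(var_d / J)

    # Corrected statistic (assumed form): inflates the variance by adding
    # n_test / n_train to 1 / J to account for that overlap.
    corrected_t = mean_d / math.sqrt((1.0 / J + n_test / n_train) * var_d)
    return naive_t, corrected_t


if __name__ == "__main__":
    # Made-up differences, for illustration only.
    diffs = [0.02, 0.01, 0.03, 0.00, 0.02, 0.01, 0.03, 0.02, 0.01, 0.02]
    naive_t, corrected_t = resampled_t_statistics(diffs, n_train=900, n_test=100)
    print(f"naive t = {naive_t:.3f}, corrected t = {corrected_t:.3f}")
```

Under this assumed form, the corrected statistic is always smaller in magnitude than the naive one, so it rejects the null hypothesis less readily; both would be compared against a Student t distribution with J - 1 degrees of freedom.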
Pages: 239-281
Page count: 43