Improving Model Selection by Employing the Test Data

Cited by: 0
Authors
Westphal, Max [1]
Brannath, Werner [1]
Affiliations
[1] Univ Bremen, Fac Math & Comp Sci 3, Inst Stat, Bremen, Germany
Keywords
MULTIPLE COMPARISONS; OVER-OPTIMISM; BIOINFORMATICS; INTELLIGENCE; INFERENCE; DESIGN;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Model selection and evaluation are usually strictly separated by data splitting to enable unbiased estimation of, and simple statistical inference for, the unknown generalization performance of the final prediction model. We investigate the properties of novel evaluation strategies in which the final model is selected based on empirical performances on the test data. To guard against selection-induced overoptimism, we employ a parametric multiple test correction based on the approximate multivariate distribution of performance estimates. Our numerical experiments involve training common machine learning algorithms (EN, CART, SVM, XGB) on various artificial classification tasks. At its core, our proposed approach improves model selection in terms of the expected final model performance without introducing overoptimism. We furthermore observed a higher probability of a successful evaluation study, making it easier in practice to empirically demonstrate a sufficiently high predictive performance.
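The idea in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes accuracy as the performance measure, a multivariate normal approximation for the vector of test accuracies, and a Monte Carlo estimate of the critical value (a "maxT"-style correction). The function name `select_with_maxt_bound` and all parameter names are hypothetical.

```python
import numpy as np

def select_with_maxt_bound(correct, alpha=0.05, n_sim=20000, seed=0):
    """Pick the best model on the test set and return simultaneous lower bounds.

    correct: (n_test, m) 0/1 array; entry [i, j] is 1 if candidate model j
    classifies test case i correctly (hypothetical input format).
    """
    correct = np.asarray(correct, dtype=float)
    n, m = correct.shape
    acc = correct.mean(axis=0)                       # test accuracy per model

    # Covariance of per-case correctness across models (same test cases).
    cov = np.atleast_2d(np.cov(correct, rowvar=False))
    sd = np.sqrt(np.diag(cov))
    se = sd / np.sqrt(n)                             # standard error per model

    # Correlation matrix of the estimates; guard against zero variance.
    safe_sd = np.where(sd > 0, sd, 1.0)
    corr = cov / np.outer(safe_sd, safe_sd)
    np.fill_diagonal(corr, 1.0)

    # Monte Carlo (1 - alpha) quantile of the maximum of correlated
    # standard normals: the one-sided critical value.
    rng = np.random.default_rng(seed)
    z = rng.multivariate_normal(np.zeros(m), corr, size=n_sim)
    crit = float(np.quantile(z.max(axis=1), 1 - alpha))

    lower = acc - crit * se                          # simultaneous lower bounds
    best = int(np.argmax(acc))                       # selection on test data
    return best, acc, lower, crit

# Simulated example: four candidate models, 400 test cases (made-up data).
rng = np.random.default_rng(1)
true_acc = np.array([0.70, 0.75, 0.80, 0.78])
correct = (rng.random((400, 4)) < true_acc).astype(int)
best, acc, lower, crit = select_with_maxt_bound(correct)
print(f"selected model {best}: accuracy {acc[best]:.3f}, "
      f"corrected lower bound {lower[best]:.3f}")
```

Because the critical value accounts for the correlation between candidate models, the bounds are typically less conservative than a Bonferroni correction when the models make similar errors. With a single candidate the critical value reduces to the usual one-sided normal quantile.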
Pages: 10
Related papers
50 in total
  • [31] Fusion of Feature Selection Methods for Improving Model Accuracy in the Milling Process Data Classification Problem
    Kusy, Maciej
    Zajdel, Roman
    Kluska, Jacek
    Zabinski, Tomasz
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [32] Uncertainty analysis of constant amplitude fatigue test data employing the six parameters random fatigue limit model
    Leonetti, Davide
    Maljaars, Johan
    Snijder, H. H.
    12TH INTERNATIONAL FATIGUE CONGRESS (FATIGUE 2018), 2018, 165
  • [33] Improving test case selection by handling class and attribute noise
    Al-Sabbagh, Khaled Walid
    Staron, Miroslaw
    Hebig, Regina
    JOURNAL OF SYSTEMS AND SOFTWARE, 2022, 183
  • [34] Model selection with overdispersed data
    Fitzmaurice, GM
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1997, 46 (01) : 81 - 91
  • [35] Longitudinal data model selection
    Azari, Rahman
    Li, Lexin
    Tsai, Chih-Ling
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (11) : 3053 - 3066
  • [36] Test case selection for improving the effectiveness of software fault localization
    Wang, Kechao
    Wang, Tiantian
    Su, Xiaohong
    Ma, Peijun
    Tong, Zhixiang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2014, 51 (04) : 865 - 873
  • [37] Data-Driven Test Selection at Scale
    Mehta, Sonu
    Farmahinifarahani, Farima
    Bhagwan, Ranjita
    Guptha, Suraj
    Jafari, Sina
    Kumar, Rahul
    Saini, Vaibhav
    Santhiar, Anirudh
    PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 1225 - 1235
  • [38] ON THE SELECTION OF TEST DATA FOR RECURSIVE MATHEMATICAL SUBROUTINES
    ROWLAND, JH
    DAVIS, PJ
    SIAM JOURNAL ON COMPUTING, 1981, 10 (01) : 59 - 72
  • [39] A PAIRED TEST FOR RECOGNIZER SELECTION WITH UNTRANSCRIBED DATA
    Raj, Bhiksha
    Singh, Rita
    Baker, James
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5676 - 5679
  • [40] A model attitude control and measurement technique for improving quality of wind tunnel dynamic test data
    Liu Z.
    Sun H.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2016, 37 (08) : 2426 - 2435