Improving Model Selection by Employing the Test Data

Authors
Westphal, Max [1]
Brannath, Werner [1]
Affiliation
[1] Univ Bremen, Fac Math & Comp Sci 3, Inst Stat, Bremen, Germany
Keywords
MULTIPLE COMPARISONS; OVER-OPTIMISM; BIOINFORMATICS; INTELLIGENCE; INFERENCE; DESIGN;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Model selection and evaluation are usually strictly separated by means of data splitting to enable an unbiased estimation and a simple statistical inference for the unknown generalization performance of the final prediction model. We investigate the properties of novel evaluation strategies, namely when the final model is selected based on empirical performances on the test data. To guard against selection-induced overoptimism, we employ a parametric multiple test correction based on the approximate multivariate distribution of performance estimates. Our numerical experiments involve training common machine learning algorithms (EN, CART, SVM, XGB) on various artificial classification tasks. At its core, our proposed approach improves model selection in terms of the expected final model performance without introducing overoptimism. We furthermore observed a higher probability for a successful evaluation study, making it easier in practice to empirically demonstrate a sufficiently high predictive performance.
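The kind of correction described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes accuracy as the performance measure, a 0/1 matrix of per-sample correctness on the test set, a multivariate normal approximation of the standardized accuracy estimates, and a Monte Carlo estimate of the max-type critical value. The function name `simultaneous_lower_bounds`, its parameters, and the synthetic data are hypothetical.

```python
from statistics import NormalDist

import numpy as np


def simultaneous_lower_bounds(correct, alpha=0.05, n_sim=20000, seed=0):
    """Select the best model on the test data with a max-type adjustment.

    correct: (n_test, n_models) array, 1 if the model classified the
             test sample correctly, else 0 (assumed input format).
    Returns accuracies, simultaneous lower bounds, index of the selected
    model, and the simulated critical value.
    """
    correct = np.asarray(correct, dtype=float)
    n, m = correct.shape
    acc = correct.mean(axis=0)

    # Estimated covariance of the accuracy estimates (sample covariance / n).
    cov = np.atleast_2d(np.cov(correct, rowvar=False)) / n
    se = np.sqrt(np.diag(cov))

    # Correlation matrix; guard against models with zero variance.
    se_safe = np.where(se > 0, se, 1.0)
    corr = cov / np.outer(se_safe, se_safe)
    np.fill_diagonal(corr, 1.0)

    # Critical value: (1 - alpha) quantile of the maximum of a
    # multivariate normal vector with this correlation, by simulation.
    rng = np.random.default_rng(seed)
    z = rng.multivariate_normal(np.zeros(m), corr, size=n_sim)
    c_alpha = float(np.quantile(z.max(axis=1), 1 - alpha))

    lower = acc - c_alpha * se     # one-sided simultaneous lower bounds
    best = int(np.argmax(acc))     # final model chosen on the test data
    return acc, lower, best, c_alpha


# Synthetic example: five correlated candidate models, 400 test samples.
rng = np.random.default_rng(1)
true_acc = np.array([0.70, 0.75, 0.80, 0.78, 0.72])
n_test = 400
difficulty = rng.normal(size=(n_test, 1))            # shared across models
noise = rng.normal(size=(n_test, true_acc.size))     # model-specific
latent = 0.6 * difficulty + 0.8 * noise              # unit variance
thresholds = np.array([NormalDist().inv_cdf(p) for p in true_acc])
correct = (latent < thresholds).astype(int)

acc, lower, best, c_alpha = simultaneous_lower_bounds(correct)
print("accuracies:      ", np.round(acc, 3))
print("lower bounds:    ", np.round(lower, 3))
print("selected model:  ", best)
print("critical value:  ", round(c_alpha, 3))
```

Because the candidate models are positively correlated, the simulated critical value is typically smaller than a Bonferroni-type value, so the lower bound reported for the selected model is less conservative while still accounting for the selection step.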
Pages: 10