Improving Model Selection by Employing the Test Data

Cited by: 0
Authors
Westphal, Max [1 ]
Brannath, Werner [1 ]
Affiliations
[1] Univ Bremen, Fac Math & Comp Sci 3, Inst Stat, Bremen, Germany
Source
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97 | 2019 / Vol. 97
Keywords
MULTIPLE COMPARISONS; OVER-OPTIMISM; BIOINFORMATICS; INTELLIGENCE; INFERENCE; DESIGN
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Model selection and evaluation are usually strictly separated by means of data splitting, to enable unbiased estimation and simple statistical inference for the unknown generalization performance of the final prediction model. We investigate the properties of novel evaluation strategies in which the final model is selected based on its empirical performance on the test data. To guard against selection-induced over-optimism, we employ a parametric multiple test correction based on the approximate multivariate distribution of the performance estimates. Our numerical experiments involve training common machine learning algorithms (EN, CART, SVM, XGB) on various artificial classification tasks. At its core, our proposed approach improves model selection in terms of the expected performance of the final model without introducing over-optimism. We furthermore observed a higher probability of a successful evaluation study, which makes it easier in practice to empirically demonstrate a sufficiently high predictive performance.
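
As a rough illustration of the kind of correction the abstract describes, the Python sketch below computes a maxT-type simultaneous lower confidence bound for the test accuracies of several candidate models evaluated on the same test set. The simulated data, model count, significance level, and Monte Carlo approximation of the multivariate-normal quantile are all assumptions for illustration; this is not the authors' implementation.

# Illustrative sketch (not the authors' code): maxT-type multiplicity
# adjustment for test-set accuracies of M candidate models, using a
# multivariate-normal approximation of the joint distribution of the
# accuracy estimates. All data below are simulated (assumption).
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
n, M, alpha = 500, 5, 0.05               # test size, models, level (assumed)

# Hypothetical correctness indicators C[i, m] = 1 if model m classifies
# test example i correctly; columns correlated via a shared latent factor.
true_acc = np.linspace(0.78, 0.85, M)
z = rng.normal(size=(n, 1))
latent = 0.6 * z + 0.8 * rng.normal(size=(n, M))   # standard normal marginals
C = (latent < norm.ppf(true_acc)).astype(float)

acc_hat = C.mean(axis=0)                 # empirical test accuracies
Sigma = np.cov(C, rowvar=False) / n      # covariance of the accuracy estimates
se = np.sqrt(np.diag(Sigma))
R = Sigma / np.outer(se, se)             # correlation matrix of the estimates

# maxT critical value: (1 - alpha)-quantile of max_j Z_j with Z ~ N(0, R),
# approximated here by Monte Carlo simulation.
Z = rng.multivariate_normal(np.zeros(M), R, size=100_000)
c = np.quantile(Z.max(axis=1), 1 - alpha)

lower = acc_hat - c * se                 # simultaneous lower confidence bounds
best = int(np.argmax(acc_hat))           # select the best model on test data
print(f"selected model {best}: acc = {acc_hat[best]:.3f}, "
      f"adjusted lower bound = {lower[best]:.3f}")

Selecting the empirically best model and then reporting the multiplicity-adjusted lower bound for it is what guards against the selection-induced over-optimism discussed in the abstract.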
Pages: 10