A Multiple Test Correction for Streams and Cascades of Statistical Hypothesis Tests

被引:13
|
作者
Webb, Geoffrey I. [1 ]
Petitjean, Francois [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Clayton, Vic, Australia
基金
澳大利亚研究理事会;
关键词
Hypothesis testing; multiple testing; model selection; INFORMATION; WOMEN;
D O I
10.1145/2939672.2939775
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Statistical hypothesis testing is a popular and powerful tool for inferring knowledge from data. For every such test performed, there is always a non-zero probability of making a false discovery, i.e. rejecting a null hypothesis in error. Familywise error rate (FWER) is the probability of making at least one false discovery during an inference process. The expected FWER grows exponentially with the number of hypothesis tests that are performed, almost guaranteeing that an error will be committed if the number of tests is big enough and the risk is not managed; a problem known as the multiple testing problem. State-of-the-art methods for controlling FWER in multiple comparison settings require that the set of hypotheses be predetermined. This greatly hinders statistical testing for many modern applications of statistical inference, such as model selection, because neither the set of hypotheses that will be tested, nor even the number of hypotheses, can be known in advance. This paper introduces Subfamilywise Multiple Testing, a multiple-testing correction that can be used in applications for which there are repeated pools of null hypotheses from each of which a single null hypothesis is to be rejected and neither the specific hypotheses nor their number are known until the final rejection decision is completed. To demonstrate the importance and relevance of this work to current machine learning problems, we further refine the theory to the problem of model selection and show how to use Subfamilywise Multiple Testing for learning graphical models. We assess its ability to discover graphical models on more than 7,000 datasets, studying the ability of Subfamilywise Multiple Testing to outperform the state of the art on data with varying size and dimensionality, as well as with varying density and power of the present correlations. Subfamilywise Multiple Testing provides a significant improvement in statistical efficiency, often requiring only half as much data to discover the same model, while strictly controlling FWER.
引用
收藏
页码:1255 / 1264
页数:10
相关论文
共 50 条
  • [21] STATISTICAL TESTS OF THE NONREVERSIBILITY OF AGRICULTURAL SUPPLY HYPOTHESIS
    WOODS, M
    TWEETEN, L
    RAY, DE
    PARVIN, G
    AMERICAN JOURNAL OF AGRICULTURAL ECONOMICS, 1980, 62 (05) : 1110 - 1110
  • [22] STATISTICAL HYPOTHESIS TESTS OF SOME MICROMETEOROLOGICAL OBSERVATIONS
    SETHURAMAN, S
    TICHLER, J
    JOURNAL OF APPLIED METEOROLOGY, 1977, 16 (05): : 455 - 461
  • [23] Statistics for Clinicians -: 4:: Basic concepts of statistical reasoning:: Hypothesis tests and the t-test
    Carlin, JB
    Doyle, LW
    JOURNAL OF PAEDIATRICS AND CHILD HEALTH, 2001, 37 (01) : 72 - 77
  • [24] DECISION ON STATISTICAL AND SCIENTIFIC HYPOTHESES - PROBLEMS WITH THE MULTIPLE SIGNIFICANCE TEST FOR EVALUATING A SCIENTIFIC HYPOTHESIS
    HAGER, W
    WESTERMANN, R
    ZEITSCHRIFT FUR SOZIALPSYCHOLOGIE, 1983, 14 (02): : 106 - 117
  • [25] Judicious use of multiple hypothesis tests
    Roback, PJ
    Askins, RA
    CONSERVATION BIOLOGY, 2005, 19 (01) : 261 - 267
  • [26] Asymptotic hypothesis test to compare likelihood ratios of multiple diagnostic tests in unpaired designs
    Luts, Jan
    Roldan Nofuentes, Jose Antonio
    de Dios Luna del Castillo, Juan
    Van Huffel, Sabine
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2011, 141 (11) : 3578 - 3594
  • [27] OPTIMAL SEQUENTIAL MULTIPLE HYPOTHESIS TESTS
    Novikov, Andrey
    KYBERNETIKA, 2009, 45 (02) : 309 - 330
  • [28] An establishment level test of the statistical discrimination hypothesis
    Tomaskovic-Devey, D
    Skaggs, S
    WORK AND OCCUPATIONS, 1999, 26 (04) : 422 - 445
  • [29] Robust statistical methods for exclusive hypothesis test
    Li, Meng
    Sun, Jianguo
    Tong, Xingwei
    STATISTICS AND ITS INTERFACE, 2025, 18 (01) : 81 - 92
  • [30] Object Category Detection by Statistical Test of Hypothesis
    Sharma, Gaurav
    Chaudhury, Santanu
    Srivastava, J. B.
    SIXTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS & IMAGE PROCESSING ICVGIP 2008, 2008, : 474 - +