Extended likelihood approach to large-scale multiple testing

Cited by: 16

Authors:

Lee, Youngjo [1]

Bjornstad, Jan F. [2]

Affiliations:

[1] Seoul Natl Univ, Seoul 151, South Korea

[2] Stat Norway, N-0033 Oslo, Norway

Source:

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY | 2013 / Vol. 75 / Issue 03

Funding:

National Research Foundation, Singapore;

Keywords:

Extended likelihood; False discovery rate; Likelihood; Likelihood ratio test; Maximum likelihood; Multiple testing; FALSE DISCOVERY RATE; EMPIRICAL BAYES; MICROARRAYS;

DOI:

10.1111/rssb.12005

Chinese Library Classification (CLC) number:

O21 [Probability theory and mathematical statistics]; C8 [Statistics];

Discipline classification code:

020208 ; 070103 ; 0714 ;

Abstract:

To date, only frequentist, Bayesian and empirical Bayes approaches have been studied for the large-scale inference problem of testing simultaneously hundreds or thousands of hypotheses. Their derivations start with some summarizing statistics without modelling the basic responses. As a consequence testing procedures have been developed without necessarily checking model assumptions, and empirical null distributions are needed to avoid the problem of rejecting all null hypotheses when the sample sizes are large. Nevertheless these procedures may not be statistically efficient. We present the multiple-testing problem as a multiple-prediction problem of whether a null hypothesis is true or not. We introduce hierarchical random-effect models for basic responses and show how the extended likelihood is built. It is shown that the likelihood prediction has a certain oracle property. The extended likelihood leads to new testing procedures, which are optimal for the usual loss function in hypothesis testing. The new tests are based on certain shrinkage t-statistics and control the local probability of false discovery for individual tests to maintain the global frequentist false discovery rate and have no need to consider an empirical null distribution for the shrinkage t-statistics. Conditions are given when these false rates vanish. Three examples illustrate how to use the likelihood method in practice. A numerical study shows that the likelihood approach can greatly improve existing methods and finding the best fitting model is crucial for the behaviour of test procedures.


Pages: 553-575

Number of pages: 23

