Heteroscedasticity-Adjusted Ranking and Thresholding for Large-Scale Multiple Testing

被引：2

作者：

Fu, Luella ^{[1
]}

Gang, Bowen ^{[2
]}

James, Gareth M. ^{[3
]}

Sun, Wenguang ^{[3
]}

机构：

[1] San Francisco State Univ, Dept Math, San Francisco, CA 94132 USA

[2] Fudan Univ, Dept Stat, Shanghai, Peoples R China

[3] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA

来源：

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 2022年 / 117卷 / 538期

关键词：

Covariate-assisted inference; Data processing and information loss; False discovery rate; Heteroscedasticity; Multiple testing with side information; Structured multiple testing; FALSE-DISCOVERY RATE; GENE-EXPRESSION; EMPIRICAL BAYES; POWER; HYPOTHESES; NULL; MICROARRAYS;

D O I：

10.1080/01621459.2020.1840992

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Standardization has been a widely adopted practice in multiple testing, for it takes into account the variability in sampling and makes the test statistics comparable across different study units. However, despite conventional wisdom to the contrary, we show that there can be a significant loss in information from basing hypothesis tests on standardized statistics rather than the full data. We develop a new class of heteroscedasticity-adjusted ranking and thresholding (HART) rules that aim to improve existing methods by simultaneously exploiting commonalities and adjusting heterogeneities among the study units. The main idea of HART is to bypass standardization by directly incorporating both the summary statistic and its variance into the testing procedure. A key message is that the variance structure of the alternative distribution, which is subsumed under standardized statistics, is highly informative and can be exploited to achieve higher power. The proposed HART procedure is shown to be asymptotically valid and optimal for false discovery rate (FDR) control. Our simulation results demonstrate that HART achieves substantial power gain over existing methods at the same FDR level. We illustrate the implementation through a microarray analysis of myeloma.

引用

页码：1028 / 1040

页数：13

共 50 条

[1] Large-Scale Multiple Testing of Correlations
Cai, T. Tony
Liu, Weidong
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) : 229 - 240
[2] Large-scale multiple testing under dependence
Sun, Wenguang
Cai, T. Tony
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 393 - 424
[3] Extended likelihood approach to large-scale multiple testing
Lee, Youngjo
Bjornstad, Jan F.
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (03) : 553 - 575
[4] A Robust Method for Large-Scale Multiple Hypotheses Testing
Han, Seungbong
Andrei, Adin-Cristian
Tsui, Kam-Wah
BIOMETRICAL JOURNAL, 2010, 52 (02) : 222 - 232
[5] Large-scale testing
Weich, Imke
Lorenz, Jan
Fischl, Andreas
Rodic, Slobodan
Buschner, Josef
STAHLBAU, 2012, 81 (03) : 203 - 211
[6] USER TESTING OF A LARGE-SCALE RETRIEVAL-SYSTEM USING STATISTICAL RANKING
HARMAN, D
PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1990, 27 : 337 - 337
[7] A nonparametric empirical Bayes framework for large-scale multiple testing
Martin, Ryan
Tokdar, Surya T.
BIOSTATISTICS, 2012, 13 (03) : 427 - 439
[8] On Large-Scale Multiple Testing Over Networks: An Asymptotic Approach
Pournaderi, Mehrdad
Xiang, Yu
IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 442 - 457
[9] A spatially adaptive large-scale multiple-testing procedure
Han, Yixin
Wang, Yunlong
Wang, Zhaojun
STAT, 2023, 12 (01):
[10] A general approach to account for dependence in large-scale multiple testing
Friguet, Chloe
JOURNAL OF THE SFDS, 2012, 153 (02): : 100 - 122

← 1 2 3 4 5 →