Heteroscedasticity-Adjusted Ranking and Thresholding for Large-Scale Multiple Testing

被引:2
|
作者
Fu, Luella [1 ]
Gang, Bowen [2 ]
James, Gareth M. [3 ]
Sun, Wenguang [3 ]
机构
[1] San Francisco State Univ, Dept Math, San Francisco, CA 94132 USA
[2] Fudan Univ, Dept Stat, Shanghai, Peoples R China
[3] Univ Southern Calif, Dept Data Sci & Operat, Los Angeles, CA 90089 USA
关键词
Covariate-assisted inference; Data processing and information loss; False discovery rate; Heteroscedasticity; Multiple testing with side information; Structured multiple testing; FALSE-DISCOVERY RATE; GENE-EXPRESSION; EMPIRICAL BAYES; POWER; HYPOTHESES; NULL; MICROARRAYS;
D O I
10.1080/01621459.2020.1840992
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Standardization has been a widely adopted practice in multiple testing, for it takes into account the variability in sampling and makes the test statistics comparable across different study units. However, despite conventional wisdom to the contrary, we show that there can be a significant loss in information from basing hypothesis tests on standardized statistics rather than the full data. We develop a new class of heteroscedasticity-adjusted ranking and thresholding (HART) rules that aim to improve existing methods by simultaneously exploiting commonalities and adjusting heterogeneities among the study units. The main idea of HART is to bypass standardization by directly incorporating both the summary statistic and its variance into the testing procedure. A key message is that the variance structure of the alternative distribution, which is subsumed under standardized statistics, is highly informative and can be exploited to achieve higher power. The proposed HART procedure is shown to be asymptotically valid and optimal for false discovery rate (FDR) control. Our simulation results demonstrate that HART achieves substantial power gain over existing methods at the same FDR level. We illustrate the implementation through a microarray analysis of myeloma.
引用
收藏
页码:1028 / 1040
页数:13
相关论文
共 50 条
  • [31] A Novel Ranking Model for a Large-Scale Scientific Publication
    Bong-Soo Sohn
    Jai E. Jung
    Mobile Networks and Applications, 2015, 20 : 508 - 520
  • [32] Outlier Ranking for Large-Scale Public Health Data
    Joshi, Ananya
    Townes, Tina
    Gormley, Nolan
    Neureiter, Luke
    Rosenfeld, Roni
    Wilder, Bryan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22176 - 22184
  • [33] T2Ranking: A Large-scale Chinese Benchmark for Passage Ranking
    Xie, Xiaohui
    Dong, Qian
    Wang, Bingning
    Lv, Feiyang
    Yao, Ting
    Gan, Weinan
    Wu, Zhijing
    Li, Xiangsheng
    Li, Haitao
    Liu, Yiqun
    Ma, Jin
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2681 - 2690
  • [34] Testing large-scale cloud management
    Citron, D.
    Zlotnick, A.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2011, 55 (06)
  • [35] SOME OBSERVATIONS ON LARGE-SCALE TESTING
    Bergen, Garret L.
    JOURNAL OF APPLIED PSYCHOLOGY, 1936, 20 (02) : 249 - 257
  • [36] LARGE-SCALE TESTING OF ACETOLACTATE SYNTHASE
    EHRAT, MC
    MOSINGER, E
    FELIX, HR
    PROSPECTS FOR AMINO ACID BIOSYNTHESIS INHIBITORS IN CROP PROTECTION AND PHARMACEUTICAL CHEMISTRY, 1989, 42 : 207 - 209
  • [37] A METHOD FOR LARGE-SCALE TESTING FOR PYROGENS
    KUNA, S
    EDISON, AO
    BUTZ, C
    JOURNAL OF THE AMERICAN PHARMACEUTICAL ASSOCIATION-SCIENTIFIC EDITION, 1946, 35 (02): : 59 - 63
  • [38] Responding to Large-Scale Testing Errors
    Valenstein, Paul N.
    Alpern, Ann
    Keren, David F.
    AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2010, 133 (03) : 440 - 446
  • [39] Panel: Large-scale software testing
    Horgan, B
    EIGHTH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 1997, : 220 - 220
  • [40] LARGE-SCALE PROBLEMS IN LSI TESTING
    不详
    ELECTRONICS, 1968, 41 (24): : 99 - &