Covariate-modulated large-scale multiple testing under dependence

被引:2
|
作者
Wang, Jiangzhou [1 ]
Cui, Tingting [2 ]
Zhu, Wensheng [2 ]
Wang, Pengfei [3 ]
机构
[1] Shenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China
[2] Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun, Peoples R China
[3] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Covariate-modulated HMM; FDR; Local correlations; Large-scale multiple testing; FALSE DISCOVERY RATE; HIDDEN MARKOV-MODELS; GENOME-WIDE ASSOCIATION; MIXTURES; NUMBER;
D O I
10.1016/j.csda.2022.107664
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Large-scale multiple testing, which calls for conducting tens of thousands of hypothesis testings simultaneously, has been applied in many scientific fields. Most conventional multiple testing procedures often focused on the control of false discovery rate (FDR) and largely ignored covariate information and the dependence structure among tests. A FDR control procedure, termed as Covariate-Modulated Local Index of Significance (cmLIS) procedure, which not only takes into account local correlations among tests but also accommodates the covariate information by leveraging a covariate-modulated hidden Markov model (HMM), has been proposed. In the oracle case where all parameters of the covariate-modulated HMM are known, the cmLIS procedure is shown to be valid and optimal in some sense. According to whether the number of mixed components in the nonnull distribution is known, two Bayesian sampling algorithms are provided for parameter estimation. Extensive simulations are conducted to demonstrate the effectiveness of the cmLIS procedure over state-of-the-art multiple testing procedures. Finally, the cmLIS procedure is applied to an RNA sequencing data and a schizophrenia (SCZ) data. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] covRNA: discovering covariate associations in large-scale gene expression data
    Urban, Lara
    Remmele, Christian W.
    Dittrich, Marcus
    Schwarz, Roland F.
    Mueller, Tobias
    BMC RESEARCH NOTES, 2020, 13 (01)
  • [32] covRNA: discovering covariate associations in large-scale gene expression data
    Lara Urban
    Christian W. Remmele
    Marcus Dittrich
    Roland F. Schwarz
    Tobias Müller
    BMC Research Notes, 13
  • [33] Testing large-scale cloud management
    Citron, D.
    Zlotnick, A.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2011, 55 (06)
  • [34] SOME OBSERVATIONS ON LARGE-SCALE TESTING
    Bergen, Garret L.
    JOURNAL OF APPLIED PSYCHOLOGY, 1936, 20 (02) : 249 - 257
  • [35] LARGE-SCALE TESTING OF ACETOLACTATE SYNTHASE
    EHRAT, MC
    MOSINGER, E
    FELIX, HR
    PROSPECTS FOR AMINO ACID BIOSYNTHESIS INHIBITORS IN CROP PROTECTION AND PHARMACEUTICAL CHEMISTRY, 1989, 42 : 207 - 209
  • [36] A METHOD FOR LARGE-SCALE TESTING FOR PYROGENS
    KUNA, S
    EDISON, AO
    BUTZ, C
    JOURNAL OF THE AMERICAN PHARMACEUTICAL ASSOCIATION-SCIENTIFIC EDITION, 1946, 35 (02): : 59 - 63
  • [37] Responding to Large-Scale Testing Errors
    Valenstein, Paul N.
    Alpern, Ann
    Keren, David F.
    AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2010, 133 (03) : 440 - 446
  • [38] Panel: Large-scale software testing
    Horgan, B
    EIGHTH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 1997, : 220 - 220
  • [39] LARGE-SCALE PROBLEMS IN LSI TESTING
    不详
    ELECTRONICS, 1968, 41 (24): : 99 - &
  • [40] TESTING OF OBJECTIVES IN LARGE-SCALE PRODUCTION
    IVANOVSKII, IB
    PLOTNIKOV, VS
    KHLEBNIKOV, FP
    SOVIET JOURNAL OF OPTICAL TECHNOLOGY, 1977, 44 (04): : 220 - 223