Large-scale dependent multiple testing via hidden semi-Markov models

被引:0
|
作者
Wang, Jiangzhou [1 ]
Wang, Pengfei [2 ]
机构
[1] Shenzhen Univ, Inst Stat Sci, Coll Math & Stat, Shenzhen 518060, Peoples R China
[2] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China
关键词
FDR; Hidden semi-Markov model; Multiple testing; FALSE DISCOVERY RATE; EMPIRICAL BAYES;
D O I
10.1007/s00180-023-01367-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large-scale multiple testing is common in the statistical analysis of high-dimensional data. Conventional multiple testing procedures usually implicitly assumed that the tests are independent. However, this assumption is rarely established in many practical applications, particularly in "high-throughput" data analysis. Incorporating dependence structure information among tests can improve statistical power and interpretability of discoveries. In this paper, we propose a new large-scale dependent multiple testing procedure based on the hidden semi-Markov model (HSMM), which characterizes local correlations among tests using a semi-Markov process instead of a first-order Markov chain. Our novel approach allows for the number of consecutive null hypotheses to follow any reasonable distribution, enabling a more accurate description of complex local correlations. We show that the proposed procedure minimizes the marginal false non-discovery rate (mFNR) at the same marginal false discovery rate (mFDR) level. To reduce the computational complexity of the HSMM, we make use of the hidden Markov model (HMM) with an expanded state space to approximate it. We provide a forward-backward algorithm and an expectation-maximization (EM) algorithm for implementing the proposed procedure. Finally, we demonstrate the superior performance of the SMLIS procedure through extensive simulations and a real data analysis.
引用
收藏
页码:1093 / 1126
页数:34
相关论文
共 50 条
  • [41] Filtering hidden semi-Markov chains
    Elliott, Robert
    Limnios, Nikolaos
    Swishchuk, Anatoliy
    STATISTICS & PROBABILITY LETTERS, 2013, 83 (09) : 2007 - 2014
  • [42] VITERBI ALGORITHMS FOR HIDDEN SEMI-MARKOV MODELS WITH APPLICATION TO DNA ANALYSIS
    Pertsinidou, Christina-Elisavet
    Limnios, Nikolaos
    RAIRO-OPERATIONS RESEARCH, 2015, 49 (03) : 511 - 526
  • [43] Segmenting Continuous Motions with Hidden Semi-markov Models and Gaussian Processes
    Nakamura, Tomoaki
    Nagai, Takayuki
    Mochihashi, Daichi
    Kobayashi, Ichiro
    Asoh, Hideki
    Kaneko, Masahide
    FRONTIERS IN NEUROROBOTICS, 2017, 11
  • [44] MCMC implementation for Bayesian hidden semi-Markov models with illustrative applications
    Economou, Theodoros
    Bailey, Trevor C.
    Kapelan, Zoran
    STATISTICS AND COMPUTING, 2014, 24 (05) : 739 - 752
  • [45] Hidden Semi-Markov Models for Semantic-Graph Language Modeling
    Yetim, Sadik Yagiz
    Duman, Tolga M.
    Arikan, Orhan
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (16):
  • [46] Equipment health diagnosis and prognosis using hidden semi-Markov models
    Ming Dong
    David He
    Prashant Banerjee
    Jonathan Keller
    The International Journal of Advanced Manufacturing Technology, 2006, 30 : 738 - 749
  • [47] Inhomogeneous hidden semi-Markov models for incompletely observed point processes
    Shahzadi, Amina
    Wang, Ting
    Bebbington, Mark
    Parry, Matthew
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2023, 75 (02) : 253 - 280
  • [48] The use of hidden semi-Markov models in clinical diagnosis maze tasks
    Marhasev, Einat
    Hadad, Meirav
    Kaminka, Gal A.
    Feintuch, Uri
    INTELLIGENT DATA ANALYSIS, 2009, 13 (06) : 943 - 967
  • [49] On robust estimation of hidden semi-Markov regime-switching models
    Qin, Shanshan
    Tan, Zhenni
    Wu, Yuehua
    ANNALS OF OPERATIONS RESEARCH, 2024, 338 (2-3) : 1049 - 1081
  • [50] MCMC implementation for Bayesian hidden semi-Markov models with illustrative applications
    Theodoros Economou
    Trevor C. Bailey
    Zoran Kapelan
    Statistics and Computing, 2014, 24 : 739 - 752