Large-scale dependent multiple testing via hidden semi-Markov models

被引:0
|
作者
Wang, Jiangzhou [1 ]
Wang, Pengfei [2 ]
机构
[1] Shenzhen Univ, Inst Stat Sci, Coll Math & Stat, Shenzhen 518060, Peoples R China
[2] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China
关键词
FDR; Hidden semi-Markov model; Multiple testing; FALSE DISCOVERY RATE; EMPIRICAL BAYES;
D O I
10.1007/s00180-023-01367-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large-scale multiple testing is common in the statistical analysis of high-dimensional data. Conventional multiple testing procedures usually implicitly assumed that the tests are independent. However, this assumption is rarely established in many practical applications, particularly in "high-throughput" data analysis. Incorporating dependence structure information among tests can improve statistical power and interpretability of discoveries. In this paper, we propose a new large-scale dependent multiple testing procedure based on the hidden semi-Markov model (HSMM), which characterizes local correlations among tests using a semi-Markov process instead of a first-order Markov chain. Our novel approach allows for the number of consecutive null hypotheses to follow any reasonable distribution, enabling a more accurate description of complex local correlations. We show that the proposed procedure minimizes the marginal false non-discovery rate (mFNR) at the same marginal false discovery rate (mFDR) level. To reduce the computational complexity of the HSMM, we make use of the hidden Markov model (HMM) with an expanded state space to approximate it. We provide a forward-backward algorithm and an expectation-maximization (EM) algorithm for implementing the proposed procedure. Finally, we demonstrate the superior performance of the SMLIS procedure through extensive simulations and a real data analysis.
引用
收藏
页码:1093 / 1126
页数:34
相关论文
共 50 条
  • [31] Recursive maximum likelihood estimation for hidden semi-Markov models
    Squire, K
    Levinson, SE
    2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 329 - 334
  • [32] ACTIVITY RECOGNITION USING LOGICAL HIDDEN SEMI-MARKOV MODELS
    Zha, Ya-Bing
    Yue, Shi-Guang
    Yin, Quan-Jun
    Liu, Xiao-Cheng
    2013 10TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2013, : 77 - 84
  • [33] Modified hidden semi-Markov models for motor wear prognosis
    Wu, X.
    Li, Y.
    Teng, W.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART J-JOURNAL OF ENGINEERING TRIBOLOGY, 2012, 226 (J2) : 174 - 179
  • [34] Hidden semi-Markov models for machinery health diagnosis and prognosis
    Dong, M
    He, D
    TRANSACTIONS OF THE NORTH AMERICAN MANUFACTURING RESEARCH INSTITUTION OF SME, VOL 32, 2004, 2004, : 199 - 206
  • [35] Unsupervised Classification of Human Activity with Hidden Semi-Markov Models
    Cavallo, Francesca Romana
    Toumazou, Christofer
    Nikolic, Konstantin
    APPLIED SYSTEM INNOVATION, 2022, 5 (04)
  • [36] Quantile hidden semi-Markov models for multivariate time series
    Merlo, Luca
    Maruotti, Antonello
    Petrella, Lea
    Punzo, Antonio
    STATISTICS AND COMPUTING, 2022, 32 (04)
  • [37] Quantile hidden semi-Markov models for multivariate time series
    Luca Merlo
    Antonello Maruotti
    Lea Petrella
    Antonio Punzo
    Statistics and Computing, 2022, 32
  • [38] Machine condition recognition via hidden semi-Markov model
    Yang, Wenhui
    Chen, Lu
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 158 (158)
  • [39] Initialization of Hidden Markov and Semi-Markov Models: A Critical Evaluation of Several Strategies
    Maruotti, Antonello
    Punzo, Antonio
    INTERNATIONAL STATISTICAL REVIEW, 2021, 89 (03) : 447 - 480
  • [40] Reduced-complexity estimation for large-scale hidden Markov models
    Dey, S
    Mareels, I
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (05) : 1242 - 1249