PENALIZED ESTIMATION IN HIGH-DIMENSIONAL HIDDEN MARKOV MODELS WITH STATE-SPECIFIC GRAPHICAL MODELS

被引:16
|
作者
Stadler, Nicolas [1 ]
Mukherjee, Sach [1 ]
机构
[1] Netherlands Canc Inst, Dept Biochem, NL-1066 CX Amsterdam, Netherlands
来源
ANNALS OF APPLIED STATISTICS | 2013年 / 7卷 / 04期
关键词
HMM; Graphical Lasso; universal regularization; model selection; MMDL; greedy backward pruning; genome biology; chromatin modeling; VARIABLE SELECTION; MIXTURE;
D O I
10.1214/13-AOAS662
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider penalized estimation in hidden Markov models (HMMs) with multivariate Normal observations. In the moderate-to-large dimensional setting, estimation for HMMs remains challenging in practice, due to several concerns arising from the hidden nature of the states. We address these concerns by l(1)-penalization of state-specific inverse covariance matrices. Penalized estimation leads to sparse inverse covariance matrices which can be interpreted as state-specific conditional independence graphs. Penalization is nontrivial in this latent variable setting; we propose a penalty that automatically adapts to the number of states K and the state-specific sample sizes and can cope with scaling issues arising from the unknown states. The methodology is adaptive and very general, applying in particular to both low- and high-dimensional settings without requiring hand tuning. Furthermore, our approach facilitates exploration of the number of states K by coupling estimation for successive candidate values K. Empirical results on simulated examples demonstrate the effectiveness of the proposed approach. In a challenging real data example from genome biology, we demonstrate the ability of our approach to yield gains in predictive power and to deliver richer estimates than existing methods.
引用
收藏
页码:2157 / 2179
页数:23
相关论文
共 50 条
  • [31] Efficient Distributed Estimation of High-dimensional Sparse Precision Matrix for Transelliptical Graphical Models
    Guan Peng Wang
    Heng Jian Cui
    Acta Mathematica Sinica, English Series, 2021, 37 : 689 - 706
  • [32] Efficient Distributed Estimation of High-dimensional Sparse Precision Matrix for Transelliptical Graphical Models
    Wang, Guan Peng
    Cui, Heng Jian
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2021, 37 (05) : 689 - 706
  • [33] Non-convex penalized estimation in high-dimensional models with single-index structure
    Wang, Tao
    Xu, Pei-Rong
    Zhu, Li-Xing
    JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 109 : 221 - 235
  • [34] Directional fault classification for correlated High-Dimensional data streams using hidden Markov models
    He, Yan
    Kang, Yicheng
    Tsung, Fugee
    Xiang, Dongdong
    JOURNAL OF QUALITY TECHNOLOGY, 2023, 55 (05) : 535 - 549
  • [35] Filtering and Smoothing State Estimation for Flag Hidden Markov Models
    Doty, Kyle
    Roy, Sandip
    Fischer, Thomas R.
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 7042 - 7047
  • [36] State Estimation for Flag Hidden Markov Models with Imperfect Sensors
    Doty, Kyle
    Roy, Sandip
    Fischer, Thomas R.
    2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
  • [37] AN ASYMPTOTIC ANALYSIS OF BAYESIAN STATE ESTIMATION IN HIDDEN MARKOV MODELS
    Yamazaki, Keisuke
    2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
  • [38] PENALIZED JOINT MODELS OF HIGH-DIMENSIONAL LONGITUDINAL BIOMARKERS AND A SURVIVAL OUTCOME
    Sun, Jiehuan
    Basu, Sanjib
    ANNALS OF APPLIED STATISTICS, 2024, 18 (02): : 1490 - 1505
  • [39] Variable selection and estimation in high-dimensional models
    Horowitz, Joel L.
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2015, 48 (02): : 389 - 407
  • [40] Variance estimation for high-dimensional regression models
    Spokoiny, V
    JOURNAL OF MULTIVARIATE ANALYSIS, 2002, 82 (01) : 111 - 133