PENALIZED ESTIMATION IN HIGH-DIMENSIONAL HIDDEN MARKOV MODELS WITH STATE-SPECIFIC GRAPHICAL MODELS

被引:16
|
作者
Stadler, Nicolas [1 ]
Mukherjee, Sach [1 ]
机构
[1] Netherlands Canc Inst, Dept Biochem, NL-1066 CX Amsterdam, Netherlands
来源
ANNALS OF APPLIED STATISTICS | 2013年 / 7卷 / 04期
关键词
HMM; Graphical Lasso; universal regularization; model selection; MMDL; greedy backward pruning; genome biology; chromatin modeling; VARIABLE SELECTION; MIXTURE;
D O I
10.1214/13-AOAS662
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider penalized estimation in hidden Markov models (HMMs) with multivariate Normal observations. In the moderate-to-large dimensional setting, estimation for HMMs remains challenging in practice, due to several concerns arising from the hidden nature of the states. We address these concerns by l(1)-penalization of state-specific inverse covariance matrices. Penalized estimation leads to sparse inverse covariance matrices which can be interpreted as state-specific conditional independence graphs. Penalization is nontrivial in this latent variable setting; we propose a penalty that automatically adapts to the number of states K and the state-specific sample sizes and can cope with scaling issues arising from the unknown states. The methodology is adaptive and very general, applying in particular to both low- and high-dimensional settings without requiring hand tuning. Furthermore, our approach facilitates exploration of the number of states K by coupling estimation for successive candidate values K. Empirical results on simulated examples demonstrate the effectiveness of the proposed approach. In a challenging real data example from genome biology, we demonstrate the ability of our approach to yield gains in predictive power and to deliver richer estimates than existing methods.
引用
收藏
页码:2157 / 2179
页数:23
相关论文
共 50 条
  • [41] Monitoring sequential structural changes in penalized high-dimensional linear models
    Ratnasingam, Suthakaran
    Ning, Wei
    SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS, 2021, 40 (03): : 381 - 404
  • [42] Conditional score matching for high-dimensional partial graphical models
    Fan, Xinyan
    Zhang, Qingzhao
    Ma, Shuangge
    Fang, Kuangnan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 153 (153)
  • [43] SCAD-PENALIZED REGRESSION IN HIGH-DIMENSIONAL PARTIALLY LINEAR MODELS
    Xie, Huiliang
    Huang, Jian
    ANNALS OF STATISTICS, 2009, 37 (02): : 673 - 696
  • [44] Quantifying the Privacy Risks of Learning High-Dimensional Graphical Models
    Murakonda, Sasi Kumar
    Shokri, Reza
    Theodorakopoulos, George
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [45] Variance estimation in high-dimensional linear models
    Dicker, Lee H.
    BIOMETRIKA, 2014, 101 (02) : 269 - 284
  • [46] High-dimensional inference for cluster-based graphical models
    Eisenach, Carson
    Bunea, Florentina
    Ning, Yang
    Dinicu, Claudiu
    1600, Microtome Publishing (21):
  • [47] Interactive analysis of high-dimensional association structures with graphical models
    Angelika Blauth
    Iris Pigeot
    François Bry
    Metrika, 2000, 51 : 53 - 65
  • [48] High-dimensional undirected graphical models for arbitrary mixed data
    Goebler, Konstantin
    Drton, Mathias
    Mukherjee, Sach
    Miloschewski, Anne
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (01): : 2339 - 2404
  • [49] Interactive analysis of high-dimensional association structures with graphical models
    Blauth, A
    Pigeot, I
    Bry, F
    METRIKA, 2000, 51 (01) : 53 - 65
  • [50] High-Dimensional Inference for Cluster-Based Graphical Models
    Eisenach, Carson
    Bunea, Florentina
    Ning, Yang
    Dinicu, Claudiu
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21