VARIATIONAL BAYESIAN ANALYSIS OF NONHOMOGENEOUS HIDDEN MARKOV MODELS WITH LONG AND ULTRALONG SEQUENCES

被引:0
|
作者
Chen, Xinyuan [1 ]
Li, Yiwei [2 ]
Feng, Xiangnan [3 ]
Chang, Joseph T. [4 ]
机构
[1] Mississippi State Univ, Dept Math & Stat, Starkville, MS 39762 USA
[2] Lingnan Univ, Dept Mkt & Int Business, Hong Kong, Peoples R China
[3] Fudan Univ, Dept Stat & Data Sci, Shanghai, Peoples R China
[4] Yale Univ, Dept Stat & Data Sci, New Haven, CT USA
来源
ANNALS OF APPLIED STATISTICS | 2023年 / 17卷 / 02期
基金
中国国家自然科学基金;
关键词
Nonhomogeneous hidden Markov model; variational Bayesian inference; local Lya-punov exponents; mobile Internet usage; INFERENCE; APPROXIMATION;
D O I
10.1214/22-AOAS1685
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Nonhomogeneous hidden Markov models (NHMMs) are useful in mod-eling sequential and autocorrelated data. Bayesian approaches, particularly Markov chain Monte Carlo (MCMC) methods, are principal statistical in-ference tools for NHMMs. However, MCMC sampling is computationally demanding, especially for long observation sequences. We develop a vari-ational Bayes (VB) method for NHMMs, which utilizes a structured varia-tional family of Gaussian distributions with factorized covariance matrices to approximate target posteriors, combining a forward-backward algorithm and stochastic gradient ascent in estimation. To improve efficiency and handle ul-tralong sequences, we further propose a subsequence VB (SVB) method that works on subsamples. The SVB method exploits the memory decay property of NHMMs and uses buffers to control for bias caused by breaking sequen-tial dependence from subsampling. We highlight that the local nonhomogene-ity of NHMMs substantially affects the required buffer lengths and propose the use of local Lyapunov exponents that characterize local memory decay rates of NHMMs and adaptively determine buffer lengths. Our methods are validated in simulation studies and in modeling ultralong sequences of cus-tomers' telecom records to uncover the relationship between their mobile In-ternet usage behaviors and conventional telecommunication behaviors.
引用
收藏
页码:1615 / 1640
页数:26
相关论文
共 50 条