Fast estimation of posterior probabilities in change-point analysis through a constrained hidden Markov model

被引:9
|
作者
The Minh Luong [1 ]
Rozenholc, Yves [1 ]
Nuel, Gregory [1 ]
机构
[1] Univ Paris 05, MAP5, F-75006 Paris, France
关键词
Change-point estimation; Segmentation; Posterior distribution of change-points; Constrained hidden Markov model; Forward backward algorithm; Fast computation; COMPARATIVE GENOMIC HYBRIDIZATION; ARRAY CGH DATA; CIRCULAR BINARY SEGMENTATION; COPY NUMBER; REGRESSION; ALGORITHM;
D O I
10.1016/j.csda.2013.06.020
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The detection of change-points in heterogeneous sequences is a statistical challenge with applications across a wide variety of fields. In bioinformatics, a vast amount of methodology exists to identify an ideal set of change-points for detecting Copy Number Variation (CNV). While considerable efficient algorithms are currently available for finding the best segmentation of the data in CNV, relatively few approaches consider the important problem of assessing the uncertainty of the change-point location. Asymptotic and stochastic approaches exist but often require additional model assumptions to speed up the computations, while exact methods generally have quadratic complexity which may be intractable for large data sets of tens of thousands points or more. A hidden Markov model, with constraints specifically chosen to correspond to a segment-based change-point model, provides an exact method for obtaining the posterior distribution of change-points with linear complexity. The methods are implemented in the R package postCP, which uses the results of a given change-point detection algorithm to estimate the probability that each observation is a change-point. The results include an implementation of postCP on a publicly available CNV data set (n = 120). Due to its frequentist framework, postCP obtains less conservative confidence intervals than previously published Bayesian methods, but with linear complexity instead of quadratic. Simulations showed that postCP provided comparable loss to a Bayesian MCMC method when estimating posterior means, specifically when assessing larger scale changes, while being more computationally efficient. On another high-resolution CNV data set (n = 14,241), the implementation processed information in less than one second on a mid-range laptop computer. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:129 / 140
页数:12
相关论文
共 50 条
  • [1] Dirichlet Process Hidden Markov Multiple Change-point Model
    Ko, Stanley I. M.
    Chong, Terence T. L.
    Ghosh, Pulak
    BAYESIAN ANALYSIS, 2015, 10 (02): : 275 - 296
  • [2] POSTERIOR PROBABILITIES FOR A CHANGE-POINT USING RANKS
    PETTITT, AN
    BIOMETRIKA, 1981, 68 (02) : 443 - 450
  • [3] Posterior convergence and model estimation in Bayesian change-point problems
    Lian, Heng
    ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 : 239 - 253
  • [4] Quasi-hidden Markov model and its applications in change-point problems
    Wu, Zhengxiao
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (12) : 2273 - 2290
  • [5] Using hidden Markov chains and empirical Bayes change-point estimation for transect data
    Ver Hoef J.M.
    Cressie N.
    Environmental and Ecological Statistics, 1997, 4 (3) : 247 - 264
  • [6] Using hidden Markov chains and empirical Bayes change-point estimation for transect data
    Hoef, JMV
    Cressie, N
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 1997, 4 (03) : 247 - 264
  • [7] A Hidden Markov Filtering Approach to Multiple Change-point Models
    Lai, Tze Leung
    Xing, Haipeng
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 1914 - 1919
  • [8] Change-point estimation for censored regression model
    Zhan-feng WANG
    ScienceinChina(SeriesA:Mathematics), 2007, (01) : 63 - 72
  • [9] ESTIMATION OF A CHANGE-POINT IN THE WEIBULL REGRESSION MODEL
    Palmeros-Rojas, Oscar
    Villasenor-Alva, Jose A.
    Tajonar-Sanabria, Francisco S.
    ADVANCES AND APPLICATIONS IN STATISTICS, 2014, 43 (02) : 107 - 118
  • [10] Change-point estimation for censored regression model
    Wang, Zhan-feng
    Wu, Yao-hua
    Zhao, Lin-cheng
    SCIENCE IN CHINA SERIES A-MATHEMATICS, 2007, 50 (01): : 63 - 72