Fast estimation of posterior probabilities in change-point analysis through a constrained hidden Markov model

被引:9
|
作者
The Minh Luong [1 ]
Rozenholc, Yves [1 ]
Nuel, Gregory [1 ]
机构
[1] Univ Paris 05, MAP5, F-75006 Paris, France
关键词
Change-point estimation; Segmentation; Posterior distribution of change-points; Constrained hidden Markov model; Forward backward algorithm; Fast computation; COMPARATIVE GENOMIC HYBRIDIZATION; ARRAY CGH DATA; CIRCULAR BINARY SEGMENTATION; COPY NUMBER; REGRESSION; ALGORITHM;
D O I
10.1016/j.csda.2013.06.020
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The detection of change-points in heterogeneous sequences is a statistical challenge with applications across a wide variety of fields. In bioinformatics, a vast amount of methodology exists to identify an ideal set of change-points for detecting Copy Number Variation (CNV). While considerable efficient algorithms are currently available for finding the best segmentation of the data in CNV, relatively few approaches consider the important problem of assessing the uncertainty of the change-point location. Asymptotic and stochastic approaches exist but often require additional model assumptions to speed up the computations, while exact methods generally have quadratic complexity which may be intractable for large data sets of tens of thousands points or more. A hidden Markov model, with constraints specifically chosen to correspond to a segment-based change-point model, provides an exact method for obtaining the posterior distribution of change-points with linear complexity. The methods are implemented in the R package postCP, which uses the results of a given change-point detection algorithm to estimate the probability that each observation is a change-point. The results include an implementation of postCP on a publicly available CNV data set (n = 120). Due to its frequentist framework, postCP obtains less conservative confidence intervals than previously published Bayesian methods, but with linear complexity instead of quadratic. Simulations showed that postCP provided comparable loss to a Bayesian MCMC method when estimating posterior means, specifically when assessing larger scale changes, while being more computationally efficient. On another high-resolution CNV data set (n = 14,241), the implementation processed information in less than one second on a mid-range laptop computer. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:129 / 140
页数:12
相关论文
共 50 条
  • [21] Change point estimation for continuous-time hidden Markov models
    Elliott, Robert J.
    Deng, Jia
    SYSTEMS & CONTROL LETTERS, 2013, 62 (02) : 112 - 114
  • [22] A Parameter Estimation of Software Reliability Growth Model with Change-Point
    Kim, Do Hoon
    Park, Chun Gun
    Nam, Kyung H.
    KOREAN JOURNAL OF APPLIED STATISTICS, 2008, 21 (05) : 813 - 823
  • [23] Estimation in a Cox regression model with a change-point at an unknown time
    Pons, O
    STATISTICS, 2002, 36 (02) : 101 - 124
  • [24] Maximum score change-point estimation in binary response model
    Zhao L.
    Kou C.
    Wu Y.
    Journal of Systems Science and Complexity, 2006, 19 (3) : 386 - 392
  • [25] Recurrent Estimation of Hidden Markov Model Transition Probabilities from Aggregate Data
    Lyubchyk, Leonid
    Grinberg, Galyna
    Dunaievska, Olha
    Lubchick, Maria
    2019 9TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER INFORMATION TECHNOLOGIES (ACIT'2019), 2019, : 64 - 67
  • [26] CALIBRATION OF PIECEWISE MARKOV MODELS USING A CHANGE-POINT ANALYSIS THROUGH AN ITERATIVE CONVEX OPTIMIZATION ALGORITHM
    Alarid-Escudero, F.
    Enns, E.
    Peralta-Torres, Y. E.
    Maclehose, R.
    Kuntz, K. M.
    VALUE IN HEALTH, 2015, 18 (07) : A814 - A814
  • [27] MAXIMUM SCORE CHANGE-POINT ESTIMATION IN BINARY RESPONSE MODEL
    Lincheng ZHAO Chaofeng KOU Yaohua WU Department of Statistics and Finance
    JournalofSystemsScience&Complexity, 2006, (03) : 386 - 392
  • [28] Smoothed maximum score change-point estimation in binary response model
    Yuan M.
    Zhao L.-C.
    Wu Y.-H.
    Acta Mathematicae Applicatae Sinica, 2006, 22 (4) : 655 - 662
  • [29] Filtering and change point estimation for hidden Markov-modulated Poisson processes
    Elliott, Robert J.
    Siu, Tak Kuen
    APPLIED MATHEMATICS LETTERS, 2014, 28 : 66 - 71
  • [30] Estimation of Cure Fraction and Misclassification Probabilities Using Continuous Time Hidden Markov Model
    Grover, Gurprit
    Chakravarty, Sangeeta
    Thakur, Arpan Kumar
    STATISTICS AND APPLICATIONS, 2021, 19 (02): : 127 - 138