CHANGE-POINT DETECTION IN MULTINOMIAL DATA WITH A LARGE NUMBER OF CATEGORIES

被引:23
|
作者
Wang, Guanghui [1 ,2 ]
Zou, Changliang [1 ,2 ]
Yin, Guosheng [3 ]
机构
[1] Nankai Univ, Inst Stat, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
来源
ANNALS OF STATISTICS | 2018年 / 46卷 / 05期
关键词
Asymptotic normality; categorical data; high-dimensional homogeneity test; multiple change-point detection; sparse contingency table; TIME-SERIES; MULTIPLE; MODELS;
D O I
10.1214/17-AOS1610
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a sequence of multinomial data for which the probabilities associated with the categories are subject to abrupt changes of unknown magnitudes at unknown locations. When the number of categories is comparable to or even larger than the number of subjects allocated to these categories, conventional methods such as the classical Pearson's chi-squared test and the deviance test may not work well. Motivated by high-dimensional homogeneity tests, we propose a novel change-point detection procedure that allows the number of categories to tend to infinity. The null distribution of our test statistic is asymptotically normal and the test performs well with finite samples. The number of change-points is determined by minimizing a penalized objective function based on segmentation, and the locations of the change-points are estimated by minimizing the objective function with the dynamic programming algorithm. Under some mild conditions, the consistency of the estimators of multiple change-points is established. Simulation studies show that the proposed method performs satisfactorily for identifying change-points in terms of power and estimation accuracy, and it is illustrated with an analysis of a real data set.
引用
收藏
页码:2020 / 2044
页数:25
相关论文
共 50 条
  • [41] Online change-point detection with kernels
    Ferrari, Andre
    Richard, Cedric
    Bourrier, Anthony
    Bouchikhi, Ikram
    PATTERN RECOGNITION, 2023, 133
  • [42] Change-point detection using wavelets
    Chen, WF
    Kuo, CCJ
    DIGITAL SIGNAL PROCESSING TECHNOLOGY, 1996, 2750 : 147 - 158
  • [43] Differentially Private Change-Point Detection
    Cummings, Rachel
    Krehbiel, Sara
    Mei, Yajun
    Tuo, Rui
    Zhang, Wanrong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [44] Sketching for Sequential Change-Point Detection
    Xie, Yao
    Wang, Meng
    Thompson, Andrew
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 78 - 82
  • [45] OPTIMAL CHANGE-POINT DETECTION AND LOCALIZATION
    Verzelen, Nicolas
    Fromont, Magalie
    Lerasle, Matthieu
    Reynaud-Bouret, Patricia
    ANNALS OF STATISTICS, 2023, 51 (04): : 1586 - 1610
  • [46] Sketching for sequential change-point detection
    Cao, Yang
    Thompson, Andrew
    Wang, Meng
    Xie, Yao
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2019, 2019 (01)
  • [47] Change-point detection in panel data via double CUSUM statistic
    Cho, Haeran
    ELECTRONIC JOURNAL OF STATISTICS, 2016, 10 (02): : 2000 - 2038
  • [48] A Dirichlet process model for change-point detection with multivariate bioclimatic data
    Mastrantonio, Gianluca
    Jona Lasinio, Giovanna
    Pollice, Alessio
    Teodonio, Lorenzo
    Capotorti, Giulia
    ENVIRONMETRICS, 2022, 33 (01)
  • [49] Directional Change-Point Detection for Process Control with Multivariate Categorical Data
    Li, Jian
    Tsung, Fugee
    Zou, Changliang
    NAVAL RESEARCH LOGISTICS, 2013, 60 (02) : 160 - 173
  • [50] Optimal multiple change-point detection for high-dimensional data
    Pilliat, Emmanuel
    Carpentier, Alexandra
    Verzelen, Nicolas
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (01): : 1240 - 1315