Anomaly Detection for a Large Number of Streams: A Permutation-Based Higher Criticism Approach

被引:0
|
作者
Stoepker, Ivo, V [1 ]
Castro, Rui M. [1 ]
Arias-Castro, Ery [2 ,3 ]
van den Heuvel, Edwin [1 ]
机构
[1] Tech Univ Eindhoven, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Halicioglu Data Sci Inst, La Jolla, CA 92093 USA
关键词
Distribution-free testing; Minimax hypothesis testing; Permutation test; FALSE DISCOVERY RATE; TESTS; RARE;
D O I
10.1080/01621459.2022.2126361
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Anomaly detection when observing a large number of data streams is essential in a variety of applications, ranging from epidemiological studies to monitoring of complex systems. High-dimensional scenarios are usually tackled with scan-statistics and related methods, requiring stringent modeling assumptions for proper calibration. In this work we take a nonparametric stance, and propose a permutation-based variant of the higher criticism statistic not requiring knowledge of the null distribution. This results in an exact test in finite samples which is asymptotically optimal in the wide class of exponential models. We demonstrate the power loss in finite samples is minimal with respect to the oracle test. Furthermore, since the proposed statistic does not rely on asymptotic approximations it typically performs better than popular variants of higher criticism that rely on such approximations. We include recommendations such that the test can be readily applied in practice, and demonstrate its applicability in monitoring the content uniformity of an active ingredient for a batch-produced drug product. for this article are available online.
引用
收藏
页码:461 / 474
页数:14
相关论文
共 50 条
  • [1] Nestedness patterns and the dual nature of community reassembly in California streams: a multivariate permutation-based approach
    Novak, Mark
    Moore, Jonathan W.
    Leidy, Robert A.
    GLOBAL CHANGE BIOLOGY, 2011, 17 (12) : 3714 - 3723
  • [2] A permutation-based Bayesian approach for inverse covariance estimation
    Cao, Xuan
    Zhang, Shaojun
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2020, 49 (14) : 3557 - 3571
  • [3] Rapid Deployment of Anomaly Detection Models for Large Number of Emerging KPI Streams
    Bu, Jiahao
    Liu, Ying
    Zhang, Shenglin
    Meng, Weibin
    Liu, Qitong
    Zhu, Xiaotian
    Pei, Dan
    2018 IEEE 37TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2018,
  • [4] Anomaly detection of large scale network based on data streams
    Research Center of Computer Network and Information Security Technology, Harbin Institute of Technology, Harbin 150001, China
    Tongxin Xuebao, 2006, 2 (1-8):
  • [5] A Permutation-Based Approach for Solving the Job-Shop Problem
    Zhou J.
    Constraints, 1997, 2 (2) : 185 - 213
  • [6] A Graph-Theoretic Approach to Multiobjective Permutation-Based Optimization
    Koliechkina, Liudmyla
    Pichugina, Oksana
    Yakovlev, Sergiy
    OPTIMIZATION AND APPLICATIONS, OPTIMA 2019, 2020, 1145 : 383 - 400
  • [7] Epistasis detection using a permutation-based gradient boosting machine
    Che, Kai
    Liu, Xiaoyan
    Guo, Maozu
    Zhang, Junwei
    Wang, Lei
    Zhang, Yin
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1247 - 1252
  • [8] New approach for attack of permutation-based image encryption schemes
    Mekhaznia T.
    Bennour A.
    International Journal of Computers and Applications, 2021, 43 (07) : 697 - 705
  • [9] Permutation-Based Diversity Measure for Classifier-Chain Approach
    Trajdos, Pawel
    Kurzynski, Marek
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2017, 2018, 578 : 412 - 422
  • [10] Multi-user detection for random permutation-based multiple access
    Coulon, M
    Roviras, D
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: SIGNAL PROCESSING FOR COMMUNICATIONS SPECIAL SESSIONS, 2003, : 61 - 64