Anomaly Detection for a Large Number of Streams: A Permutation-Based Higher Criticism Approach

被引:0
|
作者
Stoepker, Ivo, V [1 ]
Castro, Rui M. [1 ]
Arias-Castro, Ery [2 ,3 ]
van den Heuvel, Edwin [1 ]
机构
[1] Tech Univ Eindhoven, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Halicioglu Data Sci Inst, La Jolla, CA 92093 USA
关键词
Distribution-free testing; Minimax hypothesis testing; Permutation test; FALSE DISCOVERY RATE; TESTS; RARE;
D O I
10.1080/01621459.2022.2126361
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Anomaly detection when observing a large number of data streams is essential in a variety of applications, ranging from epidemiological studies to monitoring of complex systems. High-dimensional scenarios are usually tackled with scan-statistics and related methods, requiring stringent modeling assumptions for proper calibration. In this work we take a nonparametric stance, and propose a permutation-based variant of the higher criticism statistic not requiring knowledge of the null distribution. This results in an exact test in finite samples which is asymptotically optimal in the wide class of exponential models. We demonstrate the power loss in finite samples is minimal with respect to the oracle test. Furthermore, since the proposed statistic does not rely on asymptotic approximations it typically performs better than popular variants of higher criticism that rely on such approximations. We include recommendations such that the test can be readily applied in practice, and demonstrate its applicability in monitoring the content uniformity of an active ingredient for a batch-produced drug product. for this article are available online.
引用
收藏
页码:461 / 474
页数:14
相关论文
共 50 条
  • [31] Permutation-Based Elitist Genetic Algorithm for Optimization of Large-Sized Resource-Constrained Project Scheduling
    Kim, Jin-Lee
    Ellis, Ralph D., Jr.
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2008, 134 (11) : 904 - 913
  • [32] A Cooperative Tree-based Hybrid GA-B&B Approach for Solving Challenging Permutation-based Problems
    Mehdi, Malika
    Charr, Jean-Claude
    Melab, Nouredine
    Talbi, El-Ghazali
    Bouvry, Pascal
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 513 - 520
  • [33] Approach to Anomaly Detection in Microservice System with Multi-Source Data Streams
    ZHANG Qixun
    HAN Jing
    CHENG Li
    ZHANG Baisheng
    GONG Zican
    ZTECommunications, 2022, 20 (03) : 85 - 92
  • [34] STREAMRHF: Tree-Based Unsupervised Anomaly Detection for Data Streams
    Nesic, Stefan
    Putina, Andrian
    Bahri, Maroua
    Huet, Alexis
    Navarro, Jose Manuel
    Rossi, Dario
    Sozio, Mauro
    2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2022,
  • [35] Anomaly Detection in Data Streams Based on Graph Coloring Density Coefficients
    Tripathi, Achyut Mani
    Baruah, Rashmi Dutta
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [36] Anomaly Detection for Data Streams in Large-Scale Distributed Heterogeneous Computing Environments
    Dang, Yue
    Wang, Bin
    Brant, Ryan
    Zhang, Zhiping
    Alqallaf, Maha
    Wu, Zhiqiang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS 2017), 2017, : 121 - 130
  • [37] DETECTION BOUNDARY AND HIGHER CRITICISM APPROACH FOR RARE AND WEAK GENETIC EFFECTS
    Wu, Zheyang
    Sun, Yiming
    He, Shiquan
    Cho, Judy
    Zhao, Hongyu
    Jin, Jiashun
    ANNALS OF APPLIED STATISTICS, 2014, 8 (02): : 824 - 851
  • [38] An Ensemble-Based Approach to Anomaly Detection in Marine Engine Sensor Streams for Efficient Condition Monitoring and Analysis
    Kim, Donghyun
    Lee, Sangbong
    Lee, Jihwan
    SENSORS, 2020, 20 (24) : 1 - 16
  • [39] An Anomaly Detection Approach based on Symbolic Similarity
    Yan, Qiuyan
    Xia, Shixiong
    Shi, Yilong
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 3003 - 3008
  • [40] A CNN Based Approach for Crowd Anomaly Detection
    Joshi, Kinjal, V
    Patel, Narendra M.
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (01): : 1 - 11