Anomaly Detection for a Large Number of Streams: A Permutation-Based Higher Criticism Approach

被引:0
|
作者
Stoepker, Ivo, V [1 ]
Castro, Rui M. [1 ]
Arias-Castro, Ery [2 ,3 ]
van den Heuvel, Edwin [1 ]
机构
[1] Tech Univ Eindhoven, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Halicioglu Data Sci Inst, La Jolla, CA 92093 USA
关键词
Distribution-free testing; Minimax hypothesis testing; Permutation test; FALSE DISCOVERY RATE; TESTS; RARE;
D O I
10.1080/01621459.2022.2126361
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Anomaly detection when observing a large number of data streams is essential in a variety of applications, ranging from epidemiological studies to monitoring of complex systems. High-dimensional scenarios are usually tackled with scan-statistics and related methods, requiring stringent modeling assumptions for proper calibration. In this work we take a nonparametric stance, and propose a permutation-based variant of the higher criticism statistic not requiring knowledge of the null distribution. This results in an exact test in finite samples which is asymptotically optimal in the wide class of exponential models. We demonstrate the power loss in finite samples is minimal with respect to the oracle test. Furthermore, since the proposed statistic does not rely on asymptotic approximations it typically performs better than popular variants of higher criticism that rely on such approximations. We include recommendations such that the test can be readily applied in practice, and demonstrate its applicability in monitoring the content uniformity of an active ingredient for a batch-produced drug product. for this article are available online.
引用
收藏
页码:461 / 474
页数:14
相关论文
共 50 条
  • [41] Anomaly detection: A signature.-based approach
    Sy, B
    Chan, HH
    DMIN '05: Proceedings of the 2005 International Conference on Data Mining, 2005, : 193 - 199
  • [42] EADetection: An efficient and accurate sequential behavior anomaly detection approach over data streams
    Cheng, Li
    Wang, Yijie
    Zhou, Yong
    Ma, Xingkong
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2018, 14 (10)
  • [43] An effective operations permutation-based discrete harmony search approach for the flexible job shop scheduling problem with makespan criterion
    Gaham, Mehdi
    Bouzouia, Brahim
    Achour, Nouara
    APPLIED INTELLIGENCE, 2018, 48 (06) : 1423 - 1441
  • [44] Permutation-Based Noncoherent Space-Time Codes With Analog Energy Detection for IR-UWB Communications With PPM
    Abou-Rjeily, Chadi
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (08) : 5541 - 5554
  • [45] Higher-Order PCA for Anomaly Detection in Large-Scale Networks
    Kim, Hayang
    Lee, Sungeun
    Ma, Xiaoli
    Wang, Chao
    2009 3RD IEEE INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2009, : 85 - 88
  • [46] Higher-Order PCA for Anomaly Detection in Large-Scale Networks
    Kim, Hayang
    Lee, Sungeun
    Ma, Xiaoli
    Wang, Chao
    2009 3RD IEEE INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2009), 2009, : 85 - 88
  • [47] Higher-Order Moment-Based Anomaly Detection
    Renganathan, Venkatraman
    Hashemi, Navid
    Ruths, Justin
    Summers, Tyler H.
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 211 - 216
  • [48] Tamper Detection in Industrial Sensors: An Approach Based on Anomaly Detection
    Villegas-Ch, William
    Govea, Jaime
    Jaramillo-Alcazar, Angel
    SENSORS, 2023, 23 (21)
  • [49] A robust approach to design a single facility layout plan in dynamic manufacturing environments using a permutation-based genetic algorithm
    Fazlelahi, Forough Zarea
    Pournader, Mehrdokht
    Gharakhani, Mohsen
    Sadjadi, Seyed Jafar
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2016, 230 (12) : 2264 - 2274
  • [50] Data Streams Anomaly Detection Algorithm Based on Self-set Threshold
    Luo Yuanyan
    Du Xuehui
    Sun Yi
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING (ICCIP 2018), 2018, : 18 - 26