Anomaly pattern detection for streaming data

Cited by: 19
Authors
Kim, Taegong [1]
Park, Cheong Hee [1]
Affiliations
[1] Chungnam Natl Univ, Dept Comp Sci & Engn, 220 Gung Dong, Daejeon 305763, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Anomaly pattern detection; Control charts; Hypothesis testing; Outlier detection; Streaming data; OUTLIER;
DOI
10.1016/j.eswa.2020.113252
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Outlier detection aims to find data samples that differ from most other data samples. While outlier detection operates at the level of individual instances, anomaly pattern detection on a data stream means detecting a time point at which the pattern generating the data is unusual and significantly different from normal behavior. Beyond predicting the outlierness of individual data samples in a data stream, it can be very useful to detect the occurrence of anomalous patterns in real time. In this paper, we propose a method for anomaly pattern detection in a data stream based on binary classification of outliers and statistical tests on the resulting stream of binary labels (normal or outlier). In the first step, by applying a clustering-based outlier detection method, we transform the data stream into a stream of binary values, where 0 stands for a prediction of normal data and 1 for a prediction of an outlier. In the second step, anomaly pattern detection is performed on the stream of binary values by two approaches: testing the equality of the parameters of the binomial distributions of a reference window and a detection window, and using control charts for the fraction defective. The proposed method obtained an average true positive detection rate of 94% in simulated experiments using real and artificial data. The experimental results also show that the occurrence of an anomaly pattern can be detected reliably even when outlier detection performance is relatively low. (C) 2020 Elsevier Ltd. All rights reserved.
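The second step described in the abstract can be illustrated with a minimal sketch. It assumes the clustering-based first step has already produced a list of 0/1 labels. The specific test used here (a one-sided pooled two-proportion z-test), the 3-sigma upper limit for the fraction-defective chart, the window sizes, the significance level, and all function names are illustrative assumptions, not details taken from the paper.

```python
import math
from statistics import NormalDist


def two_proportion_z_test(ref, det):
    """One-sided pooled z-test: is the outlier rate in `det` higher than in `ref`?

    Assumption: this normal approximation stands in for the binomial
    parameter-equality test mentioned in the abstract.
    """
    n1, n2 = len(ref), len(det)
    p1, p2 = sum(ref) / n1, sum(det) / n2
    pooled = (sum(ref) + sum(det)) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:  # both windows are all 0s or all 1s: no evidence of a change
        return 0.0, 1.0
    z = (p2 - p1) / se
    return z, 1 - NormalDist().cdf(z)


def p_chart_upper_limit(p_ref, n, k=3.0):
    """Upper control limit of a fraction-defective (p) chart for samples of size n."""
    return p_ref + k * math.sqrt(p_ref * (1 - p_ref) / n)


def detect_anomaly_pattern(labels, ref_size=200, det_size=50, alpha=0.01):
    """Return the index of the first label at which an alarm is raised, or None.

    `labels` is a sequence of 0 (predicted normal) and 1 (predicted outlier).
    The first `ref_size` labels form a fixed reference window (an assumption);
    a detection window of `det_size` labels slides over the rest.
    """
    ref = labels[:ref_size]
    p_ref = sum(ref) / ref_size
    ucl = p_chart_upper_limit(p_ref, det_size)
    for end in range(ref_size + det_size, len(labels) + 1):
        det = labels[end - det_size:end]
        _, p_value = two_proportion_z_test(ref, det)
        # Raise an alarm if either approach flags the detection window.
        if p_value < alpha or sum(det) / det_size > ucl:
            return end - 1
    return None


# Synthetic label stream: 5% outliers for 400 steps, then 50% outliers.
normal_part = [1 if i % 20 == 0 else 0 for i in range(400)]
anomalous_part = [1 if i % 2 == 0 else 0 for i in range(200)]
alarm_index = detect_anomaly_pattern(normal_part + anomalous_part)
print("alarm raised at label index:", alarm_index)
```

Combining the two approaches with a logical OR is a design choice made only to keep the sketch short; they could equally be run and evaluated separately.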
Pages: 8
Related papers
50 in total
  • [41] Real-time anomaly detection using parallelized intrusion detection architecture for streaming data
    Chellammal, P.
    Malarchelvi, Sheba Kezia P. D.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (04)
  • [42] UNSUPERVISED STREAMING ANOMALY DETECTION FOR INSTRUMENTED INFRASTRUCTURE
    Hoeltgebaum, Henrique
    Adams, Niall
    Lau, F. Din-Houn
    ANNALS OF APPLIED STATISTICS, 2021, 15 (03) : 1101 - 1125
  • [43] Online Influence Forest for Streaming Anomaly Detection
    Martins, Ines
    Resende, Joao S.
    Gama, Joao
    ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023, 2023, 13876 : 274 - 286
  • [44] Revisiting streaming anomaly detection: benchmark and evaluation
    Cao, Yang
    Ma, Yixiao
    Zhu, Ye
    Ting, Kai Ming
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (01)
  • [45] Waterloss detection in streaming water meter data using wavelet change-point anomaly detection
    Christodoulou, S. E.
    Kourti, E.
    Agathokleous, A.
    Christodoulou, C.
    EWORK AND EBUSINESS IN ARCHITECTURE, ENGINEERING AND CONSTRUCTION, 2016 : 613 - 618
  • [46] Application of Chebyshev's Inequality in Online Anomaly Detection Driven by Streaming PMU Data
    Wang, Pengyuan
    Wang, Honggang
    Hart, Philip
    Guo, Xian
    Mahapatra, Kaveri
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020
  • [47] An Efficient Modelling of Oversampling with Optimal Deep Learning Enabled Anomaly Detection in Streaming Data
    Rajakumar, R.
    Devi, S. Sathiya
    CHINA COMMUNICATIONS, 2024, 21 (05) : 249 - 260
  • [48] Dynamic Micro-cluster-Based Streaming Data Clustering Method for Anomaly Detection
    Wang, Xiaolan
    Ahmed, Md Manjur
    Husen, Mohd Nizam
    Tao, Hai
    Zhao, Qian
    SOFT COMPUTING IN DATA SCIENCE, SCDS 2023, 2023, 1771 : 61 - 75
  • [49] Online and Unsupervised Anomaly Detection for Streaming Data Using an Array of Sliding Windows and PDDs
    Zhang, Lingyu
    Zhao, Jiabao
    Li, Wei
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (04) : 2284 - 2289
  • [50] Online Anomaly Detection with Streaming Data based on Fine-grained Feature Forecasting
    Liu, Keying
    Mao, Wentao
    Shi, Huadong
    Wu, Chao
    Chen, Jiaxian
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021 : 454 - 459