Anomaly pattern detection for streaming data

被引:19
|
作者
Kim, Taegong [1 ]
Park, Cheong Hee [1 ]
机构
[1] Chungnam Natl Univ, Dept Comp Sci & Engn, 220 Gung Dong, Daejeon 305763, South Korea
基金
新加坡国家研究基金会;
关键词
Anomaly pattern detection; Control charts; Hypothesis testing; Outlier detection; Streaming data; OUTLIER;
D O I
10.1016/j.eswa.2020.113252
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection aims to find a data sample that is different from most other data samples. While outlier detection is performed at an individual instance level, anomaly pattern detection on a data stream means detecting a time point where a pattern to generate data is unusual and significantly different from normal behavior. Beyond predicting the outlierness of individual data samples in a data stream, it can be very useful to detect the occurrence of anomalous patterns in real time. In this paper, we propose a method for anomaly pattern detection in a data stream based on binary classification for outliers and statistical tests on a data stream of binary labels of normal or an outlier. In the first step, by applying the clustering-based outlier detection method, we transform a data stream into a stream of binary values where 0 stands for the prediction as normal data and 1 for outlier prediction. In the second step, anomaly pattern detection is performed on a stream of binary values by two approaches: testing the equality of parameters in the binomial distributions of a reference window and a detection window, and using control charts for the fraction defective. The proposed method obtained the average true positive detection rate of 94% in simulated experiments using real and artificial data. The experimental results also show that anomaly pattern occurrence can be detected reliably even when outlier detection performance is relatively low. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Solution Pattern for Anomaly Detection in Financial Data Streams
    Zakrzewicz, Maciej
    Wojciechowski, Marek
    Glawinski, Pawel
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2019, 2019, 1064 : 77 - 84
  • [32] Real-time Anomaly Detection and Classification in Streaming PMU Data
    Hannon, Christopher
    Deka, Deepjyoti
    Jin, Dong
    Vuffray, Marc
    Lokhov, Andrey Y.
    2021 IEEE MADRID POWERTECH, 2021,
  • [33] Continual AE-WGAN for Unsupervised Anomaly Detection in Streaming Data
    Seghair, Tarek
    Besbes, Olfa
    Abdellatif, Takoua
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT I, ACIIDS 2024, 2024, 14795 : 3 - 14
  • [34] Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
    Zhang, Minghu
    Guo, Jianwen
    Li, Xin
    Jin, Rui
    SENSORS, 2020, 20 (19) : 1 - 17
  • [35] Anomaly detection in streaming environmental sensor data: A data-driven modeling approach
    Hill, David J.
    Minsker, Barbara S.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2010, 25 (09) : 1014 - 1022
  • [36] PROBABILISTIC REASONING FOR STREAMING ANOMALY DETECTION
    Carter, Kevin M.
    Streilein, William W.
    2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 377 - 380
  • [37] SAND: Streaming Subsequence Anomaly Detection
    Boniol, Paul
    Paparrizos, John
    Palpanas, Themis
    Franklin, Michael J.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (10): : 1717 - 1729
  • [38] Anomaly Detection over Streaming Data: Indy500 Case Study
    Widanage, Chathura
    Li, Jiayu
    Tyagi, Sahil
    Teja, Ravi
    Peng, Bo
    Kamburugamuve, Supun
    Koskey, Jon
    Baum, Dan
    Smith, Dayle M.
    Qiu, Judy
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 9 - 16
  • [39] Anomaly detection in streaming data with gaussian process based stochastic differential equations
    Glyn-Davies, Alex
    Girolami, Mark
    PATTERN RECOGNITION LETTERS, 2022, 153 : 254 - 260
  • [40] Anomaly detection for streaming data based on grid-clustering and Gaussian distribution
    Zou, Beiji
    Yang, Kangkang
    Kui, Xiaoyan
    Liu, Jun
    Liao, Shenghui
    Zhao, Wei
    INFORMATION SCIENCES, 2023, 638