Anomaly pattern detection for streaming data

Cited by: 19
Authors
Kim, Taegong [1]
Park, Cheong Hee [1]
Affiliations
[1] Chungnam Natl Univ, Dept Comp Sci & Engn, 220 Gung Dong, Daejeon 305763, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Anomaly pattern detection; Control charts; Hypothesis testing; Outlier detection; Streaming data; OUTLIER;
DOI
10.1016/j.eswa.2020.113252
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Outlier detection aims to find data samples that differ from most other samples. Whereas outlier detection operates at the level of individual instances, anomaly pattern detection on a data stream means detecting a time point at which the pattern generating the data becomes unusual and significantly different from normal behavior. Beyond predicting the outlierness of individual samples in a data stream, it can be very useful to detect the occurrence of anomalous patterns in real time. In this paper, we propose a method for anomaly pattern detection in a data stream that combines binary classification of outliers with statistical tests on the resulting stream of binary labels (normal or outlier). In the first step, a clustering-based outlier detection method transforms the data stream into a stream of binary values, where 0 denotes a prediction of normal data and 1 a prediction of an outlier. In the second step, anomaly pattern detection is performed on the stream of binary values by two approaches: testing the equality of the parameters of the binomial distributions of a reference window and a detection window, and using control charts for the fraction defective. The proposed method achieved an average true positive detection rate of 94% in simulated experiments using real and artificial data. The experimental results also show that the occurrence of an anomaly pattern can be detected reliably even when outlier detection performance is relatively low. © 2020 Elsevier Ltd. All rights reserved.
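The second step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes the first step has already produced a list of 0/1 labels, uses a pooled two-proportion z-test as one common way to test the equality of two binomial parameters, and uses a standard p-chart upper control limit for the fraction defective. The function names, the window sizes, the one-sided critical value 2.326 (about alpha = 0.01), and the 3-sigma limit are all assumptions chosen for illustration.

```python
import math
from collections import deque


def two_proportion_z(x_ref, n_ref, x_det, n_det):
    """Pooled z statistic for H0: p_ref == p_det (hypothetical helper)."""
    p_pool = (x_ref + x_det) / (n_ref + n_det)
    var = p_pool * (1.0 - p_pool) * (1.0 / n_ref + 1.0 / n_det)
    if var == 0.0:
        # Both windows are all 0s or all 1s: no evidence of a difference.
        return 0.0
    return (x_det / n_det - x_ref / n_ref) / math.sqrt(var)


def first_alarms(labels, ref_size=200, det_size=50, z_crit=2.326, k=3.0):
    """Scan a 0/1 label stream and return the first alarm index per approach.

    The first `ref_size` labels form a fixed reference window (an assumption);
    the detection window slides over the most recent `det_size` labels.
    """
    x_ref = sum(labels[:ref_size])
    p_ref = x_ref / ref_size
    # p-chart upper control limit for the fraction of outlier labels.
    ucl = p_ref + k * math.sqrt(p_ref * (1.0 - p_ref) / det_size)

    alarms = {"z_test": None, "p_chart": None}
    window = deque(maxlen=det_size)
    for t in range(ref_size, len(labels)):
        window.append(labels[t])
        if len(window) < det_size:
            continue  # wait until the detection window is full
        x_det = sum(window)
        if alarms["z_test"] is None:
            if two_proportion_z(x_ref, ref_size, x_det, det_size) > z_crit:
                alarms["z_test"] = t
        if alarms["p_chart"] is None and x_det / det_size > ucl:
            alarms["p_chart"] = t
    return alarms


# Synthetic label stream: 5% outlier labels, then 50% from index 300 onward.
normal = [1 if i % 20 == 0 else 0 for i in range(300)]
anomalous = [1 if i % 2 == 0 else 0 for i in range(100)]
alarms = first_alarms(normal + anomalous)
```

In this sketch both approaches raise an alarm only on a rise in the outlier fraction, which matches the one-sided use of the test; a two-sided variant would compare the absolute value of the statistic against a different critical value.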
Pages: 8
Related papers
50 in total
  • [1] Autonomous anomaly detection for streaming data
    Basheer, Muhammad Yunus Iqbal
    Ali, Azliza Mohd
    Hamid, Nurzeatul Hamimah Abdul
    Ariffin, Muhammad Azizi Mohd
    Osman, Rozianawaty
    Nordin, Sharifalillah
    Gu, Xiaowei
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [2] Weakly Supervised Anomaly Detection for Streaming Data
    Zhang, Wei
    Challis, Chris
    23RD IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2021), 2021 : 31 - 34
  • [3] Evolving anomaly detection for network streaming data
    Wang Xiaolan
    Ahmed, Md Manjur
    Husen, Mohd Nizam
    Qian, Zhao
    Belhaouari, Samir Brahim
    INFORMATION SCIENCES, 2022, 608 : 757 - 777
  • [4] Anomaly Detection in Streaming Nonstationary Temporal Data
    Talagala, Priyanga Dilini
    Hyndman, Rob J.
    Smith-Miles, Kate
    Kandanaarachchi, Sevvandi
    Munoz, Mario A.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (01) : 13 - 27
  • [5] ANOMALY PATTERN DETECTION IN STREAMING DATA BASED ON THE TRANSFORMATION TO MULTIPLE BINARY-VALUED DATA STREAMS
    Kim, Taegong
    Park, Cheong Hee
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2022, 12 (01) : 19 - 27
  • [6] Anomaly Pattern Detection on Data Streams
    Park, Cheong Hee
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018 : 689 - 692
  • [7] Anomaly detection in streaming data: A comparison and evaluation study
    Vazquez, Felix Iglesias
    Hartl, Alexander
    Zseby, Tanja
    Zimek, Arthur
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [8] Integrated Clustering and Anomaly Detection (INCAD) for Streaming Data
    Guggilam, Sreelekha
    Zaidi, Syed Mohammed Arshad
    Chandola, Varun
    Patra, Abani K.
    COMPUTATIONAL SCIENCE - ICCS 2019, PT IV, 2019, 11539 : 45 - 59
  • [9] Anomaly Detection in Resource Constrained Environments With Streaming Data
    Jain, Prarthi
    Jain, Seemandhar
    Zaiane, Osmar R.
    Srivastava, Abhishek
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (03) : 649 - 659
  • [10] Correlated Anomaly Detection from Large Streaming Data
    Chen, Zheng
    Yu, Xinli
    Ling, Yuan
    Song, Bo
    Quan, Wei
    Hu, Xiaohua
    Yan, Erjia
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018 : 982 - 992