FAAD:an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream

被引:0
|
作者
Bin LI [1 ]
Yi-jie WANG [1 ]
Dong-sheng YANG [2 ]
Yong-mou LI [1 ]
Xing-kong MA [1 ]
机构
[1] Science and Technology on Parallel and Distributed Processing Laboratory, College of Computer,National University of Defense Technology
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Data stream; Multi-dimensional sequence; Anomaly detection; Concept drift; Feature selection;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Recently, sequence anomaly detection has been widely used in many fields. Sequence data in these fields are usually multi-dimensional over the data stream. It is a challenge to design an anomaly detection method for a multi-dimensional sequence over the data stream to satisfy the requirements of accuracy and high speed. It is because:(1) Redundant dimensions in sequence data and large state space lead to a poor ability for sequence modeling;(2) Anomaly detection cannot adapt to the high-speed nature of the data stream, especially when concept drift occurs, and it will reduce the detection rate. On one hand, most existing methods of sequence anomaly detection focus on the single-dimension sequence. On the other hand, some studies concerning multi-dimensional sequence concentrate mainly on the static database rather than the data stream. To improve the performance of anomaly detection for a multi-dimensional sequence over the data stream, we propose a novel unsupervised fast and accurate anomaly detection(FAAD) method which includes three algorithms. First, a method called "information calculation and minimum spanning tree cluster" is adopted to reduce redundant dimensions. Second, to speed up model construction and ensure the detection rate for the sequence over the data stream, we propose a method called"random sampling and subsequence partitioning based on the index probabilistic suffix tree." Last, the method called "anomaly buffer based on model dynamic adjustment" dramatically reduces the effects of concept drift in the data stream. FAAD is implemented on the streaming platform Storm to detect multi-dimensional log audit data.Compared with the existing anomaly detection methods, FAAD has a good performance in detection rate and speed without being affected by concept drift.
引用
收藏
页码:388 / 404
页数:17
相关论文
共 50 条
  • [21] Unsupervised Anomaly Detection in Stream Data with Online Evolving Spiking Neural Networks
    Maciag, Piotr S.
    Kryszkiewicz, Marzena
    Bembenik, Robert
    Lobo, Jesus L.
    Del Ser, Javier
    NEURAL NETWORKS, 2021, 139 : 118 - 139
  • [22] Stream cube: An architecture for multi-dimensional analysis of data streams
    Han, JW
    Chen, YX
    Dong, GZ
    Pei, H
    Wah, BW
    Wang, JY
    Cai, YD
    DISTRIBUTED AND PARALLEL DATABASES, 2005, 18 (02) : 173 - 197
  • [23] Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams
    Jiawei Han
    Yixin Chen
    Guozhu Dong
    Jian Pei
    Benjamin W. Wah
    Jianyong Wang
    Y. Dora Cai
    Distributed and Parallel Databases, 2005, 18 : 173 - 197
  • [24] Unsupervised Anomaly Detection in Spatio-Temporal Stream Network Sensor Data
    Santos-Fernandez, Edgar
    Ver Hoef, Jay M.
    Peterson, Erin E.
    Mcgree, James
    Villa, Cesar A.
    Leigh, Catherine
    Turner, Ryan
    Roberts, Cameron
    Mengersen, Kerrie
    WATER RESOURCES RESEARCH, 2024, 60 (11)
  • [25] Fast and Adaptive Indexing of Multi-Dimensional Observational Data
    Wang, Sheng
    Maier, David
    Ooi, Beng Chin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (14): : 1683 - 1694
  • [27] Statistical Change Detection for Multi-Dimensional Data
    Song, Xiuyao
    Wu, Mingxi
    Jermaine, Christopher
    Ranka, Sanjay
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 667 - 676
  • [28] Fast Anomaly Detection based on Data Stream in Network Intrusion Detection System
    Yang, Yihong
    Xu, Xiaolong
    Wang, Lina
    Zhong, Weiyi
    Yan, Chao
    Qi, Lianyong
    PROCEEDINGS OF ACM TURING AWARD CELEBRATION CONFERENCE, ACM TURC 2021, 2021, : 87 - 91
  • [30] A new data normalization method for unsupervised anomaly intrusion detection
    Longzheng CAIJian CHENYun KETao CHENZhigang LI Engineering and Commerce CollegeSouthCentral University for NationalitiesWuhan China Guangdong Institute of Science and TechnologyZhuhai China
    Journal of Zhejiang University-Science C(Computers & Electronics), 2010, 11 (10) : 778 - 784