FAAD:an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream

被引:0
|
作者
Bin LI [1 ]
Yi-jie WANG [1 ]
Dong-sheng YANG [2 ]
Yong-mou LI [1 ]
Xing-kong MA [1 ]
机构
[1] Science and Technology on Parallel and Distributed Processing Laboratory, College of Computer,National University of Defense Technology
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Data stream; Multi-dimensional sequence; Anomaly detection; Concept drift; Feature selection;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Recently, sequence anomaly detection has been widely used in many fields. Sequence data in these fields are usually multi-dimensional over the data stream. It is a challenge to design an anomaly detection method for a multi-dimensional sequence over the data stream to satisfy the requirements of accuracy and high speed. It is because:(1) Redundant dimensions in sequence data and large state space lead to a poor ability for sequence modeling;(2) Anomaly detection cannot adapt to the high-speed nature of the data stream, especially when concept drift occurs, and it will reduce the detection rate. On one hand, most existing methods of sequence anomaly detection focus on the single-dimension sequence. On the other hand, some studies concerning multi-dimensional sequence concentrate mainly on the static database rather than the data stream. To improve the performance of anomaly detection for a multi-dimensional sequence over the data stream, we propose a novel unsupervised fast and accurate anomaly detection(FAAD) method which includes three algorithms. First, a method called "information calculation and minimum spanning tree cluster" is adopted to reduce redundant dimensions. Second, to speed up model construction and ensure the detection rate for the sequence over the data stream, we propose a method called"random sampling and subsequence partitioning based on the index probabilistic suffix tree." Last, the method called "anomaly buffer based on model dynamic adjustment" dramatically reduces the effects of concept drift in the data stream. FAAD is implemented on the streaming platform Storm to detect multi-dimensional log audit data.Compared with the existing anomaly detection methods, FAAD has a good performance in detection rate and speed without being affected by concept drift.
引用
收藏
页码:388 / 404
页数:17
相关论文
共 50 条
  • [1] FAAD: an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream
    Bin Li
    Yi-jie Wang
    Dong-sheng Yang
    Yong-mou Li
    Xing-kong Ma
    Frontiers of Information Technology & Electronic Engineering, 2019, 20 : 388 - 404
  • [2] FAAD: an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream
    Li, Bin
    Wang, Yi-jie
    Yang, Dong-sheng
    Li, Yong-mou
    Ma, Xing-kong
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2019, 20 (03) : 388 - 404
  • [3] A C-SVM based Anomaly Detection Method for Multi-dimensional Sequence over Data Stream
    Bao, Han
    Wang, Yijie
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 948 - 955
  • [4] A Variable Markovian based Outlier Detection Method for Multi-dimensional Sequence over Data Stream
    Yang, Dongsheng
    Wang, Yijie
    Li, Yongmou
    Ma, Xingkong
    2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 183 - 188
  • [5] Fast Anomaly Detection in Multiple Multi-Dimensional Data Streams
    Sun, Hongyu
    He, Qiang
    Liao, Kewen
    Sellis, Timos
    Guo, Longkun
    Zhang, Xuyun
    Shen, Jun
    Chen, Feifei
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1218 - 1223
  • [6] Fusion Learning Based Unsupervised Anomaly Detection for Multi-Dimensional Time Series
    Zhou X.
    Wang Y.
    Xu H.
    Liu M.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (03): : 496 - 508
  • [7] A multi-dimensional wavelet-based anomaly detection method
    Wu, Shuyan
    Li, Xiaoge
    Zhang, Bin
    Qin, Donghong
    ICIC Express Letters, 2015, 9 (12): : 3393 - 3399
  • [8] A New Anomaly Detection Method Based on Multi-dimensional Condition Monitoring Data for Aircraft Engine
    Chen, Shaowei
    Wu, Meng
    Zhao, Shuai
    Wen, Pengfei
    Huang, Dengshan
    Wang, Yan
    2019 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2019,
  • [9] Design of anomaly detection research model for multi-dimensional temporal data
    Wang, Shuai
    Jiang, Yan
    2024 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND INTELLIGENT SYSTEMS ENGINEERING, MLISE 2024, 2024, : 263 - 266
  • [10] Fast and private multi-dimensional range search over encrypted data
    Kermanshahi, Shabnam Kasra
    Steinfeld, Ron
    Yi, Xun
    Liu, Joseph K.
    Nepal, Surya
    Lou, Junwei
    INFORMATION SCIENCES, 2024, 652