A C-SVM based Anomaly Detection Method for Multi-dimensional Sequence over Data Stream

Authors
Bao, Han [1]
Wang, Yijie [1]
Affiliations
[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Coll Comp, Changsha 410073, Hunan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data stream; Concept drift; Anomaly detection; Multi-dimensional sequence; Feature selection; C-SVM; QUERIES
DOI
10.1109/ICPADS.2016.125
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Anomaly detection over multi-dimensional data streams has recently attracted considerable attention in fields such as networking, finance, and aerospace. In many cases an anomaly consists of a sequence of multi-dimensional data, and such anomalies must be detected accurately and efficiently over the data stream. Existing online anomaly detection methods focus only on single-dimensional sequences, while current studies of multi-dimensional sequences concentrate mainly on static databases. Anomaly detection for multi-dimensional sequences over data streams is much more difficult because of the complexity of multi-dimensional sequence processing, the dynamic nature of data streams, and the imbalance between normal and abnormal data. To address these challenges, we propose ADMS, an anomaly detection method for multi-dimensional sequences over data streams based on a cost-sensitive support vector machine (C-SVM). First, to improve accuracy and efficiency, ADMS transforms multi-dimensional sequences into feature vectors in a lossless way and prunes the worthless features of these vectors. Then ADMS detects abnormal sequences over the dynamically imbalanced data stream by testing these vectors online with the C-SVM. Experiments show that the false negative rate (FNR) of ADMS is lower than 5%, the false positive rate (FPR) is lower than 7%, and pruning worthless features improves throughput by 42%. In addition, ADMS performs well when concept drift occurs in the data stream.
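The abstract gives no implementation details, so the following is only a minimal sketch of the ideas it names, not the authors' ADMS method. It assumes that a window of multi-dimensional records can be turned into a feature vector by plain concatenation, that abnormal sequences are labelled +1 and normal ones -1, and that cost sensitivity takes the usual form of a hinge loss with separate per-class penalties. The function names `flatten_window`, `cost_sensitive_hinge`, and `fnr_fpr` are hypothetical and chosen for illustration.

```python
from typing import List, Sequence, Tuple


def flatten_window(window: Sequence[Sequence[float]]) -> List[float]:
    """Concatenate the records of one window into a single feature vector.

    Assumption: concatenation stands in for the paper's lossless transform,
    whose actual definition is not given in the abstract.
    """
    return [value for record in window for value in record]


def cost_sensitive_hinge(
    w: Sequence[float],
    b: float,
    X: Sequence[Sequence[float]],
    y: Sequence[int],
    cost_pos: float,
    cost_neg: float,
) -> float:
    """Linear SVM objective with per-class misclassification costs.

    0.5 * ||w||^2 + sum_i C(y_i) * max(0, 1 - y_i * (w . x_i + b)),
    where C(+1) = cost_pos (abnormal) and C(-1) = cost_neg (normal).
    """
    total = 0.5 * sum(wi * wi for wi in w)
    for x, label in zip(X, y):
        score = sum(wi * xi for wi, xi in zip(w, x)) + b
        cost = cost_pos if label == 1 else cost_neg
        total += cost * max(0.0, 1.0 - label * score)
    return total


def fnr_fpr(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (false negative rate, false positive rate) for labels in {+1, -1}."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == -1)
    fp = sum(1 for t, p in pairs if t == -1 and p == 1)
    tn = sum(1 for t, p in pairs if t == -1 and p == -1)
    fnr = fn / (fn + tp) if (fn + tp) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return fnr, fpr
```

Setting `cost_pos` larger than `cost_neg` penalises a missed abnormal sequence more heavily than a false alarm, which is the standard way a cost-sensitive SVM compensates for abnormal data being much rarer than normal data.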
Pages: 948-955
Page count: 8