Online Outlier Detection for Data Streams

被引:0
|
作者
Sadik, Shiblee [1 ]
Gruenwald, Le [1 ]
机构
[1] Univ Oklahoma, Norman, OK 73019 USA
关键词
Knowledge Discovery; Data Mining; Stream Databases;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection is a well established area of statistics but most of the existing outlier detection techniques are designed for applications where the entire dataset is available for random access. A typical outlier detection technique constructs a standard data distribution or model and identifies the deviated data points from the model as outliers. Evidently these techniques are not suitable for online data streams where the entire dataset, due to its unbounded volume, is not available for random access. Moreover, the data distribution in data streams change over time which challenges the existing outlier detection techniques that assume a constant standard data distribution for the entire dataset. In addition, data streams are characterized by uncertainty which imposes further complexity. In this paper we propose an adaptive, online outlier detection technique addressing the aforementioned characteristics of data streams, called Adaptive Outlier Detection for Data Streams (A-ODDS), which identifies outliers with respect to all the received data points as well as temporally close data points. The temporally close data points are selected based on time and change of data distribution. We also present an efficient and online implementation of the technique and a performance study showing the superiority of A-ODDS over existing techniques in terms of accuracy and execution time on a real-life dataset collected from meteorological applications.
引用
收藏
页码:88 / 96
页数:9
相关论文
共 50 条
  • [1] Outlier Detection on Uncertain Data Streams
    Zhu B.
    Zhong Y.
    Wang X.
    Bai M.
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2020, 47 (02): : 134 - 140
  • [2] Online Outlier Detection of Energy Data Streams using Incremental and Kernel PCA Algorithms
    Deng, Jeremiah D.
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 390 - 397
  • [3] Adaptive Threshold for Outlier Detection on Data Streams
    Clark, James P.
    Liu, Zhen
    Japkowicz, Nathalie
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 41 - 49
  • [4] A Survey of Outlier Detection Algorithms for Data Streams
    Tamboli, Jinita
    Shukla, Madhu
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3535 - 3540
  • [5] Outlier and anomaly pattern detection on data streams
    Cheong Hee Park
    The Journal of Supercomputing, 2019, 75 : 6118 - 6128
  • [6] Outlier and anomaly pattern detection on data streams
    Park, Cheong Hee
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (09): : 6118 - 6128
  • [7] Attribute Outlier Detection over Data Streams
    Cao, Hui
    Zhou, Yongluan
    Shou, Lidan
    Chen, Gang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 216 - +
  • [8] Trajectory Outlier Detection on Trajectory Data Streams
    Cao, Keyan
    Liu, Yefan
    Meng, Gongjie
    Liu, Haoli
    Miao, Anchen
    Xu, Jingke
    IEEE Access, 2020, 8 : 34187 - 34196
  • [9] Trajectory Outlier Detection on Trajectory Data Streams
    Cao, Keyan
    Liu, Yefan
    Meng, Gongjie
    Liu, Haoli
    Miao, Anchen
    Xu, Jingke
    IEEE ACCESS, 2020, 8 : 34187 - 34196
  • [10] Outlier detection over data streams: Survey
    Brahmi Z.
    Souiden I.
    International Journal of Business Intelligence and Data Mining, 2021, 19 (04) : 481 - 507