Efficient density and cluster based incremental outlier detection in data streams

被引:45
|
作者
Degirmenci, Ali [1 ]
Karal, Omer [1 ]
机构
[1] Ankara Yildirim Beyazit Univ, Ayvali Mah 150,Sok Etlik Kecioren, Ankara, Turkey
关键词
LOF; DBSCAN; Outlier detection; Core KNN; Incremental learning; Data stream;
D O I
10.1016/j.ins.2022.06.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel, parameter-free, incremental local density and cluster-based outlier factor (iLDCBOF) method is presented that unifies incremental versions of local outlier factor (LOF) and density-based spatial clustering of applications with noise (DBSCAN) to detect outliers efficiently in data streams. The iLDCBOF has many advanced advantages compared to previously reported iLOF-based studies: (1) it is based on a newly developed core k-nearest neighbor (CkNN) concept to reliably and scalably detect outliers from data streams and prevent the clustering of outliers; 2) it uses a newly-developed algorithm that automatically adjusts the value of the k (number of neighbors) parameter for different real-time applications; and 3) it uses the Mahalanobis distance metric, so its performance is not affected even for large amounts of data. The iLDCBOF method is well suited for different data stream applications because it requires no distribution assumptions, it is parameterless (determined automatically), and it is easy to implement. ROC-AUC and statistical test analysis results from extensive experiments performed on 16 different real world datasets showed that the iLDCBOF method significantly outperformed benchmark methods.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:901 / 920
页数:20
相关论文
共 50 条
  • [1] Incremental local outlier detection for data streams
    Pokrajac, Dragojub
    Lazarevic, Aleksandar
    Latecki, Longin Jan
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 504 - 515
  • [2] TADILOF: Time Aware Density-Based Incremental Local Outlier Detection in Data Streams
    Huang, Jen-Wei
    Zhong, Meng-Xun
    Jaysawal, Bijay Prasad
    SENSORS, 2020, 20 (20) : 1 - 25
  • [3] Robust Incremental Outlier Detection Approach Based on a New Metric in Data Streams
    Degirmenci, Ali
    Karal, Omer
    IEEE ACCESS, 2021, 9 : 160347 - 160360
  • [4] INCREMENTAL PRINCIPAL COMPONENT ANALYSIS BASED OUTLIER DETECTION METHODS FOR SPATIOTEMPORAL DATA STREAMS
    Bhushan, Alka
    Sharker, Monir H.
    Karimi, Hassan A.
    ISPRS INTERNATIONAL WORKSHOP ON SPATIOTEMPORAL COMPUTING, 2015, : 67 - 71
  • [5] Improved incremental local outlier detection for data streams based on the landmark window model
    Aihua Li
    Weijia Xu
    Zhidong Liu
    Yong Shi
    Knowledge and Information Systems, 2021, 63 : 2129 - 2155
  • [6] Improved incremental local outlier detection for data streams based on the landmark window model
    Li, Aihua
    Xu, Weijia
    Liu, Zhidong
    Shi, Yong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (08) : 2129 - 2155
  • [7] A Fast and Efficient Local Outlier Detection in Data Streams
    Yang, Xing
    Zhou, Wenli
    Shu, Nanfei
    Zhang, Hao
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 111 - 116
  • [8] Outlier detection based on cluster outlier factor and mutual density
    Zhang Z.
    Zhu M.
    Qiu J.
    Liu C.
    Zhang D.
    Qi J.
    International Journal of Intelligent Information and Database Systems, 2019, 12 (1-2) : 91 - 108
  • [9] Outlier detection based on cluster outlier factor and mutual density
    Zhang Z.
    Qiu J.
    Liu C.
    Zhu M.
    Zhang D.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (09): : 2314 - 2323
  • [10] Fast Memory Efficient Local Outlier Detection in Data Streams
    Salehi, Mahsa
    Leckie, Christopher
    Bezdek, James C.
    Vaithianathan, Tharshan
    Zhang, Xuyun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3246 - 3260