Improved outlier detection and interpretation method for DPC clustering algorithm

被引:0
|
作者
Zhou, Yu [1 ]
Xia, Hao [1 ]
Pei, Zexuan [1 ]
机构
[1] School of Electrical Engineering, North China University of Water Resources and Electric Power, Zhengzhou,450045, China
关键词
Anomaly detection - Clustering algorithms - Nearest neighbor search;
D O I
10.11918/202305067
中图分类号
学科分类号
摘要
To address the limitatios of global outlier detection methods in detecting local outliers and the performance degradation of local anomaly factors in the presence of a large number of local outliers, this paper proposes an outlier detection and interpretation method based on an improved fast search and discovery density peak clustering algorithm (KDPC), utilizing k-nearest neighbor (KNN) and kernel density estimation (KDE) methods. This method enables simultaneous analysis of both global and local data points. Firstly, the local density of data points is calculated using the k-nearest neighbor and kernel density estimation methods instead of the local density based on the truncation distance in the traditional DPC algorithm. Secondly, the sum of the k-nearest neighbor distances of the data points is used as the global outlier and the cluster density as well as the local outliers of the data points are calculated by the KDPC clustering algorithm. Finally, the global and local outliers of the data points are multiplied as the final anomaly score. The Top-n data points with the highest anomaly score is selected as the outlier, and the global and local outliers are interpreted by constructing a global-local outlier decision diagram. Experiments were conducted using both artificial and UCI datasets and our method was compared with 10 commonly used outlier detection methods. The results show that our method achieves high detection accuracy and performance for both global and local outliers. Moreover, the AUC performance is minimally affected by the k-value. Additionally, our method is also used to analyze NBA player data, further demonstrating its practicality and effectiveness. © 2024 Harbin Institute of Technology. All rights reserved.
引用
收藏
页码:68 / 85
相关论文
共 50 条
  • [41] INGC: Graph Clustering & Outlier Detection Algorithm Using Label Propagation
    Bhatia, Vandana
    Saneja, Bharti
    Rani, Rinkle
    2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 68 - 74
  • [42] Multibeam outlier detection by clustering and topological persistence approach, ToMATo algorithm
    Michel, Marceau
    Le Deunf, Julian
    Debese, Nathalie
    Bazinet, Laurene
    Dejoie, Loic
    OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [43] Obstacle Clustering and Outlier Detection
    Shi, Yong
    PROCEEDINGS OF THE 48TH ANNUAL SOUTHEAST REGIONAL CONFERENCE (ACM SE 10), 2010, : 423 - 424
  • [44] Statistical hierarchical clustering algorithm for outlier detection in evolving data streams
    Krleza, Dalibor
    Vrdoljak, Boris
    Brcic, Mario
    MACHINE LEARNING, 2021, 110 (01) : 139 - 184
  • [45] A parallel point cloud clustering algorithm for subset segmentation and outlier detection
    Teutsch, Christian
    Trostmann, Erik
    Berndt, Dirk
    VIDEOMETRICS, RANGE IMAGING, AND APPLICATIONS XI, 2011, 8085
  • [46] An Improved KNN Based Outlier Detection Algorithm for Large Datasets
    Wang, Qian
    Zheng, Min
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 585 - 592
  • [47] On Integrated Clustering and Outlier Detection
    Ott, Lionel
    Pang, Linsey
    Ramos, Fabio
    Chawla, Sanjay
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [48] A Novel Clustering Algorithm Based on DPC and PSO
    Cai, Jianghui
    Wei, Huiling
    Yang, Haifeng
    Zhao, Xujun
    IEEE ACCESS, 2020, 8 : 88200 - 88214
  • [49] Application of improved Clustering Algorithm in Intrusion Detection
    Dai Kunyu
    Hu Bin
    2ND INTERNATIONAL SYMPOSIUM ON COMPUTER NETWORK AND MULTIMEDIA TECHNOLOGY (CNMT 2010), VOLS 1 AND 2, 2010, : 621 - 624
  • [50] Research on DPC Clustering Multi-Objective Detection Method for Disorderly Grasping
    Chen, Zeyu
    Li, Xiangguo
    Cao, Dengfeng
    Zhu, Denglin
    Computer Engineering and Applications, 2023, 59 (23) : 175 - 182