Outlier detection method based on improved DPC algorithm and centrifugal factor

Cited by: 0
Authors
Xia, Hao [1]
Zhou, Yu [1]
Li, Jiguang [2]
Yue, Xuezhen [1]
Li, Jichun [3]
Affiliations
[1] North China Univ Water Resources & Elect Power, Sch Elect Engn, Zhengzhou 450045, Peoples R China
[2] Univ Salford, Sch Sci Engn & Environm, Salford M5 4NT, England
[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne NE4 5TG, England
Funding
National Natural Science Foundation of China
Keywords
Outlier detection; Clustering algorithm; Centrifugal factor; k-nearest neighbor; Local density; Local kernel density
DOI
10.1016/j.ins.2024.121255
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Outlier detection aims to identify data points that deviate significantly from normal patterns. However, existing outlier detection methods based on k-nearest neighbors often struggle with challenges such as increasing outlier counts and outliers that form clusters. Selecting an appropriate nearest-neighbor parameter is a further challenge, since researchers commonly have to evaluate detection accuracy across many values of k. To improve the accuracy and robustness of outlier detection, this paper proposes an outlier detection method based on an improved DPC algorithm and a centrifugal factor. First, we use k-nearest neighbors, k-reciprocal nearest neighbors, and a Gaussian kernel function to determine the local density of samples, which addresses cases where the DPC algorithm struggles to identify cluster centers in sparse clusters. Next, to reduce the computational complexity of the DPC algorithm, we screen the samples by their mutual nearest neighbor counts and select cluster centers from the screened samples. Non-central points are then assigned to clusters using k-nearest neighbors, k-reciprocal nearest neighbors, and reverse k-nearest neighbors. The centrifugal factor, whose magnitude reflects the outlier degree of a sample, is computed as the ratio of the local kernel density at the cluster center to the local kernel density of the sample. Finally, we propose a method for choosing the nearest-neighbor parameter k. To evaluate the outlier detection performance of the proposed algorithm comprehensively, we conduct experiments on 12 complex synthetic datasets and 25 public real-world datasets, comparing the results with 12 state-of-the-art outlier detection methods.
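The ratio described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no formulas, so the density estimate (mean Gaussian kernel value over the k nearest neighbors), the bandwidth (mean k-NN distance over the dataset), and the function names `knn_kernel_density` and `centrifugal_scores` are all assumptions made for illustration. The sketch also takes the cluster labels and cluster centers as given, and omits the k-reciprocal neighbors, the center screening step, and the selection of k.

```python
import numpy as np

def knn_kernel_density(X, k):
    """Assumed density: mean Gaussian kernel value over each point's k nearest neighbors."""
    # Pairwise Euclidean distances between all samples.
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Sort each row; column 0 is the point itself (distance 0), so skip it.
    knn_dist = np.sort(dist, axis=1)[:, 1:k + 1]
    # Assumed bandwidth: mean k-NN distance over the whole dataset.
    h = knn_dist.mean()
    return np.exp(-(knn_dist ** 2) / (2 * h ** 2)).mean(axis=1)

def centrifugal_scores(X, k, labels, centers):
    """Density at each sample's cluster center divided by the sample's own density."""
    rho = knn_kernel_density(X, k)
    # The small constant guards against division by zero for very isolated samples.
    return rho[centers[labels]] / (rho + 1e-12)

# Toy data: one dense cluster plus a single distant point (index 100).
rng = np.random.default_rng(0)
cluster = rng.normal(0.0, 0.5, size=(100, 2))
X = np.vstack([cluster, np.array([[6.0, 6.0]])])

k = 5
rho = knn_kernel_density(X, k)
# Assumption for this toy case: one cluster whose center is the densest sample.
centers = np.array([np.argmax(rho)])
labels = np.zeros(len(X), dtype=int)

scores = centrifugal_scores(X, k, labels, centers)
top = int(np.argmax(scores))
print("most outlying sample index:", top)
```

In this sketch a sample's score is close to 1 when its density is similar to that of its cluster center and grows as the sample becomes more isolated, which matches the abstract's statement that the magnitude of the centrifugal factor reflects the outlier degree.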
Pages: 33