Outlier detection method based on improved DPC algorithm and centrifugal factor

Authors
Xia, Hao [1]
Zhou, Yu [1]
Li, Jiguang [2]
Yue, Xuezhen [1]
Li, Jichun [3]
Affiliations
[1] North China Univ Water Resources & Elect Power, Sch Elect Engn, Zhengzhou 450045, Peoples R China
[2] Univ Salford, Sch Sci Engn & Environm, Salford M5 4NT, England
[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne NE4 5TG, England
Funding
National Natural Science Foundation of China
Keywords
Outlier detection; Clustering algorithm; Centrifugal factor; k-nearest neighbor; Local density; Local kernel density
DOI
10.1016/j.ins.2024.121255
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Outlier detection aims to identify data anomalies that deviate significantly from normal patterns. However, existing outlier detection methods based on k-nearest neighbors often struggle with challenges such as increasing outlier counts and cluster formation issues. Selecting an appropriate nearest-neighbor parameter is a further challenge, as researchers commonly evaluate detection accuracy across various k values. To enhance the accuracy and robustness of outlier detection, in this paper we propose an outlier detection method based on an improved DPC algorithm and a centrifugal factor. First, we use k-nearest neighbors, k-reciprocal nearest neighbors, and a Gaussian kernel function to determine the local density of samples, particularly addressing scenarios where the DPC algorithm struggles to identify cluster centers in sparse clusters. Next, to reduce the DPC algorithm's computational complexity, we screen the samples based on mutual nearest neighbor counts and select cluster centers accordingly. Non-central points are then assigned to clusters using k-nearest neighbors, k-reciprocal nearest neighbors, and reverse k-nearest neighbors. The centrifugal factor, whose magnitude reflects the outlier degree of a sample, is then computed as the ratio of the local kernel density at the cluster center to that of the sample. Finally, we propose a method for choosing the nearest-neighbor parameter k. To comprehensively evaluate the outlier detection performance of the proposed algorithm, we conduct experiments on 12 complex synthetic datasets and 25 public real-world datasets, comparing the results with 12 state-of-the-art outlier detection methods.
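The density-ratio idea behind the centrifugal factor can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract gives no formulas, so the density definition, the bandwidth `sigma`, the single-cluster assumption, the choice of the densest point as cluster center, and the function names `knn_kernel_density` and `centrifugal_factor` are all assumptions made here for illustration only.

```python
import numpy as np


def knn_kernel_density(X, k):
    """Illustrative local density: sum of Gaussian kernel values over each
    point's k nearest neighbors. The bandwidth choice is an assumption."""
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # After sorting each row, column 0 is the point itself (distance 0),
    # so columns 1..k hold the distances to the k nearest neighbors.
    knn_dist = np.sort(dist, axis=1)[:, 1:k + 1]
    sigma = knn_dist.mean()  # assumed global bandwidth
    return np.exp(-(knn_dist ** 2) / (2.0 * sigma ** 2)).sum(axis=1)


def centrifugal_factor(X, k):
    """Ratio of the cluster center's density to each sample's density.
    Assumes one cluster whose center is the densest point."""
    density = knn_kernel_density(X, k)
    center = int(np.argmax(density))
    # A small constant guards against division by an underflowed density.
    return density[center] / (density + 1e-12)


# Toy data: one tight cluster of 50 points plus one distant point (index 50).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.5, size=(50, 2)), [[6.0, 6.0]]])
scores = centrifugal_factor(X, k=5)
most_outlying = int(np.argmax(scores))
```

Under these assumptions, a larger score means the sample sits in a region much sparser than the cluster center, so the distant point receives the largest score. The method described in the abstract additionally uses k-reciprocal and reverse nearest neighbors and handles multiple clusters, which this sketch omits.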
Pages: 33