A novel outlier detecting algorithm based on the outlier turning points

被引:13
|
作者
Huang, Jinlong [1 ]
Cheng, Dongdong [1 ]
Zhang, Sulan [1 ]
机构
[1] Yangtze Normal Univ, Coll Big Data & Intelligent Engn, Chongqing 408100, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Outlier detection; Local outliers; Outlier clusters; Outlier turning points; NATURAL NEIGHBORHOOD GRAPH; CLUSTER;
D O I
10.1016/j.eswa.2023.120799
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection is one of the hot research in data mining, and has been applied to various fields such as network anomaly detection, image abnormal analysis, etc. In recent years, many outlier detecting algorithms have been proposed. However, these outlier detecting algorithms are hard to effectively detect global outliers, local outliers and outlier clusters at the same time. In this paper, we propose a novel outlier detecting algorithm based on the following ideas: (1) the density distribution should not be changed dramatically on local area; (2) the ratio of the number of k nearest neighbors and the number of reverse k nearest neighbors should not be very big. Based on above ideas, the proposed algorithm aims to find outlier turning points, then regards all outlier turning points and its sparse neighbors as outliers. Furthermore, the proposed algorithm use natural neighbors to obtain the neighborhood parameter k adaptively. The formal analysis and extensive experiments demonstrate that this technique can detect global outliers, local outliers and outlier clusters without neighborhood parameter k.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] A novel subspace outlier detection method by entropy-based clustering algorithm
    Zheng Zuo
    Ziqiang Li
    Pengsen Cheng
    Jian Zhao
    Scientific Reports, 13
  • [42] A novel subspace outlier detection method by entropy-based clustering algorithm
    Zuo, Zheng
    Li, Ziqiang
    Cheng, Pengsen
    Zhao, Jian
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [43] A novel photovoltaic array outlier cleaning algorithm based on moving standard deviation
    Shi M.
    Yin R.
    Hu A.
    Wu J.
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2020, 48 (06): : 108 - 114
  • [44] Detecting Outlier Samples in Microarray Data
    Shieh, Albert D.
    Hung, Yeung Sam
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
  • [45] FCI-Outlier: An Efficient Frequent Closed Itemset-Based Outlier Detecting Approach on Data Stream
    Hao, Shangbo
    Cai, Saihua
    Sun, Ruizhi
    Li, Sicong
    COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2018, 2019, 917 : 176 - 187
  • [46] Global High Dimension Outlier Algorithm for Efficient Clustering & Outlier Detection
    Nigam, Nidhi
    Saxena, Tripti
    Richhariya, Vineet
    2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
  • [47] Analytical method for detecting outlier evaluators
    Wu, Yujie
    Curhan, Sharon
    Rosner, Bernard
    Curhan, Gary
    Wang, Molin
    BMC MEDICAL RESEARCH METHODOLOGY, 2023, 23 (01)
  • [48] Analytical method for detecting outlier evaluators
    Yujie Wu
    Sharon Curhan
    Bernard Rosner
    Gary Curhan
    Molin Wang
    BMC Medical Research Methodology, 23
  • [49] Hypothesis testing for detecting outlier evaluators
    Xu, Li
    Zucker, David M.
    Wang, Molin
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024, 20 (02): : 419 - 431
  • [50] Outlier detection based on the distribution of distances between data points
    Saltenis, V
    INFORMATICA, 2004, 15 (03) : 399 - 410