RDF: A density-based Outlier detection method using vertical data representation

被引:24
|
作者
Ren, DM [1 ]
Wang, BY [1 ]
Perrizo, W [1 ]
机构
[1] N Dakota State Univ, Dept Comp Sci, Fargo, ND 58105 USA
关键词
D O I
10.1109/ICDM.2004.10010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection can lead to discovering unexpected and interesting knowledge, which is critical important to some areas such as monitoring of criminal activities in electronic commerce, credit card fraud, etc. In this paper, we developed an efficient density-based outlier detection method for large datasets. Our contributions are: a) We introduce a relative density factor (RDF); b) Based on RDF, we propose an RDF-based outlier detection method which can efficiently prune the data points which are deep in clusters, and detect outliers only within the remaining small subset of the data; c) The performance of our method is further improved by means of a vertical data representation, P-trees. We tested our method with NHL and NBA data. Our method shows an order of magnitude speed improvement compared to the contemporary approaches.
引用
收藏
页码:503 / 506
页数:4
相关论文
共 50 条
  • [1] A local density-based outlier detection method for high dimension data
    Abdulghafoor, Shahad Adel
    Mohamed, Lekaa Ali
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01): : 1683 - 1699
  • [2] A novel density-based outlier detection method using key attributes
    Qi, Zhuang
    Chen, Xiaming
    INTELLIGENT DATA ANALYSIS, 2022, 26 (06) : 1431 - 1449
  • [3] Density-Based Local Outlier Detection on Uncertain Data
    Cao, Keyan
    Shi, Lingxu
    Wang, Guoren
    Han, Donghong
    Bai, Mei
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 67 - 71
  • [4] Gait Recognition Using Density-Based Outlier Detection and Location Fusion by Sparse Representation
    Tu, Bin-bin
    Xu, Hui
    Xie, Xie
    2019 INTERNATIONAL CONFERENCE ON ENERGY, POWER, ENVIRONMENT AND COMPUTER APPLICATION (ICEPECA 2019), 2019, 334 : 346 - 350
  • [5] Density-Based Evolutionary Outlier Detection
    Banerjee, Amit
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION COMPANION (GECCO'12), 2012, : 651 - 652
  • [6] A distributed density-based outlier detection algorithm on big data
    Mei, Lin
    Zhang, Fengli
    International Journal of Network Security, 2020, 22 (05): : 775 - 781
  • [7] A Fast Randomized Method for Local Density-Based Outlier Detection in High Dimensional Data
    Minh Quoc Nguyen
    Omiecinski, Edward
    Mark, Leo
    Irani, Danesh
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, 2010, 6263 : 215 - 226
  • [8] An Efficient Density-Based Local Outlier Detection Approach for Scattered Data
    Su, Shubin
    Xiao, Limin
    Ruan, Li
    Gu, Fei
    Li, Shupan
    Wang, Zhaokai
    Xu, Rongbin
    IEEE ACCESS, 2019, 7 : 1006 - 1020
  • [9] An efficient algorithm for distributed density-based outlier detection on big data
    Bai, Mei
    Wang, Xite
    Xin, Junchang
    Wang, Guoren
    NEUROCOMPUTING, 2016, 181 : 19 - 28
  • [10] Relative Density-Based Outlier Detection Algorithm
    Ning, Jin
    Chen, Leiting
    Chen, Junwei
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 227 - 231