A Modified Density Based Outlier Mining Algorithm for Large Dataset

被引:0
|
作者
Yang, Peng [1 ]
Huang, Biao [1 ]
机构
[1] Chongqing Univ Arts & Sci, Chongqing 402160, Peoples R China
关键词
D O I
10.1109/FITME.2008.106
中图分类号
F [经济];
学科分类号
02 ;
摘要
Outlier mining is to discover the objects with exceptional behavior in dataset. It is an important challenge from the knowledge discovery standpoint and attracts much attention recently. The density based outlier mining algorithm is an effective approach to detect anomalous points. However, such algorithms usually need amounts of computations. In this paper, we propose a modified density based detection algorithm which utilizes the data partitioning method. Furthermore, it presents some speedup strategies such as the introduction of module information to avoid large number of unnecessary computations while finding outliers. The algorithm is applied on both synthetic and real datasets and the experimental results show that it is efficient for outlier detection in large dataset.
引用
收藏
页码:37 / 40
页数:4
相关论文
共 50 条
  • [1] An Efficient Outlier Mining Algorithm for Large Dataset
    Yang, Peng
    Huang, Biao
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT, INNOVATION MANAGEMENT AND INDUSTRIAL ENGINEERING, VOL 1, 2008, : 199 - 202
  • [2] KNN Based Outlier Detection Algorithm in Large Dataset
    Yang, Peng
    Huang, Biao
    2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 611 - 613
  • [3] Density Based Outlier Mining Algorithm with Application to Intrusion Detection
    Yang, Peng
    Huang, Biao
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 489 - 492
  • [4] Optimisation of outlier data mining algorithm for large datasets based on unit
    Li Y.
    Zhou X.
    International Journal of Information Technology and Management, 2023, 22 (3-4) : 175 - 189
  • [5] A compress-based association mining algorithm for large dataset
    Ashrafi, MZ
    Taniar, D
    Smith, K
    COMPUTATIONAL SCIENCE - ICCS 2003, PT IV, PROCEEDINGS, 2003, 2660 : 978 - 987
  • [6] An Outlier Mining Algorithm Based on Dissimilarity
    Zhou, Ming-jian
    Chen, Xue-jiao
    2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL II, 2011, : 289 - 291
  • [7] An Outlier Mining Algorithm Based on Dissimilarity
    Zhou, Ming-jian
    Chen, Xue-jiao
    2011 INTERNATIONAL CONFERENCE OF ENVIRONMENTAL SCIENCE AND ENGINEERING, VOL 12, PT B, 2012, 12 : 810 - 814
  • [8] Outlier mining algorithm based on data-partitioning and density-grid
    Xing, Chang Zheng
    Tang, Cheng Long
    Wei, Ke
    2012 INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND COMMUNICATION TECHNOLOGY (ICCECT 2012), 2012, : 880 - 884
  • [9] An Outlier Mining Algorithm Based on Attribute Entropy
    Zhou, Ming-Jian
    Tao, Jun-cai
    2011 2ND INTERNATIONAL CONFERENCE ON CHALLENGES IN ENVIRONMENTAL SCIENCE AND COMPUTER ENGINEERING (CESCE 2011), VOL 11, PT A, 2011, 11 : 132 - 138
  • [10] An outlier mining algorithm based on gini index
    1600, ICIC Express Letters Office, Tokai University, Kumamoto Campus, 9-1-1, Toroku, Kumamoto, 862-8652, Japan (07):