A Novel Unsupervised Outlier Detection Algorithm Based on Mutual Information and Reduced Spectral Clustering

被引:1
|
作者
Huang, Yuehua [1 ,2 ,3 ]
Liu, Wenfen [1 ,2 ,3 ]
Li, Song [1 ,2 ]
Guo, Ying [1 ,2 ]
Chen, Wen [1 ,2 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Software Engn, Guilin 541004, Peoples R China
[3] Guangxi Key Lab Cryptog & Informat Secur, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
outlier detection; unsupervised; mutual information; spectral clustering;
D O I
10.3390/electronics12234864
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection is an essential research field in data mining, especially in the areas of network security, credit card fraud detection, industrial flaw detection, etc. The existing outlier detection algorithms, which can be divided into supervised methods and unsupervised methods, suffer from the following problems: curse of dimensionality, lack of labeled data, and hyperparameter tuning. To address these issues, we present a novel unsupervised outlier detection algorithm based on mutual information and reduced spectral clustering, called MISC-OD (Mutual Information and reduced Spectral Clustering-Outlier Detection). MISC-OD first constructs a mutual information matrix between features, then, by applying reduced spectral clustering, divides the feature set into subsets, utilizing the LOF (Local Outlier Factor) for outlier detection within each subset and combining the outlier scores found within each subset. Finally, it outputs the outlier score. Our contributions are as follows: (1) we propose a novel outlier detection method called MISC-OD with high interpretability and scalability; (2) numerous experiments on 18 benchmark datasets demonstrate the superior performance of the MISC-OD algorithm compared with eight state-of-the-art baselines in terms of ROC (receiver operating characteristic) and AP (average precision).
引用
收藏
页数:12
相关论文
共 50 条
  • [21] An Outlier Detection Algorithm Based on Probability Density Clustering
    Wang, Wei
    Ren, Yongjian
    Zhou, Renjie
    Zhang, Jilin
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2023, 19 (01) : 22 - 22
  • [22] A Novel k-means Algorithm for Clustering and Outlier Detection
    Zhou, Yinghua
    Yu, Hong
    Cai, Xuemei
    2009 SECOND INTERNATIONAL CONFERENCE ON FUTURE INFORMATION TECHNOLOGY AND MANAGEMENT ENGINEERING, FITME 2009, 2009, : 476 - +
  • [23] Outlier detection algorithm based on fast density peak clustering outlier factor
    Zhang, Zhongping
    Li, Sen
    Liu, Weixiong
    Liu, Shuxia
    Tongxin Xuebao/Journal on Communications, 2022, 43 (10): : 186 - 195
  • [24] A mutual information based face clustering algorithm for movies
    Vretos, N.
    Solachidis, V.
    Pitas, I.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1013 - +
  • [25] On normalization and algorithm selection for unsupervised outlier detection
    Kandanaarachchi, Sevvandi
    Munoz, Mario A.
    Hyndman, Rob J.
    Smith-Miles, Kate
    DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (02) : 309 - 354
  • [26] A Novel Cluster Based Algorithm for Outlier Detection
    Mahajan, Manish
    Kumar, Santosh
    Pant, Bhasker
    COMPUTING, COMMUNICATION AND SIGNAL PROCESSING, ICCASP 2018, 2019, 810 : 449 - 456
  • [27] On normalization and algorithm selection for unsupervised outlier detection
    Sevvandi Kandanaarachchi
    Mario A. Muñoz
    Rob J. Hyndman
    Kate Smith-Miles
    Data Mining and Knowledge Discovery, 2020, 34 : 309 - 354
  • [28] An Outlier Detection Algorithm for Data Streams Based on Fuzzy Clustering
    Su, Xiaoke
    Qin, Yuming
    Wan, Renxia
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, 2008, : 109 - 112
  • [29] An Outlier Detection Algorithm in Wireless Sensor Network Based on Clustering
    Niu, Kun
    Zhao, Fang
    Qiao, Xiuquan
    2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 433 - 437
  • [30] A New Outlier Detection Algorithm Based on Fast Density Peak Clustering Outlier Factor
    Zhang, ZhongPing
    Li, Sen
    Liu, WeiXiong
    Wang, Ying
    Li, Daisy Xin
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2023, 19 (02)