A Novel Unsupervised Outlier Detection Algorithm Based on Mutual Information and Reduced Spectral Clustering

Cited by: 1
Authors
Huang, Yuehua [1, 2, 3]
Liu, Wenfen [1, 2, 3]
Li, Song [1, 2]
Guo, Ying [1, 2]
Chen, Wen [1, 2]
Affiliations
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Software Engn, Guilin 541004, Peoples R China
[3] Guangxi Key Lab Cryptog & Informat Secur, Guilin 541004, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
outlier detection; unsupervised; mutual information; spectral clustering
DOI
10.3390/electronics12234864
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Outlier detection is an essential research field in data mining, with applications in network security, credit card fraud detection, industrial flaw detection, and other areas. Existing outlier detection algorithms, which can be divided into supervised and unsupervised methods, suffer from the following problems: the curse of dimensionality, a lack of labeled data, and the need for hyperparameter tuning. To address these issues, we present a novel unsupervised outlier detection algorithm based on mutual information and reduced spectral clustering, called MISC-OD (Mutual Information and reduced Spectral Clustering-Outlier Detection). MISC-OD first constructs a mutual information matrix between features. It then applies reduced spectral clustering to divide the feature set into subsets, uses the LOF (Local Outlier Factor) to detect outliers within each subset, and combines the per-subset outlier scores. Finally, it outputs the combined outlier score. Our contributions are as follows: (1) we propose a novel outlier detection method, MISC-OD, with high interpretability and scalability; (2) extensive experiments on 18 benchmark datasets demonstrate the superior performance of MISC-OD compared with eight state-of-the-art baselines in terms of ROC (receiver operating characteristic) and AP (average precision).
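The pipeline described in the abstract can be sketched roughly as follows. This is an illustrative approximation, not the authors' implementation: the abstract does not specify how mutual information is estimated, what "reduced" spectral clustering involves, or how per-subset scores are combined. The sketch assumes histogram-binned mutual information, scikit-learn's standard `SpectralClustering` in place of the reduced variant, and a plain average of LOF scores. The function names `feature_mi_matrix` and `misc_od_sketch` and the parameters `n_bins`, `n_groups`, and `n_neighbors` are hypothetical.

```python
import numpy as np
from sklearn.cluster import SpectralClustering
from sklearn.metrics import mutual_info_score
from sklearn.neighbors import LocalOutlierFactor


def feature_mi_matrix(X, n_bins=10):
    """Pairwise mutual information between features (assumed: equal-width binning)."""
    n_features = X.shape[1]
    binned = np.empty(X.shape, dtype=int)
    for j in range(n_features):
        edges = np.histogram_bin_edges(X[:, j], bins=n_bins)
        # Interior edges only, so bin ids run from 0 to n_bins - 1.
        binned[:, j] = np.digitize(X[:, j], edges[1:-1])
    mi = np.zeros((n_features, n_features))
    for i in range(n_features):
        for j in range(i, n_features):
            mi[i, j] = mi[j, i] = mutual_info_score(binned[:, i], binned[:, j])
    return mi


def misc_od_sketch(X, n_groups=2, n_neighbors=20, seed=0):
    """Return one outlier score per row of X (higher means more outlying)."""
    # Step 1: mutual information matrix between features, used as an affinity.
    # The small offset keeps the affinity graph connected (an assumption).
    affinity = feature_mi_matrix(X) + 1e-6

    # Step 2: group features. Standard spectral clustering is swapped in here
    # for the paper's reduced spectral clustering.
    labels = SpectralClustering(
        n_clusters=n_groups, affinity="precomputed", random_state=seed
    ).fit_predict(affinity)

    # Step 3: run LOF on each feature subset and average the scores
    # (averaging is an assumed combination rule).
    groups = np.unique(labels)
    scores = np.zeros(X.shape[0])
    for g in groups:
        lof = LocalOutlierFactor(n_neighbors=n_neighbors)
        lof.fit(X[:, labels == g])
        scores += -lof.negative_outlier_factor_
    return scores / len(groups)
```

Calling `misc_od_sketch(X)` on an array of shape `(n_samples, n_features)` returns an array of `n_samples` scores. Splitting the features before running LOF keeps each distance computation in a lower-dimensional space, which is how the abstract's approach is meant to ease the curse of dimensionality.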
Pages: 12