A Novel Unsupervised Outlier Detection Algorithm Based on Mutual Information and Reduced Spectral Clustering

被引:1
|
作者
Huang, Yuehua [1 ,2 ,3 ]
Liu, Wenfen [1 ,2 ,3 ]
Li, Song [1 ,2 ]
Guo, Ying [1 ,2 ]
Chen, Wen [1 ,2 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Software Engn, Guilin 541004, Peoples R China
[3] Guangxi Key Lab Cryptog & Informat Secur, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
outlier detection; unsupervised; mutual information; spectral clustering;
D O I
10.3390/electronics12234864
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outlier detection is an essential research field in data mining, especially in the areas of network security, credit card fraud detection, industrial flaw detection, etc. The existing outlier detection algorithms, which can be divided into supervised methods and unsupervised methods, suffer from the following problems: curse of dimensionality, lack of labeled data, and hyperparameter tuning. To address these issues, we present a novel unsupervised outlier detection algorithm based on mutual information and reduced spectral clustering, called MISC-OD (Mutual Information and reduced Spectral Clustering-Outlier Detection). MISC-OD first constructs a mutual information matrix between features, then, by applying reduced spectral clustering, divides the feature set into subsets, utilizing the LOF (Local Outlier Factor) for outlier detection within each subset and combining the outlier scores found within each subset. Finally, it outputs the outlier score. Our contributions are as follows: (1) we propose a novel outlier detection method called MISC-OD with high interpretability and scalability; (2) numerous experiments on 18 benchmark datasets demonstrate the superior performance of the MISC-OD algorithm compared with eight state-of-the-art baselines in terms of ROC (receiver operating characteristic) and AP (average precision).
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Fast outlier mining algorithm in uncertain data set based on spectral clustering
    Kang Y.-L.
    Feng L.-L.
    Zhang J.-A.
    Cao S.-E.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2023, 53 (04): : 1181 - 1186
  • [42] A Practical Algorithm for Distributed Clustering and Outlier Detection
    Chen, Jiecao
    Azer, Erfan Sadeqi
    Zhang, Qin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [43] Automatic PAM clustering algorithm for outlier detection
    Zhu, Q. (qszhu@cqu.edu.cn), 1600, Academy Publisher (07):
  • [44] An unsupervised clustering algorithm for intrusion detection
    Guan, Y
    Ghorbani, AA
    Belacel, N
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 616 - 617
  • [45] Unsupervised Outlier detection algorithm based on k-NN and fuzzy logic
    Renan Velazquez-Gonzalez, J.
    Peregrina-Barreto, Hayde
    Fco Martinez-Trinidad, Jose
    2019 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC 2019), 2019,
  • [46] A Hybrid Outlier Detection Algorithm Based On Partitioning Clustering And Density Measures
    Rizk, Hamada
    Elgokhy, Sherin
    Sarhan, Amany
    2015 TENTH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2015, : 175 - 181
  • [47] Unsupervised Outlier Detection: A Meta-Learning Algorithm Based on Feature Selection
    Papastefanopoulos, Vasilis
    Linardatos, Pantelis
    Kotsiantis, Sotiris
    ELECTRONICS, 2021, 10 (18)
  • [48] Outlier detection based on cluster outlier factor and mutual density
    Zhang Z.
    Zhu M.
    Qiu J.
    Liu C.
    Zhang D.
    Qi J.
    International Journal of Intelligent Information and Database Systems, 2019, 12 (1-2) : 91 - 108
  • [49] A spectral clustering algorithm based on intuitionistic fuzzy information
    Xu, Dawei
    Xu, Zeshui
    Liu, Shousheng
    Zhao, Hua
    KNOWLEDGE-BASED SYSTEMS, 2013, 53 : 20 - 26
  • [50] Outlier detection based on cluster outlier factor and mutual density
    Zhang Z.
    Qiu J.
    Liu C.
    Zhu M.
    Zhang D.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (09): : 2314 - 2323