VAGA: Towards Accurate and Interpretable Outlier Detection Based on Variational Auto-Encoder and Genetic Algorithm for High-Dimensional Data

被引:4
|
作者
Li, Jiamu [1 ]
Zhang, Ji [2 ,3 ]
Wang, Jian [1 ]
Zhu, Youwen [1 ]
Bah, Mohamed Jaward [3 ]
Yang, Gaoming [4 ]
Gan, Yuquan [5 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
[2] Univ Southern Queensland, Toowoomba, Qld, Australia
[3] Zhejiang Lab, Hangzhou, Peoples R China
[4] Anhui Univ Sci & Technol, Huainan, Peoples R China
[5] Xian Univ Posts & Telecommun, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
outlier detection; variational autoencoder; genetic algorithm;
D O I
10.1109/BigData52589.2021.9671744
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The curse of dimensionality in high-dimensional data makes it difficult to capture the abnormality of data points in full data space. To deal with this problem, we propose an outlier detection model based on Variational Autoencoder and Genetic Algorithm for subspace outlier analysis of high-dimensional data (VAGA). The proposed VAGA model constructs a variational autoencoder (VAE) to preliminarily detect outliers. Then the genetic algorithm (GA) is used to search the abnormal subspace of the outliers obtained by the VAE layer to provide a basis for subspace outlier analysis. The subsequent clustering of the abnormal subspaces help filter out the false positives which are fed back to the VAE layer to adjust network weights. The comparative experiments performed on three public benchmark datasets show that the outlier detection results of the proposed VAGA model are highly interpretable and have better accuracy performance than the state-of-the-art outlier detection methods.
引用
收藏
页码:5956 / 5958
页数:3
相关论文
共 50 条
  • [21] Efficient Outlier Detection for High-Dimensional Data
    Liu, Huawen
    Li, Xuelong
    Li, Jiuyong
    Zhang, Shichao
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (12): : 2451 - 2461
  • [22] Network Intrusion Detection Based on Supervised Adversarial Variational Auto-Encoder With Regularization
    Yang, Yanqing
    Zheng, Kangfeng
    Wu, Bin
    Yang, Yixian
    Wang, Xiujuan
    IEEE ACCESS, 2020, 8 : 42169 - 42184
  • [23] Network Intrusion Detection Based on Semi-supervised Variational Auto-Encoder
    Osada, Genki
    Omote, Kazumasa
    Nishide, Takashi
    COMPUTER SECURITY - ESORICS 2017, PT II, 2017, 10493 : 344 - 361
  • [24] Unsupervised Anomaly Detection Using Variational Auto-Encoder based Feature Extraction
    Yao, Rong
    Liu, Chongdang
    Zhang, Linxuan
    Peng, Peng
    2019 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2019,
  • [25] Manifold-based denoising, outlier detection, and dimension reduction algorithm for high-dimensional data
    Zhao, Guanghua
    Yang, Tao
    Fu, Dongmei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (11) : 3923 - 3942
  • [26] An effective and efficient algorithm for high-dimensional outlier detection
    Charu C. Aggarwal
    Philip S. Yu
    The VLDB Journal, 2005, 14 : 211 - 221
  • [27] An effective and efficient algorithm for high-dimensional outlier detection
    Aggarwal, CC
    Yu, PS
    VLDB JOURNAL, 2005, 14 (02): : 211 - 221
  • [28] Manifold-based denoising, outlier detection, and dimension reduction algorithm for high-dimensional data
    Guanghua Zhao
    Tao Yang
    Dongmei Fu
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3923 - 3942
  • [29] Computationally Efficient Outlier Detection for High-Dimensional Data Using the MDP Algorithm
    Tsagris, Michail
    Papadakis, Manos
    Alenazi, Abdulaziz
    Alzeley, Omar
    COMPUTATION, 2024, 12 (09)
  • [30] Producing accurate interpretable clusters from high-dimensional data
    Greene, D
    Cunningham, P
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 486 - 494