VAGA: Towards Accurate and Interpretable Outlier Detection Based on Variational Auto-Encoder and Genetic Algorithm for High-Dimensional Data

Cited by: 4
Authors
Li, Jiamu [1 ]
Zhang, Ji [2 ,3 ]
Wang, Jian [1 ]
Zhu, Youwen [1 ]
Bah, Mohamed Jaward [3 ]
Yang, Gaoming [4 ]
Gan, Yuquan [5 ]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
[2] Univ Southern Queensland, Toowoomba, Qld, Australia
[3] Zhejiang Lab, Hangzhou, Peoples R China
[4] Anhui Univ Sci & Technol, Huainan, Peoples R China
[5] Xian Univ Posts & Telecommun, Xian, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
outlier detection; variational autoencoder; genetic algorithm;
DOI
10.1109/BigData52589.2021.9671744
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The curse of dimensionality makes it difficult to capture the abnormality of data points in the full data space of high-dimensional data. To deal with this problem, we propose VAGA, an outlier detection model based on a Variational Autoencoder and a Genetic Algorithm for subspace outlier analysis of high-dimensional data. The proposed VAGA model constructs a variational autoencoder (VAE) to preliminarily detect outliers. A genetic algorithm (GA) then searches for the abnormal subspaces of the outliers obtained by the VAE layer, which provides a basis for subspace outlier analysis. The subsequent clustering of the abnormal subspaces helps filter out false positives, which are fed back to the VAE layer to adjust the network weights. Comparative experiments on three public benchmark datasets show that the outlier detection results of the proposed VAGA model are highly interpretable and more accurate than those of state-of-the-art outlier detection methods.
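The abstract does not specify how the GA encodes or scores a subspace, so the following is only a minimal sketch of the subspace-search step under stated assumptions: each candidate subspace is a boolean feature mask, and the fitness is a hypothetical score (the sum of per-feature absolute z-scores minus a threshold `tau`) rather than the authors' objective. The names `subspace_score`, `ga_abnormal_subspace`, and `tau` are illustrative and do not come from the paper.

```python
import numpy as np

def subspace_score(X, x, mask, tau=3.0):
    """Hypothetical fitness (an assumption, not the paper's objective):
    sum over the selected features of (|z-score of x| - tau)."""
    idx = np.flatnonzero(mask)
    mu = X[:, idx].mean(axis=0)
    sd = X[:, idx].std(axis=0) + 1e-12  # avoid division by zero
    z = (x[idx] - mu) / sd
    return float(np.sum(np.abs(z) - tau))

def ga_abnormal_subspace(X, x, pop_size=30, generations=40, p_mut=0.1, seed=0):
    """Search for the feature mask that maximizes subspace_score for point x."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    pop = rng.random((pop_size, d)) < 0.5  # random boolean masks
    for _ in range(generations):
        fit = np.array([subspace_score(X, x, m) for m in pop])
        # Elitist selection: keep the better half of the population unchanged.
        elite = pop[np.argsort(fit)[::-1][: pop_size // 2]]
        n_children = pop_size - len(elite)
        # Uniform crossover between randomly paired elite parents.
        pa = elite[rng.integers(0, len(elite), n_children)]
        pb = elite[rng.integers(0, len(elite), n_children)]
        children = np.where(rng.random(pa.shape) < 0.5, pa, pb)
        # Bit-flip mutation.
        children ^= rng.random(children.shape) < p_mut
        pop = np.vstack([elite, children])
    fit = np.array([subspace_score(X, x, m) for m in pop])
    return pop[np.argmax(fit)]

# Synthetic usage example: a point that deviates only in features 2 and 5.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 8))
x = np.zeros(8)
x[[2, 5]] = 8.0
mask = ga_abnormal_subspace(X, x)
print(np.flatnonzero(mask))  # indices of the features in the selected subspace
```

In this sketch the returned mask plays the role of the explanation: it names the features in which a point flagged by the first stage looks abnormal. The VAE scoring, the clustering of subspaces, and the feedback to the network weights described in the abstract are not modeled here.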
Pages: 5956-5958
Page count: 3
Related papers
50 in total
  • [1] An Auto-Encoder with Genetic Algorithm for High Dimensional Data: Towards Accurate and Interpretable Outlier Detection
    Li, Jiamu
    Zhang, Ji
    Bah, Mohamed Jaward
    Wang, Jian
    Zhu, Youwen
    Yang, Gaoming
    Li, Lingling
    Zhang, Kexin
    ALGORITHMS, 2022, 15 (11)
  • [2] A trajectory outlier detection method based on variational auto-encoder
    Zhang, Longmei
    Lu, Wei
    Xue, Feng
    Chang, Yanshuo
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (08) : 15075 - 15093
  • [3] Outlier Detection for Power Data Based on Contractive Auto-Encoder
    Lu, Yuan
    Leng, Xiaojie
    Xu, Kang
    Luan, Weiping
    Yang, Wei
    Li, Jing
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION SCIENCE AND SYSTEM, AISS 2019, 2019,
  • [4] Detection Algorithm of the Mimicry Attack based on Variational Auto-Encoder
    Wang, Qunke
    Fang, Lanting
    Zhu, Zhenchao
    Huang, Jie
    51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN-W 2021), 2021, : 114 - 120
  • [5] Outlier Detection for Water Supply Data Based on Joint Auto-Encoder
    Fang, Shu
    Huang, Lei
    Wan, Yi
    Sun, Weize
    Xu, Jingxin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 64 (01) : 541 - 555
  • [6] Outlier detection for water supply data based on joint auto-encoder
    Fang S.
    Huang L.
    Wan Y.
    Sun W.
    Xu J.
    Computers, Materials and Continua, 2020, 64 (01) : 541 - 555
  • [7] Variational autoencoder-based outlier detection for high-dimensional data
    Li, Yongmou
    Wang, Yijie
    Ma, Xingkong
    INTELLIGENT DATA ANALYSIS, 2019, 23 (05) : 991 - 1002
  • [8] Anomaly detection method based on convolutional variational auto-encoder
    Yu X.
    Xu M.
    Wang Y.
    Wang S.
    Hu N.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2021, 42 (05) : 151 - 158
  • [9] Auto-encoder Based for High Spectral Dimensional Data Classification and Visualization
    Zhu, Jiang
    Wu, Lingda
    Hao, Hongxing
    Song, Xiaorui
    Lu, Yi
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 350 - 354
  • [10] High-dimensional data stream outlier detection algorithm based on angle distribution
    Lu, S. (lusheng@cqupt.edu.cn), 1600, Shanghai Jiaotong University (48):