A Hybrid Deep Learning-Based Unsupervised Anomaly Detection in High Dimensional Data

被引:23
|
作者
Muneer, Amgad [1 ,2 ]
Taib, Shakirah Mohd [1 ,2 ]
Fati, Suliman Mohamed [3 ]
Balogun, Abdullateef O. [1 ]
Aziz, Izzatdin Abdul [1 ,2 ]
机构
[1] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32160, Perak, Malaysia
[2] Univ Teknol PETRONAS, Ctr Res Data Sci CERDAS, Seri Iskandar 32610, Perak, Malaysia
[3] Prince Sultan Univ, Informat Syst Dept, Riyadh 11586, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 03期
关键词
Anomaly detection; outlier detection; unsupervised learning; autoencoder; deep learning; hybrid model; OUTLIER DETECTION; MINING OUTLIERS; SUBSPACES;
D O I
10.32604/cmc.2022.021113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection in high dimensional data is a critical research issue with serious implication in the real-world problems. Many issues in this field still unsolved, so several modern anomaly detection methods struggle to maintain adequate accuracy due to the highly descriptive nature of big data. Such a phenomenon is referred to as the "curse of dimensionality" that affects traditional techniques in terms of both accuracy and performance. Thus, this research proposed a hybrid model based on Deep Autoencoder Neural Network (DANN) with five layers to reduce the difference between the input and output. The proposed model was applied to a real-world gas turbine (GT) dataset that contains 87620 columns and 56 rows. During the experiment, two issues have been investigated and solved to enhance the results. The first is the dataset class imbalance, which solved using SMOTE technique. The second issue is the poor performance, which can be solved using one of the optimization algorithms. Several optimization algorithms have been investigated and tested, including stochastic gradient descent (SGD), RMSprop, Adam and Adamax. However, Adamax optimization algorithm showed the best results when employed to train the DANN model. The experimental results show that our proposed model can detect the anomalies by efficiently reducing the high dimensionality of dataset with accuracy of 99.40%, F1-score of 0.9649, Area Under the Curve (AUC) rate of 0.9649, and a minimal loss function during the hybrid model training.
引用
收藏
页码:5363 / 5381
页数:19
相关论文
共 50 条
  • [1] A hybrid deep learning-based unsupervised anomaly detection in high dimensional data
    Muneer A.
    Taib S.M.
    Fati S.M.
    Balogun A.O.
    Aziz I.A.
    Computers, Materials and Continua, 2022, 70 (03): : 6073 - 6088
  • [2] Deep Learning-based Hybrid Model for Efficient Anomaly Detection
    Osamor, Frances
    Wellman, Briana
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 975 - 979
  • [3] A Deep Learning-based Approach to Anomaly Detection with 2-Dimensional Data in Manufacturing
    Maggipinto, Marco
    Beghi, Alessandro
    Susto, Gian Antonio
    2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, : 187 - 192
  • [4] Coupling of unsupervised and supervised deep learning-based approaches for surface anomaly detection
    Racki, Domen
    Tomazevic, Dejan
    Skocaj, Danijel
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03)
  • [5] Unsupervised Deep Learning-based End-to-end Network for Anomaly Detection and Localization
    Olimov, Bekhzod
    Subramanian, Barathi
    Kim, Jeonghong
    2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 444 - 449
  • [6] Deep Learning-based Anomaly Detection for Compressors Using Audio Data
    Mobtahej, Pooyan
    Zhang, Xulong
    Hamidi, Maryam
    Zhang, Jing
    67TH ANNUAL RELIABILITY & MAINTAINABILITY SYMPOSIUM (RAMS 2021), 2021,
  • [7] A data-driven metric learning-based scheme for unsupervised network anomaly detection
    Aliakbarisani, Roya
    Ghasemi, Abdorasoul
    Wu, Shyhtsun Felix
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 73 : 71 - 83
  • [8] A Hybrid Deep Learning-Based Model for Anomaly Detection in Cloud Datacenter Networks
    Garg, Sahil
    Kaur, Kuljeet
    Kumar, Neeraj
    Kaddoum, Georges
    Zomaya, Albert Y.
    Ranjan, Rajiv
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (03): : 924 - 935
  • [9] A Hybrid Deep Learning Framework for Unsupervised Anomaly Detection in Multivariate Spatio-Temporal Data
    Karadayi, Yildiz
    Aydin, Mehmet N.
    Ogrenci, A. Selcuk
    APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [10] High-Dimensional Energy Consumption Anomaly Detection: A Deep Learning-Based Method for Detecting Anomalies
    Pan, Haipeng
    Yin, Zhongqian
    Jiang, Xianzhi
    ENERGIES, 2022, 15 (17)