A Hybrid Deep Learning-Based Unsupervised Anomaly Detection in High Dimensional Data

被引:23
|
作者
Muneer, Amgad [1 ,2 ]
Taib, Shakirah Mohd [1 ,2 ]
Fati, Suliman Mohamed [3 ]
Balogun, Abdullateef O. [1 ]
Aziz, Izzatdin Abdul [1 ,2 ]
机构
[1] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32160, Perak, Malaysia
[2] Univ Teknol PETRONAS, Ctr Res Data Sci CERDAS, Seri Iskandar 32610, Perak, Malaysia
[3] Prince Sultan Univ, Informat Syst Dept, Riyadh 11586, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 03期
关键词
Anomaly detection; outlier detection; unsupervised learning; autoencoder; deep learning; hybrid model; OUTLIER DETECTION; MINING OUTLIERS; SUBSPACES;
D O I
10.32604/cmc.2022.021113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection in high dimensional data is a critical research issue with serious implication in the real-world problems. Many issues in this field still unsolved, so several modern anomaly detection methods struggle to maintain adequate accuracy due to the highly descriptive nature of big data. Such a phenomenon is referred to as the "curse of dimensionality" that affects traditional techniques in terms of both accuracy and performance. Thus, this research proposed a hybrid model based on Deep Autoencoder Neural Network (DANN) with five layers to reduce the difference between the input and output. The proposed model was applied to a real-world gas turbine (GT) dataset that contains 87620 columns and 56 rows. During the experiment, two issues have been investigated and solved to enhance the results. The first is the dataset class imbalance, which solved using SMOTE technique. The second issue is the poor performance, which can be solved using one of the optimization algorithms. Several optimization algorithms have been investigated and tested, including stochastic gradient descent (SGD), RMSprop, Adam and Adamax. However, Adamax optimization algorithm showed the best results when employed to train the DANN model. The experimental results show that our proposed model can detect the anomalies by efficiently reducing the high dimensionality of dataset with accuracy of 99.40%, F1-score of 0.9649, Area Under the Curve (AUC) rate of 0.9649, and a minimal loss function during the hybrid model training.
引用
收藏
页码:5363 / 5381
页数:19
相关论文
共 50 条
  • [31] Impact of log parsing on deep learning-based anomaly detection
    Khan, Zanis Ali
    Shin, Donghwan
    Bianculli, Domenico
    Briand, Lionel C.
    EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (06)
  • [32] Unsupervised Deep Learning-Based Hybrid Beamforming in Massive MISO Systems
    Zhang, Teng
    Dong, Anming
    Zhang, Chuanting
    Yu, Jiguo
    Qiu, Jing
    Li, Sufang
    Zhang, Li
    Zhou, You
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2022), PT II, 2022, 13472 : 3 - 15
  • [33] Unsupervised Learning-based Early Anomaly Detection in AMS Circuits of Automotive SoCs
    Arunachalam, Ayush
    Kizhakkayil, Athulya
    Kundu, Shamik
    Raha, Arnab
    Banerjee, Suvadeep
    Jin, Robert
    Su, Fei
    Basu, Kanad
    2022 IEEE INTERNATIONAL TEST CONFERENCE (ITC), 2022, : 229 - 238
  • [34] Explaining deep learning-based anomaly detection in energy consumption data by focusing on contextually relevant data
    Noorchenarboo, Mohammad
    Grolinger, Katarina
    ENERGY AND BUILDINGS, 2025, 328
  • [35] Machine learning- and deep learning-based anomaly detection in firewalls: a surveyMachine learning- and deep learning-based anomaly detection...H. Dhrir et al.
    Hanen Dhrir
    Maha Charfeddine
    Nesrine Tarhouni
    Habib M. Kammoun
    The Journal of Supercomputing, 81 (6)
  • [36] Anomaly Detection in Health Data Based on Deep Learning
    Han, Ning
    Gao, Sheng
    Li, Jin
    Zhang, Xinming
    Guo, Jun
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 188 - 192
  • [37] Computer vision and deep learning-based data anomaly detection method for structural health monitoring
    Bao, Yuequan
    Tang, Zhiyi
    Li, Hui
    Zhang, Yufeng
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2019, 18 (02): : 401 - 421
  • [38] An automated unsupervised deep learning-based approach for diabetic retinopathy detection
    Naz, Huma
    Nijhawan, Rahul
    Ahuja, Neelu Jyothi
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (12) : 3635 - 3654
  • [39] Anomaly detection in injection molding process data based on unsupervised learning
    Schiffers, Reinhard
    Morik, Katharina
    Struchtrup, Alexander Schulze
    Honysz, Philipp-Jan
    Wortberg, Johannes
    Zeitschrift Kunststofftechnik/Journal of Plastics Technology, 2019, 2019 (05): : 301 - 347
  • [40] Deep Learning-Based Survival Analysis for High-Dimensional Survival Data
    Hao, Lin
    Kim, Juncheol
    Kwon, Sookhee
    Ha, Il Do
    MATHEMATICS, 2021, 9 (11)