Anomaly Detection for Cloud Systems with Dynamic Spatiotemporal Learning

被引:1
|
作者
Yu, Mingguang [1 ,2 ]
Zhang, Xia [1 ,2 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Peoples R China
[2] Neusoft Corp, Shenyang 110179, Peoples R China
来源
关键词
System maintenance; anomaly detection; GCN; LSTM; AIOps;
D O I
10.32604/iasc.2023.038798
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As cloud system architectures evolve continuously, the interac-tions among distributed components in various roles become increasingly complex. This complexity makes it difficult to detect anomalies in cloud systems. The system status can no longer be determined through individual key performance indicators (KPIs) but through joint judgments based on syn-ergistic relationships among distributed components. Furthermore, anomalies in modern cloud systems are usually not sudden crashes but rather grad-ual, chronic, localized failures or quality degradations in a weakly available state. Therefore, accurately modeling cloud systems and mining the hidden system state is crucial. To address this challenge, we propose an anomaly detection method with dynamic spatiotemporal learning (AD-DSTL). AD-DSTL leverages the spatiotemporal dynamics of the system to train an end -to-end deep learning model driven by data from system monitoring to detect underlying anomalous states in complex cloud systems. Unlike previous work that focuses on the KPIs of separate components, AD-DSTL builds a model for the entire system and characterizes its spatiotemporal dynamics based on graph convolutional networks (GCN) and long short-term memory (LSTM). We validated AD-DSTL using four datasets from different backgrounds, and it demonstrated superior robustness compared to other baseline algorithms. Moreover, when raising the target exception level, both the recall and precision of AD-DSTL reached approximately 0.9. Our experimental results demon-strate that AD-DSTL can meet the requirements of anomaly detection for complex cloud systems.
引用
收藏
页码:1787 / 1806
页数:20
相关论文
共 50 条
  • [1] A Spatiotemporal Deep Learning Approach for Unsupervised Anomaly Detection in Cloud Systems
    He, Zilong
    Chen, Pengfei
    Li, Xiaoyun
    Wang, Yongfeng
    Yu, Guangba
    Chen, Cailin
    Li, Xinrui
    Zheng, Zibin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 1705 - 1719
  • [2] Spatiotemporal Representation Learning for Video Anomaly Detection
    Li, Zhaoyan
    Li, Yaoshun
    Gao, Zhisheng
    IEEE ACCESS, 2020, 8 (08): : 25531 - 25542
  • [3] Sparse spatiotemporal feature learning for pipeline anomaly detection
    Ma, King
    Leung, Henry
    PROCEEDINGS OF THE 2019 IEEE 18TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2019), 2019, : 123 - 129
  • [4] Point Cloud Video Anomaly Detection Based on Point Spatiotemporal Autoencoder
    He, Tengjiao
    Wang, Wenguang
    Zeng, Guoqi
    IEEE SENSORS JOURNAL, 2024, 24 (13) : 20884 - 20895
  • [5] Spatiotemporal Real-Time Anomaly Detection for Supercornputing Systems
    Kang, Qiao
    Agrawal, Ankit
    Choudhary, Alok
    Sim, Alex
    Wu, Kesheng
    Kettimuthu, Rajkumar
    Beckman, Peter H.
    Liu, Zhengchun
    Liao, Wei-keng
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4381 - 4389
  • [6] Challenging Anomaly Detection in Complex Dynamic Systems
    Zoppi, Tommaso
    Ceccarelli, Andrea
    Bondavalli, Andrea
    PROCEEDINGS OF 2016 IEEE 35TH SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2016, : 213 - 214
  • [7] Machine Learning Anomaly Detection in Large Systems
    Murphree, Jerry
    2016 IEEE AUTOTESTCON PROCEEDINGS, 2016,
  • [8] Spatiotemporal polynomial graph neural network for anomaly detection of complex systems
    Ma, Meng
    Hua, Xuanhao
    Zhang, Yang
    Zhai, Zhi
    MEASUREMENT, 2024, 235
  • [9] A Pragmatical Approach to Anomaly Detection Evaluation in Edge Cloud Systems
    Skaperas, Sotiris
    Koukist, Georgios
    Kapetanidou, Ioanna Angeliki
    Tsaousis, Vasilis
    Mamatas, Lefteris
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
  • [10] Statistical Learning for Anomaly Detection in Cloud Server Systems: A Multi-Order Markov Chain Framework
    Sha, Wenyao
    Zhu, Yongxin
    Chen, Min
    Huang, Tian
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (02) : 401 - 413