Towards optimization of anomaly detection in DevOps

被引:3
|
作者
Hrusto, Adha [1 ,2 ]
Engstrom, Emelie [1 ]
Runeson, Per [1 ]
机构
[1] Lund Univ, Dept Comp Sci, Box 118, SE-22100 Lund, Sweden
[2] Syst Verificat Sweden AB, Hyllie Stationstorg 31, SE-21532 Malmo, Sweden
关键词
Microservices; DevOps; Anomaly detection; Deep learning;
D O I
10.1016/j.infsof.2023.107241
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: DevOps has recently become a mainstream solution for bridging the gaps between development (Dev) and operations (Ops) enabling cross-functional collaboration. The DevOps concept of continuous monitoring may bring a lot of benefits to development teams such as early detection of run-time errors and various performance anomalies. Objective: We aim to explore deep learning (DL) solutions for detection of anomalous systems behavior based on collected monitoring data that consists of applications' and systems' performance metrics. Moreover, we specifically address a shortage of approaches for evaluating DL models without any ground truth data. Methods: We perform a case study in a real DevOps environment, following the principles of the design science paradigm. The research activities span from practice to theory and from problem to solution domain, including problem conceptualization, solution design, instantiation, and empirical validation. Results: We proposed and implemented a cloud solution for DL model deployment and evaluation empowered by feedback from the development team. The labeled data generated through the feedback was used for evaluation of current and training of new DL models in several iterations. The overall results showed that reconstruction-based models such as autoencoders, are quite robust to any parameter modification and are among the preferred for anomaly detection in multivariate monitoring data. Conclusion: Leveraging raw monitoring data and DL-inspired solutions, DevOps teams may get critical insights into the software and its operation. In our case, this proved to be an efficient way of discovering early signs of production failures.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Towards Anomaly Traffic Detection with Causal Interpretability Methods
    Zeng, Zengri
    Zhao, Baokang
    Liu, Xuhui
    Deng, Xiaoheng
    FRONTIERS OF NETWORKING TECHNOLOGIES, CCF CHINANET 2023, 2024, 1988 : 84 - 98
  • [22] Towards Practical Unsupervised Anomaly Detection on Retinal Images
    Ouardini, Khalil
    Yang, Huijuan
    Unnikrishnan, Balagopal
    Romain, Manon
    Garcin, Camille
    Zenati, Houssam
    Campbell, J. Peter
    Chiang, Michael F.
    Kalpathy-Cramer, Jayashree
    Chandrasekhar, Vijay
    Krishnaswamy, Pavitra
    Foo, Chuan-Sheng
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER AND MEDICAL IMAGE LEARNING WITH LESS LABELS AND IMPERFECT DATA, DART 2019, MIL3ID 2019, 2019, 11795 : 225 - 234
  • [23] Towards Periodicity Based Anomaly Detection in SCADA Networks
    Barbosa, Rafael Ramos Regis
    Sadre, Ramin
    Pras, Aiko
    2012 IEEE 17TH CONFERENCE ON EMERGING TECHNOLOGIES & FACTORY AUTOMATION (ETFA), 2012,
  • [24] Towards Adaptive Anomaly Detection in Cellular Mobile Networks
    Sun, Bo
    Chen, Zhi
    Wang, Ruhai
    Yu, Fei
    Leung, Victor C. M.
    2006 3RD IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, VOLS 1-3, 2006, : 666 - +
  • [25] Towards Useful Anomaly Detection for Back Office Networks
    Yuksel, Omer
    Den Hartog, Jerry
    Etalle, Sandro
    INFORMATION SYSTEMS SECURITY, 2016, 10063 : 509 - 520
  • [26] Towards Provenance-Based Anomaly Detection in MapReduce
    Liao, Cong
    Squicciarini, Anna
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 647 - 656
  • [27] Towards a practical process model for Anomaly Detection Systems
    Schwenzfeier, Nils
    Gruhn, Volker
    2018 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON SOFTWARE ENGINEERING FOR COGNITIVE SERVICES (SE4COG), 2018, : 41 - 44
  • [28] IoT for Water Management: Towards Intelligent Anomaly Detection
    Gonzalez-Vidal, Aurora
    Cuenca-Jara, Jesus
    Skarmeta, Antonio F.
    2019 IEEE 5TH WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2019, : 858 - 863
  • [29] Towards Continuous Consistency Checking of DevOps Artefacts
    Colantoni, Alessandro
    Horvath, Benedek
    Horvath, Akos
    Berardinelli, Luca
    Wimmer, Manuel
    24TH ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION (MODELS-C 2021), 2021, : 450 - 454
  • [30] Towards Continuous Safety Assessment in Context of DevOps
    Zeller, Marc
    COMPUTER SAFETY, RELIABILITY, AND SECURITY (SAFECOMP 2021), 2021, 12853 : 145 - 157