Graph neural networks for detecting anomalies in scientific workflows

被引:2
|
作者
Jin, Hongwei [1 ,6 ]
Raghavan, Krishnan [1 ]
Papadimitriou, George [2 ]
Wang, Cong [3 ]
Mandal, Anirban [3 ]
Kiran, Mariam [4 ]
Deelman, Ewa [2 ]
Balaprakash, Prasanna [5 ]
机构
[1] Argonne Natl Lab, Lemont, IL USA
[2] Univ Southern Calif, Los Angeles, CA USA
[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA
[4] Energy Sci Network ESnet, Berkeley, CA USA
[5] Oak Ridge Natl Lab, Oak Ridge, TN USA
[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA
关键词
Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions;
D O I
10.1177/10943420231172140
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying and addressing anomalies in complex, distributed systems can be challenging for reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models that are trained on a particular workflow and workflow-agnostic GNN models that are trained across the workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. While joint trained with multiple scientific workflows, our GNN models reached an accuracy more than 80% for workflow level and 75% for job level anomalies. In addition, we illustrate the importance of hyperparameter tuning method in our study that can significantly improve the metric(s) measure of evaluating the GNN models. Finally, we integrate explainable GNN methods to provide insights on job features in the workflow that cause an anomaly.
引用
收藏
页码:394 / 411
页数:18
相关论文
共 50 条
  • [21] Detecting and locating configuration errors in IP VPNs with Graph Neural Networks
    Mohammedi, El-Heithem
    Lavinal, Emmanuel
    Fleury, Guillaume
    PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
  • [22] NESTEDGNN: Detecting Malicious Network Activity with Nested Graph Neural Networks
    Ji, Yuede
    Huang, H. Howie
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 2694 - 2699
  • [23] Detecting anomalies in cargo using graph properties
    Eberle, William
    Holder, Lawrence
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 728 - 730
  • [24] Detecting Non-Transient Anomalies in Visual Information Using Neural Networks
    Kounavis, Michael E.
    Morrissette, Joel
    Srinivasan, Sadagopan
    Yavatkar, Raj
    2011 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2011,
  • [25] LSTM Neural Networks for Detecting Anomalies Caused by Web Application Cyber Attacks
    Kotenko, Igor
    Lauta, Oleg
    Kribel, Kseniya
    Saenko, Igor
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2021, 337 : 127 - 140
  • [26] MODELING HIERARCHICAL TOPOLOGICAL STRUCTURE IN SCIENTIFIC IMAGES WITH GRAPH NEURAL NETWORKS
    Leventhal, Samuel
    Gyulassy, Attila
    Pascucci, Valerio
    Heimann, Mark
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2995 - 2999
  • [27] NED-GNN: Detecting and Dropping Noisy Edges in Graph Neural Networks
    Xu, Ming
    Zhang, Baoming
    Yuan, Jinliang
    Cao, Meng
    Wang, Chongjun
    WEB AND BIG DATA, PT I, APWEB-WAIM 2022, 2023, 13421 : 91 - 105
  • [28] A framework for detecting anomalies in VoIP networks
    Bouzida, Yacine
    Mangin, Christophe
    ARES 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON AVAILABILITY, SECURITY AND RELIABILITY, 2008, : 204 - +
  • [29] DETECTING ANOMALIES IN PROCESS CONTROL NETWORKS
    Rrushi, Julian
    Kang, Kyoung-Don
    CRITICAL INFRASTRUCTURE PROTECTION III, 2009, 311 : 151 - 165
  • [30] Detecting Network Anomalies in Backbone Networks
    Callegari, Christian
    Gazzarrini, Loris
    Giordano, Stefano
    Pagano, Michele
    Pepe, Teresa
    RECENT ADVANCES IN INTRUSION DETECTION, 2010, 6307 : 490 - 491