Graph neural networks for detecting anomalies in scientific workflows

被引：2

作者：

Jin, Hongwei ^{[1
,6
]}

Raghavan, Krishnan ^{[1
]}

Papadimitriou, George ^{[2
]}

Wang, Cong ^{[3
]}

Mandal, Anirban ^{[3
]}

Kiran, Mariam ^{[4
]}

Deelman, Ewa ^{[2
]}

Balaprakash, Prasanna ^{[5
]}

机构：

[1] Argonne Natl Lab, Lemont, IL USA

[2] Univ Southern Calif, Los Angeles, CA USA

[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA

[4] Energy Sci Network ESnet, Berkeley, CA USA

[5] Oak Ridge Natl Lab, Oak Ridge, TN USA

[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA

来源：

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2023年 / 37卷 / 3-4期

关键词：

Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions;

D O I：

10.1177/10943420231172140

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Identifying and addressing anomalies in complex, distributed systems can be challenging for reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models that are trained on a particular workflow and workflow-agnostic GNN models that are trained across the workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. While joint trained with multiple scientific workflows, our GNN models reached an accuracy more than 80% for workflow level and 75% for job level anomalies. In addition, we illustrate the importance of hyperparameter tuning method in our study that can significantly improve the metric(s) measure of evaluating the GNN models. Finally, we integrate explainable GNN methods to provide insights on job features in the workflow that cause an anomaly.

引用

页码：394 / 411

页数：18

共 50 条

[21] Detecting and locating configuration errors in IP VPNs with Graph Neural Networks
Mohammedi, El-Heithem
Lavinal, Emmanuel
Fleury, Guillaume
PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
[22] NESTEDGNN: Detecting Malicious Network Activity with Nested Graph Neural Networks
Ji, Yuede
Huang, H. Howie
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 2694 - 2699
[23] Detecting anomalies in cargo using graph properties
Eberle, William
Holder, Lawrence
INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 728 - 730
[24] Detecting Non-Transient Anomalies in Visual Information Using Neural Networks
Kounavis, Michael E.
Morrissette, Joel
Srinivasan, Sadagopan
Yavatkar, Raj
2011 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2011,
[25] LSTM Neural Networks for Detecting Anomalies Caused by Web Application Cyber Attacks
Kotenko, Igor
Lauta, Oleg
Kribel, Kseniya
Saenko, Igor
NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2021, 337 : 127 - 140
[26] MODELING HIERARCHICAL TOPOLOGICAL STRUCTURE IN SCIENTIFIC IMAGES WITH GRAPH NEURAL NETWORKS
Leventhal, Samuel
Gyulassy, Attila
Pascucci, Valerio
Heimann, Mark
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2995 - 2999
[27] NED-GNN: Detecting and Dropping Noisy Edges in Graph Neural Networks
Xu, Ming
Zhang, Baoming
Yuan, Jinliang
Cao, Meng
Wang, Chongjun
WEB AND BIG DATA, PT I, APWEB-WAIM 2022, 2023, 13421 : 91 - 105
[28] A framework for detecting anomalies in VoIP networks
Bouzida, Yacine
Mangin, Christophe
ARES 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON AVAILABILITY, SECURITY AND RELIABILITY, 2008, : 204 - +
[29] DETECTING ANOMALIES IN PROCESS CONTROL NETWORKS
Rrushi, Julian
Kang, Kyoung-Don
CRITICAL INFRASTRUCTURE PROTECTION III, 2009, 311 : 151 - 165
[30] Detecting Network Anomalies in Backbone Networks
Callegari, Christian
Gazzarrini, Loris
Giordano, Stefano
Pagano, Michele
Pepe, Teresa
RECENT ADVANCES IN INTRUSION DETECTION, 2010, 6307 : 490 - 491

← 1 2 3 4 5 →