Graph neural networks for detecting anomalies in scientific workflows

被引：2

作者：

Jin, Hongwei ^{[1
,6
]}

Raghavan, Krishnan ^{[1
]}

Papadimitriou, George ^{[2
]}

Wang, Cong ^{[3
]}

Mandal, Anirban ^{[3
]}

Kiran, Mariam ^{[4
]}

Deelman, Ewa ^{[2
]}

Balaprakash, Prasanna ^{[5
]}

机构：

[1] Argonne Natl Lab, Lemont, IL USA

[2] Univ Southern Calif, Los Angeles, CA USA

[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA

[4] Energy Sci Network ESnet, Berkeley, CA USA

[5] Oak Ridge Natl Lab, Oak Ridge, TN USA

[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA

来源：

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS | 2023年 / 37卷 / 3-4期

关键词：

Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions;

D O I：

10.1177/10943420231172140

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Identifying and addressing anomalies in complex, distributed systems can be challenging for reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models that are trained on a particular workflow and workflow-agnostic GNN models that are trained across the workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. While joint trained with multiple scientific workflows, our GNN models reached an accuracy more than 80% for workflow level and 75% for job level anomalies. In addition, we illustrate the importance of hyperparameter tuning method in our study that can significantly improve the metric(s) measure of evaluating the GNN models. Finally, we integrate explainable GNN methods to provide insights on job features in the workflow that cause an anomaly.

引用

页码：394 / 411

页数：18

共 50 条

[41] Graph Clustering with Graph Neural Networks
Tsitsulin, Anton
Palowitch, John
Perozzi, Bryan
Mueller, Emmanuel
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[42] Graphs, Convolutions, and Neural Networks: From Graph Filters to Graph Neural Networks
Gama, Fernando
Isufi, Elvin
Leus, Geert
Ribeiro, Alejandro
IEEE SIGNAL PROCESSING MAGAZINE, 2020, 37 (06) : 128 - 138
[43] LONGAN: Detecting Lateral Movement based on Heterogeneous Graph Neural Networks with Temporal Features
Zong, Yangyang
Shi, Zhixin
Huang, Weiqing
2024 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, ISCC 2024, 2024,
[44] Graph neural networks and cross-protocol analysis for detecting malicious IP addresses
Huang, Yonghong
Negrete, Joanna
Wagener, John
Fralick, Celeste
Rodriguez, Armando
Peterson, Eric
Wosotowsky, Adam
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3857 - 3869
[45] On the Use of Heterogeneous Graph Neural Networks for Detecting Malicious Activities: a Case Study with Cryptocurrencies
Ferretti, Stefano
D'Angelo, Gabriele
Ghini, Vittorio
PROCEEDINGS OF THE 2024 WORKSHOP ON OPEN CHALLENGES IN ONLINE SOCIAL NETWORKS, OASIS 2024, 2024, : 33 - 40
[46] Graph neural networks and cross-protocol analysis for detecting malicious IP addresses
Yonghong Huang
Joanna Negrete
John Wagener
Celeste Fralick
Armando Rodriguez
Eric Peterson
Adam Wosotowsky
Complex & Intelligent Systems, 2023, 9 : 3857 - 3869
[47] Neural Pooling for Graph Neural Networks
Harsha, Sai Sree
Mishra, Deepak
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 171 - 180
[48] Graphon Neural Networks and the Transferability of Graph Neural Networks
Ruiz, Luana
Chamon, Luiz F. O.
Ribeiro, Alejandro
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[49] Scientific workflows
Anna-Lena Lamprecht
Kenneth J. Turner
International Journal on Software Tools for Technology Transfer, 2016, 18 : 575 - 580
[50] Deep Auscultation with Demographic Data: Detecting Respiratory Anomalies Using Convolutional Neural Networks and Autoencoders
Xu, Mohan
Wiese, Lena
PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 275 - 289

← 1 2 3 4 5 →