Graph neural networks for detecting anomalies in scientific workflows

被引:2
|
作者
Jin, Hongwei [1 ,6 ]
Raghavan, Krishnan [1 ]
Papadimitriou, George [2 ]
Wang, Cong [3 ]
Mandal, Anirban [3 ]
Kiran, Mariam [4 ]
Deelman, Ewa [2 ]
Balaprakash, Prasanna [5 ]
机构
[1] Argonne Natl Lab, Lemont, IL USA
[2] Univ Southern Calif, Los Angeles, CA USA
[3] Renaissance Comp Inst RENCI, Chapel Hill, NC USA
[4] Energy Sci Network ESnet, Berkeley, CA USA
[5] Oak Ridge Natl Lab, Oak Ridge, TN USA
[6] Argonne Natl Lab, Math & Comp Sci Div, 9700 S Cass Ave, Lemont, IL 60439 USA
关键词
Anomaly detection; machine learning; graph neural networks; scientific workflows; hyperparameter tuning; explainable predictions;
D O I
10.1177/10943420231172140
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying and addressing anomalies in complex, distributed systems can be challenging for reliable execution of scientific workflows. We model these workflows as directed acyclic graphs (DAGs), where the nodes and edges of the DAGs represent jobs and their dependencies, respectively. We develop graph neural networks (GNNs) to learn patterns in the DAGs and to detect anomalies at the node (job) and graph (workflow) levels. We investigate workflow-specific GNN models that are trained on a particular workflow and workflow-agnostic GNN models that are trained across the workflows. Our GNN models, which incorporate both individual job features and topological information from the workflow, show improved accuracy and efficiency compared to conventional learning methods for detecting anomalies. While joint trained with multiple scientific workflows, our GNN models reached an accuracy more than 80% for workflow level and 75% for job level anomalies. In addition, we illustrate the importance of hyperparameter tuning method in our study that can significantly improve the metric(s) measure of evaluating the GNN models. Finally, we integrate explainable GNN methods to provide insights on job features in the workflow that cause an anomaly.
引用
收藏
页码:394 / 411
页数:18
相关论文
共 50 条
  • [41] Graph Clustering with Graph Neural Networks
    Tsitsulin, Anton
    Palowitch, John
    Perozzi, Bryan
    Mueller, Emmanuel
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [42] Graphs, Convolutions, and Neural Networks: From Graph Filters to Graph Neural Networks
    Gama, Fernando
    Isufi, Elvin
    Leus, Geert
    Ribeiro, Alejandro
    IEEE SIGNAL PROCESSING MAGAZINE, 2020, 37 (06) : 128 - 138
  • [43] LONGAN: Detecting Lateral Movement based on Heterogeneous Graph Neural Networks with Temporal Features
    Zong, Yangyang
    Shi, Zhixin
    Huang, Weiqing
    2024 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, ISCC 2024, 2024,
  • [44] Graph neural networks and cross-protocol analysis for detecting malicious IP addresses
    Huang, Yonghong
    Negrete, Joanna
    Wagener, John
    Fralick, Celeste
    Rodriguez, Armando
    Peterson, Eric
    Wosotowsky, Adam
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3857 - 3869
  • [45] On the Use of Heterogeneous Graph Neural Networks for Detecting Malicious Activities: a Case Study with Cryptocurrencies
    Ferretti, Stefano
    D'Angelo, Gabriele
    Ghini, Vittorio
    PROCEEDINGS OF THE 2024 WORKSHOP ON OPEN CHALLENGES IN ONLINE SOCIAL NETWORKS, OASIS 2024, 2024, : 33 - 40
  • [46] Graph neural networks and cross-protocol analysis for detecting malicious IP addresses
    Yonghong Huang
    Joanna Negrete
    John Wagener
    Celeste Fralick
    Armando Rodriguez
    Eric Peterson
    Adam Wosotowsky
    Complex & Intelligent Systems, 2023, 9 : 3857 - 3869
  • [47] Neural Pooling for Graph Neural Networks
    Harsha, Sai Sree
    Mishra, Deepak
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 171 - 180
  • [48] Graphon Neural Networks and the Transferability of Graph Neural Networks
    Ruiz, Luana
    Chamon, Luiz F. O.
    Ribeiro, Alejandro
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [49] Scientific workflows
    Anna-Lena Lamprecht
    Kenneth J. Turner
    International Journal on Software Tools for Technology Transfer, 2016, 18 : 575 - 580
  • [50] Deep Auscultation with Demographic Data: Detecting Respiratory Anomalies Using Convolutional Neural Networks and Autoencoders
    Xu, Mohan
    Wiese, Lena
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 275 - 289