Signal Processing Based Method for Real-Time Anomaly Detection in High-Performance Computing

被引:1
|
作者
Dey, ArwIavo [1 ]
Islam, Tanzima [1 ]
Phelps, Chase [1 ]
Kelly, Christopher [2 ]
机构
[1] Texas State Univ, Dept Comp Sci, San Marcos, TX 78666 USA
[2] Brookhaven Natl Lab, Comp Sci Initiat, Long Isl City, NY USA
关键词
Real-time anomaly detection in HPC; Signal based anomaly detection; Fast Fourier Transform; CHIMBUKO;
D O I
10.1109/COMPSAC57700.2023.00037
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Performance anomalies can manifest as irregular execution times or abnormal execution events for many reasons, including network congestion and resource contention. Detecting such anomalies in real-time by analyzing the details of performance traces at scale is impractical due to the sheer volume of data High-Performance Computing (HPC) applications produce. In this paper, we propose formulating HPC performance anomaly detection as a signal-processing problem where anomalies can be treated as noise. We evaluate our proposed method in comparison with two other commonly used anomaly detection techniques of varying complexity based on their detection accuracy and scalability. Since real-time in-situ anomaly detection at a large scale requires lightweight methods that can handle a large volume of streaming data, we find that our proposed method provides the best trade-off. We then implement the proposed method in CHIMBUKO, the first online, distributed, and scalable workflow-level performance trace analysis framework. We compare our proposed signal-based anomaly detection algorithm with two other methods using a function of their accuracy, F1 score, and detection overhead. Our experiments demonstrate that our proposed approach achieves a 99% improvement for the benchmark datasets and a 93% improvement with CHIMBUKO traces.
引用
收藏
页码:233 / 240
页数:8
相关论文
共 50 条
  • [1] A High-Performance Scalable computing system for real-time signal processing applications
    Zhang, Xiongkui
    Liu, Guoman
    Gao, Meiguo
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 2, PROCEEDINGS, 2008, : 556 - 560
  • [2] REAL-TIME PROCESSING - A GROWING DOMAIN OF HIGH-PERFORMANCE COMPUTING
    MALINOWSKI, CW
    ELECTRONIC ENGINEERING, 1989, 61 (748): : 55 - &
  • [3] Design of a Flexible High-performance Real-time SAR Signal Processing System
    Jin, Ting
    Wang, Hongxian
    Liu, Hongwei
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 513 - 517
  • [4] High-performance scalable computing for real-time applications
    Boggess, T
    Shirley, F
    SIXTH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, PROCEEDINGS, 1997, : 332 - 335
  • [5] High-performance computing in real-time ultrasonic imaging
    Nocetti, DFG
    González, JS
    Casique, MFV
    Ramirez, RO
    Hernández, EM
    ACOUSTICAL IMAGING, VOL 24, 2000, 24 : 113 - 120
  • [6] High-performance computing for real-time spectral estimation
    Madeira, MM
    Bellis, SJ
    Beltran, LAA
    González, JS
    Nocetti, DFG
    Marnane, WP
    Tokhi, MO
    Ruano, MG
    CONTROL ENGINEERING PRACTICE, 1999, 7 (05) : 679 - 686
  • [7] Real-Time Causal Processing of Anomaly Detection
    Wang, Yulei
    Chen, Shih-Yu
    Wu, Chao-Cheng
    Liu, Chunghong
    Chang, Chein-, I
    HIGH-PERFORMANCE COMPUTING IN REMOTE SENSING II, 2012, 8539
  • [8] Research on high-performance, real-time periodic signal detection method based on field-programmable gate arrays (FPGAs)
    Wang, Xuan
    Shen, Zhongtao
    Shui, Yanbin
    Liu, Shubin
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2025, 96 (01):
  • [9] Timing Predictability in High-Performance Computing With Probabilistic Real-Time
    Reghenzani, Federico
    Massari, Giuseppe
    Fornaciari, William
    IEEE ACCESS, 2020, 8 (08): : 208566 - 208582
  • [10] High-performance computing nodes for real-time parallel applications
    Carden, TC
    Dobinson, RW
    Fisher, S
    Maley, PD
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1997, 394 (1-2): : 211 - 218