Hardware Performance Counters for System Reliability Monitoring

被引:0
|
作者
Leng, Elena Woo Lai [1 ]
Zwolinski, Mark [1 ]
Halak, Basel [1 ]
机构
[1] Univ Southampton, Dept Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
关键词
VARIABILITY;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As technology scaling reaches nanometre scales, the error rate due to variations in temperature and voltage, single event effects and component degradation increases, making components less reliable. In order to ensure a system continues to function correctly while facing known reliability issues, it is imperative that the system should have the means to detect the occurrence of errors due to the presence of faults. A system that behaves normally (no error detected in the system) exhibits a profile, and any deviations from this profile indicate that there is an anomaly in the system. In this paper, we propose to use hardware performance counters (HPCs) to measure events that occur during the execution of the program. We explore the various counters available which could be use to identify the anomalous behaviour in the system and develop a methodology to observe the anomalies using HPCs by creating a fault-free pattern and observing any subsequent changes in that pattern. We evaluate the proposed technique using GemFI, an architectural simulator based on Gem5 with additional fault injection capabilities. We compare the results obtained at the end of the execution with data collected during a time interval. Our results show that HPCs can be used to identify anomalous behaviour in a system that would lead to failure.
引用
收藏
页码:76 / 81
页数:6
相关论文
共 50 条
  • [31] Metis: a profiling toolkit based on the virtualization of hardware performance counters
    Xie, Xia
    Jiang, Haiou
    Jin, Hai
    Cao, Wenzhi
    Yuan, Pingpeng
    Yang, Laurence Tianruo
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2012, 2
  • [32] Early Detection of Ransomware Activity based on Hardware Performance Counters
    Anand, P. Mohan
    Charan, P. V. Sai
    Shukla, Sandeep K.
    PROCEEDINGS OF 2023 AUSTRALIAN COMPUTER SCIENCE WEEK, ACSW 2023, 2023, : 10 - 17
  • [33] Exploiting hardware performance counters with flow and context sensitive profiling
    Ammons, G
    Ball, T
    Larus, JR
    ACM SIGPLAN NOTICES, 1997, 32 (05) : 85 - 96
  • [34] HARDWARE MONITORING OF REAL-TIME COMPUTER SYSTEM PERFORMANCE
    ARNDT, FR
    OLIVER, GM
    COMPUTER, 1972, 5 (04) : 25 - &
  • [35] Myths in power estimation with Performance Monitoring Counters
    Mair, Jason
    Eyers, David
    Huang, Zhiyi
    Zhang, Haibo
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2014, 4 (02): : 83 - 93
  • [36] Detecting Malicious Attacks Exploiting Hardware Vulnerabilities Using Performance Counters
    Li, Congmiao
    Gaudiot, Jean-Luc
    2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2019, : 588 - 597
  • [37] Intelligent Malware Detection based on Hardware Performance Counters: A Comprehensive Survey
    Sayadi, Hossein
    He, Zhangying
    Makrani, Hosein Mohammadi
    Homayoun, Houman
    2024 25TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED 2024, 2024,
  • [38] Online Capacity Identification of Multitier Websites Using Hardware Performance Counters
    Rao, Jia
    Xu, Cheng-Zhong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (03) : 426 - 438
  • [39] Application Profiling Using Register-Instruction Hardware Performance Counters
    Menon, Anand
    Srivastava, Amisha
    Kundu, Shamik
    Basu, Kanad
    2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 199 - 204
  • [40] A Theoretical Study of Hardware Performance Counters-Based Malware Detection
    Basu, Kanad
    Krishnamurthy, Prashanth
    Khorrami, Farshad
    Karri, Ramesh
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 512 - 525