Assessing Data Usefulness for Failure Analysis in Anonymized System Logs

被引:4
|
作者
Ghiasvand, Siavash [1 ]
Ciorba, Florina M. [2 ]
机构
[1] Tech Univ Dresden, Dresden, Germany
[2] Univ Basel, Basel, Switzerland
关键词
D O I
10.1109/ISPDC2018.2018.00031
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
System logs are a valuable source of information for the analysis and understanding of systems behavior for the purpose of improving their performance. Such logs contain various types of information, including sensitive information. Information deemed sensitive can either directly be extracted from system log entries by correlation of several log entries, or can be inferred from the combination of the (non-sensitive) information contained within system logs with other logs and/or additional datasets. The analysis of system logs containing sensitive information compromises data privacy. Therefore, various anonymization techniques, such as generalization and suppression have been employed, over the years, by data and computing centers to protect the privacy of their users, their data, and the system as a whole. Privacy-preserving data resulting from anonymization via generalization and suppression may lead to significantly decreased data usefulness, thus, hindering the intended analysis for understanding the system behavior. Maintaining a balance between data usefulness and privacy preservation, therefore, remains an open and important challenge. Irreversible encoding of system logs using collision-resistant hashing algorithms, such as SHAKE-128, is a novel approach previously introduced by the authors to mitigate data privacy concerns. The present work describes a study of the applicability of the encoding approach from earlier work on the system logs of a production high performance computing system. Moreover, a metric is introduced to assess the data usefulness of the anonymized system logs to detect and identify the failures encountered in the system.
引用
收藏
页码:164 / 171
页数:8
相关论文
共 50 条
  • [11] Screen failure data in clinical trials: Are screening logs worth it?
    Elm, Jordan J.
    Palesch, Yuko
    Easton, J. Donald
    Lindblad, Anne
    Barsan, William
    Silbergleit, Robert
    Conwit, Robin
    Dillon, Catherine
    Farrant, Mary
    Battenhouse, Holly
    Perlmutter, Aaron
    Johnston, S. Claiborne
    CLINICAL TRIALS, 2014, 11 (04) : 467 - 472
  • [12] Utility Analysis of Differentially Private Anonymized Data Based on Random Sampling
    Sugiyama, Takumi
    Oosugi, Hiroto
    Yamanaka, Io
    Minami, Kazuhiro
    PRIVACY IN STATISTICAL DATABASES, PSD 2024, 2024, 14915 : 35 - 47
  • [13] ASSESSING THE RISK FOR SYSTEM FAILURE
    ANDREWS, DC
    JOURNAL OF SYSTEMS MANAGEMENT, 1982, 33 (12): : 30 - 36
  • [14] Runtime Prediction of Failure Modes from System Error Logs
    Shalan, Atef
    Zulkernine, Mohammad
    2013 18TH INTERNATIONAL CONFERENCE ON ENGINEERING OF COMPLEX COMPUTER SYSTEMS (ICECCS), 2013, : 232 - 241
  • [15] Usefulness of the Edmonton Frail Scale in Assessing the Impact of Heart Failure on Frailty
    Iovanovici, Diana-Carina
    Negru, Alina-Gariela
    Cseppento, Carmen Delia Nistor
    Bochis, Calin Florin
    Dogaru, Bombonica Gabriela
    Jurcau, Anamaria
    Emilia, Babes Elena
    Pascalau, Nicoleta Anamaria
    Ghitea, Timea Claudia
    Bungau, Alexa Florina
    Bungau, Simona Gabriela
    IN VIVO, 2023, 37 (04): : 1857 - 1866
  • [16] Using Message Logs and Resource Use Data for Cluster Failure Diagnosis
    Chuah, Edward
    Jhumka, Arshad
    Browne, James C.
    Gurumdimma, Nentawe
    Narasimhamurthy, Sai
    Barth, Bill
    PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 232 - 241
  • [17] Publishing Anonymized Set-Valued Data via Disassociation towards Analysis
    Awad, Nancy
    Couchot, Jean-Francois
    Al Bouna, Bechara
    Philippe, Laurent
    FUTURE INTERNET, 2020, 12 (04):
  • [19] The Usefulness of Assessing Glaucoma Progression With Postprocessed Visual Field Data
    Abu, Sampson L.
    Poleon, Shervonne
    Racette, Lyne
    TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2022, 11 (05):
  • [20] Safety Analysis of High-Dimensional Anonymized Data from Multiple Perspectives
    Nramazoe, Takaya
    Omote, Kazumasa
    NETWORK AND SYSTEM SECURITY, NSS 2020, 2020, 12570 : 94 - 111