Assessing Data Usefulness for Failure Analysis in Anonymized System Logs

Cited by: 4
Authors
Ghiasvand, Siavash [1]
Ciorba, Florina M. [2]
Affiliations
[1] Tech Univ Dresden, Dresden, Germany
[2] Univ Basel, Basel, Switzerland
DOI
10.1109/ISPDC2018.2018.00031
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
System logs are a valuable source of information for the analysis and understanding of system behavior for the purpose of improving system performance. Such logs contain various types of information, including sensitive information. Information deemed sensitive can either be extracted directly from system log entries by correlating several of them, or be inferred by combining the (non-sensitive) information contained within system logs with other logs and/or additional datasets. The analysis of system logs containing sensitive information compromises data privacy. Therefore, various anonymization techniques, such as generalization and suppression, have been employed over the years by data and computing centers to protect the privacy of their users, their data, and the system as a whole. Privacy-preserving data resulting from anonymization via generalization and suppression may be significantly less useful, thus hindering the intended analysis for understanding the system behavior. Maintaining a balance between data usefulness and privacy preservation therefore remains an open and important challenge. Irreversible encoding of system logs using collision-resistant hashing algorithms, such as SHAKE-128, is a novel approach previously introduced by the authors to mitigate data privacy concerns. The present work describes a study of the applicability of the encoding approach from earlier work to the system logs of a production high-performance computing system. Moreover, a metric is introduced to assess the usefulness of the anonymized system logs for detecting and identifying the failures encountered in the system.
Pages: 164-171
Number of pages: 8
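
Illustrative sketch
The abstract mentions irreversible encoding of system logs with a collision-resistant hash such as SHAKE-128 but does not describe the scheme itself. The following is a minimal sketch of one way such an encoding could look, not the authors' implementation. The token-wise hashing, the 8-byte digest length, the function names `encode_token` and `encode_line`, and the sample log line are all assumptions made for illustration.

```python
import hashlib


def encode_token(token: str, digest_bytes: int = 8) -> str:
    """Return a fixed-length hex digest of one log token using SHAKE-128.

    digest_bytes is an assumed parameter; shorter digests save space
    but make collisions more likely.
    """
    return hashlib.shake_128(token.encode("utf-8")).hexdigest(digest_bytes)


def encode_line(line: str, digest_bytes: int = 8) -> str:
    """Encode each whitespace-separated token of a log line independently.

    Token-wise encoding is an assumption of this sketch: identical tokens
    map to identical digests, so recurring patterns stay comparable
    across entries even though the original text is not recoverable
    from the digest alone.
    """
    return " ".join(encode_token(t, digest_bytes) for t in line.split())


# Hypothetical log entry, invented for this example.
sample = "sshd[2211]: Failed password for alice from 10.0.0.5"
encoded = encode_line(sample)
print(encoded)
```

Because the mapping is deterministic, two entries that share a token also share its digest, which is what would let failure patterns be counted and correlated after encoding. The same property means low-entropy tokens, such as short user names, could be guessed by hashing candidate values, so a sketch like this should not be read as a complete privacy guarantee.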