Root-Cause Diagnosis Using Logs Generated by User Actions

被引:0
|
作者
Ikeuchi, Hiroki [1 ]
Watanabe, Akio [1 ]
Kawata, Takehiro [1 ]
Kawahara, Ryoichi [2 ]
机构
[1] NTT Corp, NTT Network Technol Labs, Tokyo 1808585, Japan
[2] Toyo Univ, Fac Informat Networking Innovat & Design, Tokyo 1150053, Japan
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Identifying the root cause of failures in a complicated communication system such as a cloud platform is time-consuming for system operators. Although most current diagnosis methods depend on logs that are observed passively, some failures generate quite similar logs and cannot be distinguished from one another with these methods. To overcome this difficulty, we propose a framework in which operators execute user actions and use logs generated by the actions in root-cause analysis. We focus on the fact that even if we do not see any differences between failures in logs observed passively, logs generated by a particular action may change depending on the failure. We also propose two methods for executing such effective actions in a proper order and obtaining informative logs efficiently. With these methods, we can identify the root cause of failures that are indistinguishable with current methods. We experimentally evaluated the effectiveness of our framework in a cloud system constructed with OpenStack.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Root-Cause Diagnosis for Rare Failures using Bayesian Network with Dynamic Modification
    Matsuo, Yoichi
    Nakano, Yuusuke
    Watanabe, Akio
    Watanabe, Keishiro
    Ishibashi, Keisuke
    Kawahara, Ryoichi
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [2] Using and justifying systematic root-cause analysis
    Paradies, M
    Marquardt, A
    HYDROCARBON PROCESSING, 2000, 79 (03): : 117 - +
  • [3] Beyond root-cause analysis
    Bergman, BLS
    Fundin, AP
    Gremyr, IC
    Johansson, PM
    ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2002 PROCEEDINGS, 2002, : 140 - 146
  • [4] Autonomous root-cause fault diagnosis using symbolic dynamic based causality analysis
    Rashidi, Bahador
    Zhao, Qing
    NEUROCOMPUTING, 2020, 401 : 10 - 27
  • [5] ROOT-CAUSE ANALYSIS AND CLEANROOM
    ADAMS, T
    IEEE SOFTWARE, 1994, 11 (04) : 4 - 4
  • [6] Root-Cause Localization using Restricted Boltzmann Machines
    Steinhauer, H. Joe
    Karlsson, Alexander
    Mathiason, Gunnar
    Helldin, Tove
    2016 19TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2016, : 248 - 255
  • [7] Identifying Root-Cause Metrics for Incident Diagnosis in Online Service Systems
    Wu, Canhua
    Zhao, Nengwen
    Wang, Lixin
    Yang, Xiaoqin
    Li, Shining
    Zhang, Ming
    Jin, Xing
    Wen, Xidao
    Nie, Xiaohui
    Zhang, Wenchi
    Sui, Kaixin
    Pei, Dan
    2021 IEEE 32ND INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2021), 2021, : 91 - 102
  • [8] Functional directed graphical models and applications in root-cause analysis and diagnosis
    Gomez, Ana Maria Estrada
    Paynabar, Kamran
    Pacella, Massimo
    JOURNAL OF QUALITY TECHNOLOGY, 2020, 53 (04) : 421 - 437
  • [9] Succeeding With OOS and Root-Cause Investigations
    Stumpff, James P.
    BIOPHARM INTERNATIONAL, 2020, 33 (07) : 36 - +
  • [10] Engineering practice: Using root-cause analysis to prevent failures
    2000, McGraw-Hill Inc, New York, NY, USA (107):