InvarNet-X: A Black-Box Invariant-Based Approach to Diagnosing Big Data Systems

被引:4
|
作者
Chen, Pengfei [1 ]
Qi, Yong [1 ]
Hou, Di [1 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Big data; Hadoop; invariant; maximal information criterion; performance diagnosis; LIKELY INVARIANTS;
D O I
10.1109/TETC.2015.2497143
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As big data spreads rapidly, performance problems in these systems become common concerns. As the first line of defending these problems, performance diagnosis plays an essential role in big data systems. It is notoriously difficult to conduct performance diagnosis in large distributed systems. Previous work either pinpoint the root causes by instrumenting the applications or runtime systems in a white box way, which leads to a considerable overhead, or just provide some hints to the hidden root causes in a black-box way. Very few works focus on pinpointing the real root causes in a black-box way. To address this problem, this paper proposes a black-box invariant-based diagnosing approach and implements a proof-of concept system named InvarNet-X. In this paper, performance diagnosis is formalized as a pattern recognition problem, meaning that each performance problem is identified by a specific pattern. The rationale of InvarNet-X is that the unobservable root causes of performance problems always expose themselves through the violations of the associations among directly observable performance metrics. Such observable associations are called likely invariants calculated by the maximal information criterion, and each performance problem is signified by a sparse distributed representation. A problem signature database is constructed by training multiple real performance problems in advance. Once a performance anomaly is detected, the diagnosing procedure is triggered. The root cause is pinpointed by retrieving similar signatures in the signature database. The experimental evaluations in a controlled big data system show that InvarNet-X can achieve a high accuracy in diagnosing some real performance problems reported in software bug repositories, which is superior to several state-of-the-art approaches. Moreover, the light-weight property makes InvarNet-X easily facilitated in large-scale big data systems in real time.
引用
收藏
页码:450 / 465
页数:16
相关论文
共 50 条
  • [1] Big data and black-box medical algorithms
    Price, W. Nicholson
    SCIENCE TRANSLATIONAL MEDICINE, 2018, 10 (471)
  • [2] Data-driven invariant subspace identification for black-box switched linear systems
    Berger, Guillaume O.
    Jungers, Raphael M.
    Wang, Zheming
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 32 - 37
  • [3] An invariant-based approach to the design of hybrid control systems
    Stiver, JA
    Koutsoukos, XD
    Antsaklis, PJ
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2001, 11 (05) : 453 - 478
  • [4] A data-driven method for computing polyhedral invariant sets of black-box switched linear systems
    Wang, Zheming
    Jungers, Raphael M.
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 3416 - 3421
  • [5] A Data-Driven Method for Computing Polyhedral Invariant Sets of Black-Box Switched Linear Systems
    Wang, Zheming
    Jungers, Raphael M.
    IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (05): : 1843 - 1848
  • [6] On the physical interpretation of statistical data from black-box systems
    Eliazar, Iddo I.
    Cohen, Morrel H.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2013, 392 (13) : 2924 - 2939
  • [8] System identification of power converters based on a black-box approach
    Choi, JY
    Cho, BH
    VanLandingham, HF
    Mok, H
    Song, JH
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-FUNDAMENTAL THEORY AND APPLICATIONS, 1998, 45 (11): : 1148 - 1158
  • [9] A fuzzy-based black-box approach to IGBT modelling
    Monti, A
    ICECS 96 - PROCEEDINGS OF THE THIRD IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2, 1996, : 1147 - 1150
  • [10] GranMicro: A Black-Box Based Approach for Optimizing Microservices Based Applications
    Mustafa, Ola
    Gomez, Jorge Marx
    Hamed, Mohamad
    Pargmann, Hergen
    FROM SCIENCE TO SOCIETY: NEW TRENDS IN ENVIRONMENTAL INFORMATICS, 2018, : 283 - 294