InvarNet-X: A Black-Box Invariant-Based Approach to Diagnosing Big Data Systems

被引:4
|
作者
Chen, Pengfei [1 ]
Qi, Yong [1 ]
Hou, Di [1 ]
机构
[1] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Big data; Hadoop; invariant; maximal information criterion; performance diagnosis; LIKELY INVARIANTS;
D O I
10.1109/TETC.2015.2497143
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As big data spreads rapidly, performance problems in these systems become common concerns. As the first line of defending these problems, performance diagnosis plays an essential role in big data systems. It is notoriously difficult to conduct performance diagnosis in large distributed systems. Previous work either pinpoint the root causes by instrumenting the applications or runtime systems in a white box way, which leads to a considerable overhead, or just provide some hints to the hidden root causes in a black-box way. Very few works focus on pinpointing the real root causes in a black-box way. To address this problem, this paper proposes a black-box invariant-based diagnosing approach and implements a proof-of concept system named InvarNet-X. In this paper, performance diagnosis is formalized as a pattern recognition problem, meaning that each performance problem is identified by a specific pattern. The rationale of InvarNet-X is that the unobservable root causes of performance problems always expose themselves through the violations of the associations among directly observable performance metrics. Such observable associations are called likely invariants calculated by the maximal information criterion, and each performance problem is signified by a sparse distributed representation. A problem signature database is constructed by training multiple real performance problems in advance. Once a performance anomaly is detected, the diagnosing procedure is triggered. The root cause is pinpointed by retrieving similar signatures in the signature database. The experimental evaluations in a controlled big data system show that InvarNet-X can achieve a high accuracy in diagnosing some real performance problems reported in software bug repositories, which is superior to several state-of-the-art approaches. Moreover, the light-weight property makes InvarNet-X easily facilitated in large-scale big data systems in real time.
引用
收藏
页码:450 / 465
页数:16
相关论文
共 50 条
  • [21] Identification of nonlinear black-box systems based on Universal Learning Networks
    Hu, JL
    Hirasawa, K
    Murata, J
    Ohbayashi, M
    Kumamaru, K
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 2465 - 2470
  • [22] An Black-Box Testing Approach on User Modeling in Practical Movie Recommendation Systems
    Xuan Hau Pham
    Tu Ngoc Luong
    Jung, Jason J.
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2013, 8083 : 72 - 79
  • [23] PoisonRec: An Adaptive Data Poisoning Framework for Attacking Black-box Recommender Systems
    Song, Junshuai
    Li, Zhao
    Hu, Zehong
    Wu, Yucheng
    Li, Zhenpeng
    Li, Jian
    Gao, Jun
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 157 - 168
  • [24] Improved approach to KDI-based fault detection for non-linear black-box systems
    Kumamaru, K
    Inoue, K
    Iwamura, T
    SICE 2004 ANNUAL CONFERENCE, VOLS 1-3, 2004, : 927 - 932
  • [25] Data-Driven Approximate Abstraction for Black-Box Piecewise Affine Systems
    Chen, Gang
    Kong, Zhaodan
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 786 - 791
  • [26] DDImage: an image reduction based approach for automatically explaining black-box classifiers
    Jiang, Mingyue
    Tang, Chengjian
    Zhang, Xiao-Yi
    Zhao, Yangyang
    Ding, Zuohua
    EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (05)
  • [27] A kriging-based approach to MINLP containing black-box models and noise
    Davis, Eddie
    Ierapetritou, Marianthi
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2008, 47 (16) : 6101 - 6125
  • [28] foREST: A Tree-based Black-box Fuzzing Approach for RESTful APIs
    Lin, Jiaxian
    Li, Tianyu
    Chen, Yang
    Wei, Guangsheng
    Lin, Jiadong
    Zhang, Sen
    Xu, Hui
    2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, ISSRE, 2023, : 695 - 705
  • [29] Defending mutation-based adversarial text perturbation: a black-box approach
    Demetrio Deanda
    Izzat Alsmadi
    Jesus Guerrero
    Gongbo Liang
    Cluster Computing, 2025, 28 (3)
  • [30] An approach to testing black-box components using contract-based mutation
    Jiang, Ying
    Hou, Shan-Shan
    Shan, Jin-Hui
    Zhang, Lu
    Xie, Bing
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2008, 18 (01) : 93 - 117