Graph-Based Root Cause Localization in Microservice Systems with Protection Mechanisms

被引:0
|
作者
Tian, Wei [1 ]
Zhang, Haitao [1 ]
Yang, Neng [1 ]
Zhang, Yepeng [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Intelligent Telecommun Software &, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Microservice; root cause localization; attributed graph; protection mechanism; random walking;
D O I
10.1142/S0218194023500304
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Service anomalies are difficult to locate accurately due to their propagation through service dependencies in microservice systems. Besides, the protection mechanisms are introduced into the microservice systems to ensure the stable operation of services. However, the existing approaches ignore the impact of protection mechanisms on the root cause localization of abnormal services. Specifically, the circuit breaking and rate limiting mechanisms can refuse service requests and thus change the way of anomaly propagation. Moreover, the different service request frequencies and latency make service dependencies change dynamically, resulting in the different probabilities of anomaly propagation among services. In this paper, we propose a novel framework named MicroGBPM to locate the root cause of abnormal services. We model the anomaly propagation among services as a dynamically constructed service attributed graph with metrics and traces when a failure occurs. To eliminate the impact of the protection mechanisms, we design a two-stage dynamic calibration strategy to adjust the probability of anomaly propagation among services. Then, we propose a random walking approach to calculate the root cause results by using the PageRank algorithm. The experimental results show that MicroGBPM improves the accuracy of root cause localization compared to other approaches in the microservice systems with protection mechanisms.
引用
收藏
页码:1211 / 1238
页数:28
相关论文
共 50 条
  • [1] Graph-based root cause analysis for service-oriented and microservice architectures
    Brandon, Alvaro
    Sole, Marc
    Huelamo, Alberto
    Solans, David
    Perez, Maria S.
    Muntes-Mulero, Victor
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 159
  • [2] DGERCL: A Dynamic Graph Embedding Approach for Root Cause Localization in Microservice Systems
    Cheng, Han
    Li, Qian
    Liu, Bingchen
    Liu, Shijun
    Pan, Li
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (06) : 3417 - 3428
  • [3] MicroEGRCL: An Edge-Attention-Based Graph Neural Network Approach for Root Cause Localization in Microservice Systems
    Chen, Ruibo
    Ren, Jian
    Wang, Lingfeng
    Pu, Yanjun
    Yang, Kaiyuan
    Wu, Wenjun
    SERVICE-ORIENTED COMPUTING (ICSOC 2022), 2022, 13740 : 264 - 272
  • [4] The design of graph-based privacy protection mechanisms for mobile systems
    Zhang, Zhong
    Yoon, Sungha
    Shin, Minho
    2019 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON), 2019, : 55 - 60
  • [5] ModelCoder: A Fault Model based Automatic Root Cause Localization Framework for Microservice Systems
    Cai, Yang
    Han, Biao
    Li, Jie
    Zhao, Na
    Su, Jinshu
    2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [6] Practical Root Cause Localization for Microservice Systems via Trace Analysis
    Li, Zeyan
    Chen, Junjie
    Jiao, Rui
    Zhao, Nengwen
    Wang, Zhijun
    Zhang, Shuwei
    Wu, Yanjun
    Jiang, Long
    Yan, Leiqin
    Wang, Zikai
    Chen, Zhekang
    Zhang, Wenchi
    Nie, Xiaohui
    Sui, Kaixin
    Pei, Dan
    2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [7] MicroIRC: Instance-level Root Cause Localization for Microservice Systems
    Zhu, Yuhan
    Wang, Jian
    Li, Bing
    Zhao, Yuqi
    Zhang, Zekun
    Xiong, Yiming
    Chen, Shiping
    JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 216
  • [8] Graph-Based IoT Microservice Security
    Pahl, Marc-Oliver
    Aubet, Francois-Xavier
    Liebald, Stefan
    NOMS 2018 - 2018 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2018,
  • [9] TraceModel: An Automatic Anomaly Detection and Root Cause Localization Framework for Microservice Systems
    Cai, Yang
    Han, Biao
    Su, Jinshu
    Wang, Xiaoyan
    2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 512 - 519
  • [10] Interpretable Failure Localization for Microservice Systems Based on Graph Autoencoder
    Sun, Yongqian
    Lin, Zihan
    Shi, Binpeng
    Zhang, Shenglin
    Ma, Shiyu
    Jin, Pengxiang
    Zhong, Zhenyu
    Pan, Lemeng
    Guo, Yicheng
    Pei, Dan
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2025, 34 (02)