CausalRCA: Causal Inference based Precise Fine-grained Root Cause Localization for Microservice Applications

09/06/2022
by Ruyue Xin, et al.

For microservice applications with detected performance anomalies, localizing root causes from monitoring data is important for rapid recovery and loss mitigation. Existing research mainly focuses on coarse-grained localization of the faulty service. Fine-grained root cause localization, which identifies not only the faulty service but also the root cause metric within that service, is more helpful for operators fixing application anomalies, but it is also more challenging. Causal inference (CI) based methods have recently become popular, but the CI methods currently in use have limitations, such as assuming linear causal relations. This paper therefore presents a framework named CausalRCA that implements fine-grained, automated, and real-time root cause localization. CausalRCA uses a gradient-based causal structure learning method to generate weighted causal graphs and a root cause inference method to localize root cause metrics. We conduct coarse-grained and fine-grained root cause localization to validate the localization performance of CausalRCA. Experimental results show that CausalRCA achieves the best localization accuracy among the compared methods; for example, the average AC@3 of fine-grained root cause metric localization in the faulty service is 0.719, an average improvement of 17% over the baseline methods.
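The abstract reports results as AC@3 without defining the metric. As a rough illustration only, the sketch below computes AC@k under the definition commonly used in root cause localization work: for each anomaly case, the fraction of true root causes that appear in the top k ranked candidates, averaged over all cases. This definition, the function name `ac_at_k`, and the metric names in the usage example are assumptions for illustration; they are not taken from the paper.

```python
from typing import Hashable, Sequence, Set


def ac_at_k(
    rankings: Sequence[Sequence[Hashable]],
    root_causes: Sequence[Set[Hashable]],
    k: int,
) -> float:
    """Average AC@k over anomaly cases (assumed definition, see lead-in).

    rankings[i]    : candidates for case i, most suspicious first
    root_causes[i] : ground-truth root cause(s) for case i
    """
    if k <= 0:
        raise ValueError("k must be positive")
    if not rankings or len(rankings) != len(root_causes):
        raise ValueError("need one non-empty ranking per ground-truth set")

    total = 0.0
    for ranked, truth in zip(rankings, root_causes):
        if not truth:
            raise ValueError("each case needs at least one true root cause")
        # Count true root causes that appear among the top-k candidates.
        hits = sum(1 for candidate in ranked[:k] if candidate in truth)
        # Normalize so a case with fewer than k root causes can still score 1.
        total += hits / min(k, len(truth))
    return total / len(rankings)


# Hypothetical example: two anomaly cases with made-up metric names.
example_rankings = [["cpu", "memory", "latency"], ["memory", "cpu", "latency"]]
example_truth = [{"cpu"}, {"latency"}]
print(ac_at_k(example_rankings, example_truth, k=1))
print(ac_at_k(example_rankings, example_truth, k=3))
```

With a single root cause per case, this reduces to the fraction of cases whose root cause is ranked within the top k.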

research
06/13/2022

Causal Inference-Based Root Cause Analysis for Online Service Systems with Intervention Recognition

Fault diagnosis is critical in many domains, as faults may lead to safet...
research
09/05/2022

FIRED: a fine-grained robust performance diagnosis framework for cloud applications

To run a cloud application with the required service quality, operators ...
research
03/30/2022

CMMD: Cross-Metric Multi-Dimensional Root Cause Analysis

In large-scale online services, crucial metrics, a.k.a., key performance...
research
08/13/2018

Simple Root Cause Analysis by Separable Likelihoods

Root Cause Analysis for Anomalies is challenging because of the trade-of...
research
10/20/2021

KabOOM: Unsupervised Crash Categorization through Timeseries Fingerprinting

Modern mobile applications include instrumentation that sample internal ...
research
02/03/2023

Hierarchical Graph Neural Networks for Causal Discovery and Root Cause Localization

In this paper, we propose REASON, a novel framework that enables the aut...
research
01/31/2023

BALANCE: Bayesian Linear Attribution for Root Cause Localization

Root Cause Analysis (RCA) plays an indispensable role in distributed dat...
