CADET: A Systematic Method For Debugging Misconfigurations using Counterfactual Reasoning

10/12/2020
by   Rahul Krishna, et al.
0

Modern computing platforms are highly-configurable with thousands of interacting configurations. However, configuring these systems is challenging. Erroneous configurations can cause unexpected non-functional faults. This paper proposes CADET (short for Causal Debugging Toolkit) that enables users to identify, explain, and fix the root cause of non-functional faults early and in a principled fashion. CADET builds a causal model by observing the performance of the system under different configurations. Then, it uses casual path extraction followed by counterfactual reasoning over the causal model to: (a) identify the root causes of non-functional faults, (b) estimate the effects of various configurable parameters on the performance objective(s), and (c) prescribe candidate repairs to the relevant configuration options to fix the non-functional fault. We evaluated CADET on 5 highly-configurable systems deployed on 3 NVIDIA Jetson systems-on-chip. We compare CADET with state-of-the-art configuration optimization and ML-based debugging approaches. The experimental results indicate that CADET can find effective repairs for faults in multiple non-functional properties with (at most) 17 28 debugging methods. Compared to multi-objective optimization approaches, CADET can find fixes (at most) 9× faster with comparable or better performance gain. Our case study of non-functional faults reported in NVIDIA's forum show that CADET can find 14 30 minutes.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 8

page 9

page 13

page 14

research
01/18/2023

CaRE: Finding Root Causes of Configuration Issues in Highly-Configurable Robots

Robotic systems have several subsystems that possess a huge combinatoria...
research
01/20/2022

Unicorn: Reasoning about Configurable System Performance through the lens of Causality

Modern computer systems are highly configurable, with the total variabil...
research
11/18/2019

Configuration-dependent Fault Localization

In a buggy configurable system, configuration-dependent bugs cause the f...
research
02/26/2019

Transfer Learning for Performance Modeling of Configurable Systems: A Causal Analysis

Modern systems (e.g., deep neural networks, big data analytics, and comp...
research
03/03/2023

Holistic IJTAG-based External and Internal Fault Monitoring in UAVs

Cyber-Physical Systems (CPSs), such as Unmanned Aerial Vehicles (UAVs), ...
research
06/04/2021

VEER: Disagreement-Free Multi-objective Configuration

Software comes with many configuration options, satisfying varying needs...
research
05/18/2023

Counterfactuals for Design: A Model-Agnostic Method For Design Recommendations

We introduce Multi-Objective Counterfactuals for Design (MCD), a novel m...

Please sign up or login with your details

Forgot password? Click here to reset