T-REx: Table Repair Explanations

07/08/2020
by   Daniel Deutch, et al.
0

Data repair is a common and crucial step in many frameworks today, as applications may use data from different sources and of different levels of credibility. Thus, this step has been the focus of many works, proposing diverse approaches. To assist users in understanding the output of such data repair algorithms, we propose T-REx, a system for providing data repair explanations through Shapley values. The system is generic and not specific to a given repair algorithm or approach: it treats the algorithm as a black box. Given a specific table cell selected by the user, T-REx employs Shapley values to explain the significance of each constraint and each table cell in the repair of the cell of interest. T-REx then ranks the constraints and table cells according to their importance in the repair of this cell. This explanation allows users to understand the repair process, as well as to act based on this knowledge, to modify the most influencing constraints or the original database.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2022

AIREPAIR: A Repair Platform for Neural Networks

We present AIREPAIR, a platform for repairing neural networks. It featur...
research
12/20/2019

Rule-based Graph Repair

Model repair is an essential topic in model-driven engineering. Since mo...
research
08/05/2019

Repair Pipelining for Erasure-Coded Storage: Algorithms and Evaluation

We propose repair pipelining, a technique that speeds up the repair perf...
research
07/13/2023

An Analysis of Dialogue Repair in Virtual Voice Assistants

Language speakers often use what are known as repair initiators to mend ...
research
12/26/2017

Pattern-Driven Data Cleaning

Data is inherently dirty and there has been a sustained effort to come u...
research
01/18/2011

Automated Image Processing for the Analysis of DNA Repair Dynamics

The efficient repair of cellular DNA is essential for the maintenance an...
research
02/24/2022

Consistent data fusion with Parker

When combining data from multiple sources, inconsistent data complicates...

Please sign up or login with your details

Forgot password? Click here to reset