LinCQA: Faster Consistent Query Answering with Linear Time Guarantees

08/25/2022
by   Zhiwei Fan, et al.
0

Most data analytical pipelines often encounter the problem of querying inconsistent data that violate pre-determined integrity constraints. Data cleaning is an extensively studied paradigm that singles out a consistent repair of the inconsistent data. Consistent query answering (CQA) is an alternative approach to data cleaning that asks for all tuples guaranteed to be returned by a given query on all (in most cases, exponentially many) repairs of the inconsistent data. This paper identifies a class of acyclic select-project-join (SPJ) queries for which CQA can be solved via SQL rewriting with a linear time guarantee. Our rewriting method can be viewed as a generalization of Yannakakis's algorithm for acyclic joins to the inconsistent setting. We present LinCQA, a system that can output rewritings in both SQL and non-recursive Datalog rules for every query in this class. We show that LinCQA often outperforms the existing CQA systems on both synthetic and real-world workloads, and in some cases, by orders of magnitude.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2019

A SAT-based System for Consistent Query Answering

An inconsistent database is a database that violates one or more integri...
research
02/18/2016

Query Answering with Inconsistent Existential Rules under Stable Model Semantics

Traditional inconsistency-tolerent query answering in ontology-based dat...
research
01/14/2022

Ranked Enumeration of Join Queries with Projections

Join query evaluation with ordering is a fundamental data processing tas...
research
02/24/2022

Consistent data fusion with Parker

When combining data from multiple sources, inconsistent data complicates...
research
01/09/2023

Consistent Query Answering without Repairs in Tables with Nulls and Functional Dependencies

In this paper, we study consistent query answering in tables with nulls ...
research
12/26/2017

Pattern-Driven Data Cleaning

Data is inherently dirty and there has been a sustained effort to come u...
research
04/07/2021

Audio declipping performance enhancement via crossfading

Some audio declipping methods produce waveforms that do not fully respec...

Please sign up or login with your details

Forgot password? Click here to reset