A More General Theory of Static Approximations for Conjunctive Queries

04/01/2019
by Pablo Barceló, et al.

Conjunctive query (CQ) evaluation is NP-complete, but becomes tractable for fragments of bounded hypertreewidth. Approximating a hard CQ by a query from such a fragment can thus allow for an efficient approximate evaluation. While underapproximations (i.e., approximations that return correct answers only) are well understood, the dual notion of overapproximations (i.e., approximations that return complete, but not necessarily sound, answers), and also a more general notion of approximation based on the symmetric difference of query results, are almost unexplored. In fact, the decidability of the basic problems of evaluation, identification, and existence of those approximations has been open. This article establishes a connection between overapproximations and existential pebble games that allows for studying such problems systematically. Building on this connection, it is shown that the evaluation and identification problems for overapproximations can be solved in polynomial time. While the general existence problem remains open, the problem is shown to be decidable in 2EXPTIME over the class of acyclic CQs and in PTIME for Boolean CQs over binary schemata. Additionally, we propose a more liberal notion of overapproximations to remedy the known shortcoming that queries might not have an overapproximation, and study how queries can be overapproximated in the presence of tuple-generating and equality-generating dependencies. The techniques are then extended to symmetric difference approximations and used to provide several complexity results for the identification, existence, and evaluation problems for this type of approximation.
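
To make the notions of sound and complete answers concrete, the following minimal sketch evaluates a cyclic CQ together with two acyclic queries, one contained in it and one containing it. Everything here is an illustrative assumption rather than material from the article: the function `evaluate`, the edge relation `E`, the sample database, and the three queries are hypothetical, and the brute-force evaluation is exponential in the number of variables, so it is not the polynomial-time procedure the article develops. The two acyclic queries are chosen only because they are contained in, respectively contain, the cyclic query; no claim is made that they are the best approximations in the article's sense.

```python
from itertools import product

def evaluate(free_vars, atoms, db):
    """Brute-force CQ evaluation (illustrative only, exponential time).

    atoms: list of (relation_name, tuple_of_variables)
    db:    dict mapping relation_name -> set of tuples
    """
    variables = sorted({v for _, args in atoms for v in args})
    domain = sorted({c for tuples in db.values() for t in tuples for c in t})
    answers = set()
    # Try every assignment of query variables to database constants.
    for values in product(domain, repeat=len(variables)):
        h = dict(zip(variables, values))
        if all(tuple(h[v] for v in args) in db.get(rel, set())
               for rel, args in atoms):
            answers.add(tuple(h[v] for v in free_vars))
    return answers

# Hypothetical database: a directed graph with one triangle, one loop,
# and one extra edge leading into the triangle.
db = {"E": {(1, 2), (2, 3), (3, 1), (6, 1), (5, 5)}}

# Cyclic query Q(x) :- E(x,y), E(y,z), E(z,x)
triangle = [("E", ("x", "y")), ("E", ("y", "z")), ("E", ("z", "x"))]
# Acyclic query contained in Q: Q_under(x) :- E(x,x)
loop = [("E", ("x", "x"))]
# Acyclic query containing Q: Q_over(x) :- E(x,y), E(y,z)
path = [("E", ("x", "y")), ("E", ("y", "z"))]

exact = evaluate(("x",), triangle, db)
under = evaluate(("x",), loop, db)
over = evaluate(("x",), path, db)

print("under:", sorted(under))
print("exact:", sorted(exact))
print("over: ", sorted(over))
print("symmetric difference with over:", sorted(exact ^ over))
```

On this database the contained query returns only correct answers (a subset of the exact result), while the containing query returns every correct answer plus possibly some incorrect ones (a superset). The size of the symmetric difference between the two answer sets is one way to quantify how far an approximation is from the exact result on a given database.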
