The Complexity of Dependency Detection and Discovery in Relational Databases

03/24/2021
by   Thomas Bläsius, et al.
0

Multi-column dependencies in relational databases come associated with two different computational tasks. The detection problem is to decide whether a dependency of a certain type and size holds in a given database, the discovery problem asks to enumerate all valid dependencies of that type. We settle the complexity of both of these problems for unique column combinations (UCCs), functional dependencies (FDs), and inclusion dependencies (INDs). We show that the detection of UCCs and FDs is W[2]-complete when parameterized by the solution size. The discovery of inclusion-wise minimal UCCs is proven to be equivalent under parsimonious reductions to the transversal hypergraph problem of enumerating the minimal hitting sets of a hypergraph. The discovery of FDs is equivalent to the simultaneous enumeration of the hitting sets of multiple input hypergraphs. We further identify the detection of INDs as one of the first natural W[3]-complete problems. The discovery of maximal INDs is shown to be equivalent to enumerating the maximal satisfying assignments of antimonotone, 3-normalized Boolean formulas.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2013

A finite axiomatization of conditional independence and inclusion dependencies

We present a complete finite axiomatization of the unrestricted implicat...
research
12/09/2012

Deciding Monotone Duality and Identifying Frequent Itemsets in Quadratic Logspace

The monotone duality problem is defined as follows: Given two monotone f...
research
06/12/2020

Enumerating Teams in First-Order Team Logics

We start the study of the enumeration complexity of different satisfiabi...
research
11/19/2021

Fast Discovery of Nested Dependencies on JSON Data

Functional and inclusion dependencies are the most widely used classes o...
research
01/07/2021

On the Interaction of Functional and Inclusion Dependencies with Independence Atoms

Infamously, the finite and unrestricted implication problems for the cla...
research
04/20/2021

Inference of Common Multidimensional Equally-Distributed Attributes

Given two relations containing multiple measurements - possibly with unc...
research
01/31/2011

Dependency detection with similarity constraints

Unsupervised two-view learning, or detection of dependencies between two...

Please sign up or login with your details

Forgot password? Click here to reset