Functional Dependencies with Predicates: What Makes the g_3-error Easy to Compute?

06/15/2023
by   Simon Vilmin, et al.
0

The notion of functional dependencies (FDs) can be used by data scientists and domain experts to confront background knowledge against data. To overcome the classical, too restrictive, satisfaction of FDs, it is possible to replace equality with more meaningful binary predicates, and use a coverage measure such as the g_3-error to estimate the degree to which a FD matches the data. It is known that the g_3-error can be computed in polynomial time if equality is used, but unfortunately, the problem becomes NP-complete when relying on more general predicates instead. However, there has been no analysis of which class of predicates or which properties alter the complexity of the problem, especially when going from equality to more general predicates. In this work, we provide such an analysis. We focus on the properties of commonly used predicates such as equality, similarity relations, and partial orders. These properties are: reflexivity, transitivity, symmetry, and antisymmetry. We show that symmetry and transitivity together are sufficient to guarantee that the g_3-error can be computed in polynomial time. However, dropping either of them makes the problem NP-complete.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/01/2021

The complete classification for quantified equality constraints

We prove that QCSP(ℕ;x=y→ y=z) is PSpace-complete, settling a question o...
research
03/17/2018

A modifiction of the CSP algorithm for infinite languages

Constraint Satisfaction Problem on finite sets is known to be NP-complet...
research
05/28/2020

Discovering Domain Orders through Order Dependencies

Much real-world data come with explicitly defined domain orders; e.g., l...
research
02/11/2020

A polynomial time parallel algorithm for graph isomorphism using a quasipolynomial number of processors

The Graph Isomorphism (GI) problem is a theoretically interesting proble...
research
05/14/2019

A new algebraic approach to the graph isomorphism and clique problems

As it follows from Gödel's incompleteness theorems, any consistent forma...
research
04/01/2019

A More General Theory of Static Approximations for Conjunctive Queries

Conjunctive query (CQ) evaluation is NP-complete, but becomes tractable ...
research
09/27/2019

Possible/Certain Functional Dependencies

Incomplete information allow to deal with data with errors, uncertainty ...

Please sign up or login with your details

Forgot password? Click here to reset