Approximate Denial Constraints

05/18/2020
by   Ester Livshits, et al.
0

The problem of mining integrity constraints from data has been extensively studied over the past two decades for commonly used types of constraints including the classic Functional Dependencies (FDs) and the more general Denial Constraints (DCs). In this paper, we investigate the problem of mining approximate DCs (i.e., DCs that are "almost" satisfied) from data. Considering approximate constraints allows us to discover more accurate constraints in inconsistent databases, detect rules that are generally correct but may have a few exceptions, as well as avoid overfitting and obtain more general and less contrived constraints. We introduce the algorithm ADCMiner for mining approximate DCs. An important feature of this algorithm is that it does not assume any specific definition of an approximate DC, but takes the semantics as input. Since there is more than one way to define an approximate DC and different definitions may produce very different results, we do not focus on one definition, but rather on a general family of approximation functions that satisfies some natural axioms defined in this paper and captures commonly used definitions of approximate constraints. We also show how our algorithm can be combined with sampling to return results with high accuracy while significantly reducing the running time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2014

DC approximation approaches for sparse optimization

Sparse optimization refers to an optimization problem involving the zero...
research
12/24/2020

Reproducible Workflow

Reproducibility has been consistently identified as an important compone...
research
03/14/2000

dcs: An Implementation of DATALOG with Constraints

Answer-set programming (ASP) has emerged recently as a viable programmin...
research
08/26/2022

A Subquadratic n^ε-approximation for the Continuous Fréchet Distance

The Fréchet distance is a commonly used similarity measure between curve...
research
11/29/2019

Mining Approximate Acyclic Schemes from Relations

Acyclic schemes have numerous applications in databases and in machine l...
research
06/06/2018

Open Domain Suggestion Mining: Problem Definition and Datasets

We propose a formal definition for the task of suggestion mining in the ...
research
12/24/2022

Triplet Reconstruction and all other Phylogenetic CSPs are Approximation Resistant

We study the natural problem of Triplet Reconstruction (also Rooted Trip...

Please sign up or login with your details

Forgot password? Click here to reset