Database Repairing with Soft Functional Dependencies

09/29/2020
by Nofar Carmeli, et al.

A common interpretation of soft constraints penalizes the database for every violation of every constraint, where the penalty is the cost (weight) of the constraint. A computational challenge is that of finding an optimal subset: a collection of database tuples that minimizes the total penalty when each tuple also has a cost of being excluded. When the constraints are strict (i.e., have an infinite cost), this subset is a "cardinality repair" of an inconsistent database; in soft interpretations, this subset corresponds to a "most probable world" of a probabilistic database, a "most likely intention" of a probabilistic unclean database, and so on. Within the class of functional dependencies, the complexity of finding a cardinality repair is thoroughly understood. Yet very little is known about the complexity of this problem under the more general soft semantics. This paper makes significant progress in this direction. In addition to general insights about the hardness and approximability of the problem, we present algorithms for two special cases: a single functional dependency, and a bipartite matching. The latter is the problem of finding an optimal "almost matching" of a bipartite graph, where a penalty is paid for every lost edge and every violation of monogamy.
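To make the objective concrete, the following is a minimal brute-force sketch of the optimal-subset problem described above. It is not one of the paper's algorithms: it enumerates every subset and is exponential in the number of tuples. Several details are assumptions made for illustration only: a violation is taken to be a pair of kept tuples that agree on the left-hand side of a functional dependency and disagree on its right-hand side, and the names `violates`, `total_penalty`, `optimal_subset`, and the dictionary-based tuple representation are hypothetical.

```python
from itertools import combinations

def violates(t1, t2, fd):
    """Assumed notion of violation: the two tuples agree on every
    left-hand-side attribute but differ on some right-hand-side attribute."""
    lhs, rhs = fd
    return all(t1[a] == t2[a] for a in lhs) and any(t1[b] != t2[b] for b in rhs)

def total_penalty(tuples, kept, fds, deletion_cost):
    """Cost of excluded tuples plus the weight of every violated
    (pair of kept tuples, functional dependency) combination."""
    cost = sum(deletion_cost[i] for i in range(len(tuples)) if i not in kept)
    for i, j in combinations(sorted(kept), 2):
        for fd, weight in fds:
            if violates(tuples[i], tuples[j], fd):
                cost += weight
    return cost

def optimal_subset(tuples, fds, deletion_cost):
    """Exhaustive search over all subsets; feasible only for tiny inputs."""
    n = len(tuples)
    best_cost, best_kept = None, None
    for r in range(n + 1):
        for kept in combinations(range(n), r):
            c = total_penalty(tuples, set(kept), fds, deletion_cost)
            if best_cost is None or c < best_cost:
                best_cost, best_kept = c, set(kept)
    return best_kept, best_cost

# Hypothetical toy relation with attributes A and B and one soft FD A -> B.
rows = [{"A": 1, "B": "x"}, {"A": 1, "B": "y"}, {"A": 2, "B": "z"}]
costs = [1, 1, 1]                          # cost of excluding each tuple
expensive_fd = [((("A",), ("B",)), 5)]     # violating the FD costs 5
cheap_fd = [((("A",), ("B",)), 0.5)]       # violating the FD costs 0.5

print(optimal_subset(rows, expensive_fd, costs))
print(optimal_subset(rows, cheap_fd, costs))
```

In this toy instance, when the dependency weight (5) exceeds the deletion cost, the best choice drops one of the two conflicting tuples for a total penalty of 1; when the weight is 0.5, keeping all three tuples and paying for the single violation is cheaper. Letting the weight grow without bound recovers the strict case, where the result is a cardinality repair.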


research
08/30/2017

The Complexity of Computing a Cardinality Repair for Functional Dependencies

For a relation that violates a set of functional dependencies, we consid...
research
12/20/2017

Computing Optimal Repairs for Functional Dependencies

We investigate the complexity of computing an optimal repair of an incon...
research
01/02/2020

Complexity and Efficient Algorithms for Data Inconsistency Evaluating and Repairing

Data inconsistency evaluating and repairing are major concerns in data q...
research
06/30/2011

A Maximal Tractable Class of Soft Constraints

Many researchers in artificial intelligence are beginning to explore the...
research
12/23/2022

The Consistency of Probabilistic Databases with Independent Cells

A probabilistic database with attribute-level uncertainty consists of re...
research
04/15/2010

Propagating Conjunctions of AllDifferent Constraints

We study propagation algorithms for the conjunction of two AllDifferent ...
research
06/18/2012

The Most Persistent Soft-Clique in a Set of Sampled Graphs

When searching for characteristic subpatterns in potentially noisy graph...
