On the complexity of finding set repairs for data-graphs

06/15/2022
by   Sergio Abriola, et al.
0

In the deeply interconnected world we live in, pieces of information link domains all around us. As graph databases embrace effectively relationships among data and allow processing and querying these connections efficiently, they are rapidly becoming a popular platform for storage that supports a wide range of domains and applications. As in the relational case, it is expected that data preserves a set of integrity constraints that define the semantic structure of the world it represents. When a database does not satisfy its integrity constraints, a possible approach is to search for a 'similar' database that does satisfy the constraints, also known as a repair. In this work, we study the problem of computing subset and superset repairs for graph databases with data values using a notion of consistency based on a set of Reg-GXPath expressions as integrity constraints. We show that for positive fragments of Reg-GXPath these problems admit a polynomial-time algorithm, while the full expressive power of the language renders them intractable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/03/2023

Data-graph repairs: the preferred approach

Repairing inconsistent knowledge bases is a task that has been assessed,...
research
12/23/2022

The Consistency of Probabilistic Databases with Independent Cells

A probabilistic database with attribute-level uncertainty consists of re...
research
04/05/2020

Learning Over Dirty Data Without Cleaning

Real-world datasets are dirty and contain many errors. Examples of these...
research
04/06/2019

Inconsistency Measures for Relational Databases

In this paper, building on work done on measuring inconsistency in knowl...
research
09/29/2021

An epistemic approach to model uncertainty in data-graphs

Graph databases are becoming widely successful as data models that allow...
research
09/27/2018

Repair-Based Degrees of Database Inconsistency: Computation and Complexity

We propose a generic numerical measure of the inconsistency of a databas...

Please sign up or login with your details

Forgot password? Click here to reset