Bag Query Containment and Information Theory

06/24/2019
by   Mahmoud Abo Khamis, et al.
0

The query containment problem is a fundamental algorithmic problem in data management. While this problem is well understood under set semantics, it is by far less understood under bag semantics. In particular, it is a long-standing open question whether or not the conjunctive query containment problem under bag semantics is decidable. We unveil tight connections between information theory and the conjunctive query containment under bag semantics. These connections are established using information inequalities, which are considered to be the laws of information theory. Our first main result asserts that deciding the validity of maxima of information inequalities is many-one equivalent to the restricted case of conjunctive query containment in which the containing query is acyclic; thus, either both these problems are decidable or both are undecidable. Our second main result identifies a new decidable case of the conjunctive query containment problem under bag semantics. Specifically, we give an exponential time algorithm for conjunctive query containment under bag semantics, provided the containing query is chordal and admits a simple junction tree.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/23/2021

Determinacy of Real Conjunctive Queries. The Boolean Case

In their classical 1993 paper [CV93] Chaudhuri and Vardi notice that som...
research
03/17/2018

Datalog: Bag Semantics via Set Semantics

Duplicates in data management are common and problematic. In this work, ...
research
11/07/2019

Query-bag Matching with Mutual Coverage for Information-seeking Conversations in E-commerce

Information-seeking conversation system aims at satisfying the informati...
research
08/25/2023

Discovering Dichotomies for Problems in Database Theory

Dichotomy theorems, which characterize the conditions under which a prob...
research
11/23/2022

Run-Based Semantics for RPQs

The formalism of RPQs (regular path queries) is an important building bl...
research
04/24/2023

Applications of Information Inequalities to Database Theory Problems

The paper describes several applications of information inequalities to ...
research
12/17/2022

A Unified Approach for Resilience and Causal Responsibility with Integer Linear Programming (ILP) and LP Relaxations

Resilience is one of the key algorithmic problems underlying various for...

Please sign up or login with your details

Forgot password? Click here to reset