Bag Query Containment and Information Theory

by Mahmoud Abo Khamis, et al.

The query containment problem is a fundamental algorithmic problem in data management. While this problem is well understood under set semantics, it is far less well understood under bag semantics. In particular, it is a long-standing open question whether the conjunctive query containment problem under bag semantics is decidable. We unveil tight connections between information theory and conjunctive query containment under bag semantics. These connections are established using information inequalities, which are considered to be the laws of information theory. Our first main result asserts that deciding the validity of maxima of information inequalities is many-one equivalent to the restricted case of conjunctive query containment in which the containing query is acyclic; thus, either both of these problems are decidable or both are undecidable. Our second main result identifies a new decidable case of the conjunctive query containment problem under bag semantics. Specifically, we give an exponential-time algorithm for conjunctive query containment under bag semantics, provided that the containing query is chordal and admits a simple junction tree.
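
To make the bag-semantics setting concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes Boolean conjunctive queries evaluated over a set database, where the bag-semantics answer is the number of homomorphisms from the query into the database. The names `count_homomorphisms` and `refutes_containment`, and the encoding of queries as lists of atoms, are hypothetical choices made for this illustration. The two example queries are equivalent under set semantics, yet a single small database shows that the first is not contained in the second under bag semantics.

```python
from itertools import product


def count_homomorphisms(query, database):
    """Count assignments of query variables to database constants that
    map every atom of the query to a tuple present in the database.

    query:    list of atoms, each a pair (relation_name, (var, ...))
    database: dict mapping relation_name -> set of tuples
    """
    variables = sorted({v for _, args in query for v in args})
    domain = sorted({c for tuples in database.values() for t in tuples for c in t})
    count = 0
    # Brute force over all assignments; exponential in the number of variables.
    for values in product(domain, repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if all(
            tuple(assignment[v] for v in args) in database.get(rel, set())
            for rel, args in query
        ):
            count += 1
    return count


def refutes_containment(q1, q2, database):
    """Return True if this database witnesses that q1 is NOT bag-contained
    in q2, i.e. q1 has strictly more homomorphisms than q2 on it."""
    return count_homomorphisms(q1, database) > count_homomorphisms(q2, database)


# Q1 :- R(x, y), R(x, z)      Q2 :- R(x, y)
q1 = [("R", ("x", "y")), ("R", ("x", "z"))]
q2 = [("R", ("x", "y"))]
db = {"R": {("a", "b"), ("a", "c")}}

print(count_homomorphisms(q1, db))   # 4
print(count_homomorphisms(q2, db))   # 2
print(refutes_containment(q1, q2, db))  # True
print(refutes_containment(q2, q1, db))  # False
```

A check like this can only refute containment on a given database. Establishing containment requires an argument over all databases, which is where the open decidability question arises.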




