Quantifying the Loss of Acyclic Join Dependencies

10/26/2022
by   Batya Kenig, et al.
0

Acyclic schemas possess known benefits for database design, speeding up queries, and reducing space requirements. An acyclic join dependency (AJD) is lossless with respect to a universal relation if joining the projections associated with the schema results in the original universal relation. An intuitive and standard measure of loss entailed by an AJD is the number of redundant tuples generated by the acyclic join. Recent work has shown that the loss of an AJD can also be characterized by an information-theoretic measure. Motivated by the problem of automatically fitting an acyclic schema to a universal relation, we investigate the connection between these two characterizations of loss. We first show that the loss of an AJD is captured using the notion of KL-Divergence. We then show that the KL-divergence can be used to bound the number of redundant tuples. We prove a deterministic lower bound on the percentage of redundant tuples. For an upper bound, we propose a random database model, and establish a bound that holds in expectation over a random choice of relation, which coincides with the lower bound for large relation instances.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2019

A New Lower Bound for Kullback-Leibler Divergence Based on Hammersley-Chapman-Robbins Bound

In this paper, we derive a useful lower bound for the Kullback-Leibler d...
research
08/20/2022

Safe Subjoins in Acyclic Joins

It is expensive to compute joins, often due to large intermediate relati...
research
08/24/2019

Relation between the Kantorovich-Wasserstein metric and the Kullback-Leibler divergence

We discuss a relation between the Kantorovich-Wasserstein (KW) metric an...
research
07/28/2019

A Lower Bound on Cycle-Finding in Sparse Digraphs

We consider the problem of finding a cycle in a sparse directed graph G ...
research
03/22/2019

Instance and Output Optimal Parallel Algorithms for Acyclic Joins

Massively parallel join algorithms have received much attention in recen...
research
09/20/2020

Consistency, Acyclicity, and Positive Semirings

In several different settings, one comes across situations in which the ...
research
04/21/2020

An Information-Theoretic Proof of the Streaming Switching Lemma for Symmetric Encryption

Motivated by a fundamental paradigm in cryptography, we consider a recen...

Please sign up or login with your details

Forgot password? Click here to reset