Discovering Domain Orders through Order Dependencies

05/28/2020
by   Reza Karegar, et al.
0

Much real-world data come with explicitly defined domain orders; e.g., lexicographic order for strings, numeric for integers, and chronological for time. Our goal is to discover implicit domain orders that we do not already know; for instance, that the order of months in the Lunar calendar is Corner < Apricot < Peach, and so on. To do so, we enhance data profiling methods by discovering implicit domain orders in data through order dependencies (ODs). We first identify tractable special cases and then proceed towards the most general case, which we prove is NP-complete. Nevertheless, we show that the general case can be effectively handled by a SAT solver. We also propose an interestingness measure to rank the discovered implicit domain orders. Finally, we report on the results of an experimental evaluation using real-world datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2020

Choice functions based on sets of strict partial orders: an axiomatic characterisation

Methods for choosing from a set of options are often based on a strict p...
research
06/15/2023

Functional Dependencies with Predicates: What Makes the g_3-error Easy to Compute?

The notion of functional dependencies (FDs) can be used by data scientis...
research
05/28/2019

Discovery of Band Order Dependencies

We enhance dependency-based data cleaning with approximate band conditio...
research
07/22/2017

Possible and Certain Answers for Queries over Order-Incomplete Data

To combine and query ordered data from multiple sources, one needs to ha...
research
02/28/2020

The Complexity of Possible Winners On Partial Chains

The Possible Winner (PW) problem, a fundamental algorithmic problem in c...
research
11/29/2019

Mining Approximate Acyclic Schemes from Relations

Acyclic schemes have numerous applications in databases and in machine l...
research
01/03/2021

Computing Cliques and Cavities in Networks

Complex networks have complete subgraphs such as nodes, edges, triangles...

Please sign up or login with your details

Forgot password? Click here to reset