MATE: Multi-Attribute Table Extraction

10/01/2021
by   Mahdi Esmailoghli, et al.
0

A core operation in data discovery is to find joinable tables for a given table. Real-world tables include both unary and n-ary join keys. However, existing table discovery systems are optimized for unary joins. These systems are ineffective and slow in the existence of n-ary keys due to a large number of false positives. In this paper, we introduce MATE, a table discovery system that leverages a novel hash-based index that enables n-ary join discovery through a space-efficient super key. We design a filtering layer that uses a novel hash, XASH. This hash function encodes the syntactic features of all column values and aggregates them into a super key, which allows the system to efficiently prune tables with non-joinable rows. Our join discovery system leads to up to 6300x fewer false positives and 370x faster table discovery in comparison to state-of-the-art.

READ FULL TEXT
research
09/18/2022

Scaling and Load-Balancing Equi-Joins

The task of joining two tables is fundamental for querying databases. In...
research
12/11/2020

Discovering Multi-Table Functional Dependencies Without Full Join Computation

In this paper, we study the problem of discovering join FDs, i.e., funct...
research
11/20/2020

Dataset Discovery in Data Lakes

Data analytics stands to benefit from the increasing availability of dat...
research
05/11/2017

ROCKER: A Refinement Operator for Key Discovery

The Linked Data principles provide a decentral approach for publishing s...
research
07/28/2023

Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries

This paper presents predicate transfer, a novel method that optimizes jo...
research
12/01/2020

Scalable Data Discovery Using Profiles

We study the problem of discovering joinable datasets at scale. This is,...
research
04/17/2023

DIALITE: Discover, Align and Integrate Open Data Tables

We demonstrate a novel table discovery pipeline called DIALITE that allo...

Please sign up or login with your details

Forgot password? Click here to reset