A Lightweight Algorithm to Uncover Deep Relationships in Data Tables

09/07/2020
by   Jin Cao, et al.
0

Many data we collect today are in tabular form, with rows as records and columns as attributes associated with each record. Understanding the structural relationship in tabular data can greatly facilitate the data science process. Traditionally, much of this relational information is stored in table schema and maintained by its creators, usually domain experts. In this paper, we develop automated methods to uncover deep relationships in a single data table without expert or domain knowledge. Our method can decompose a data table into layers of smaller tables, revealing its deep structure. The key to our approach is a computationally lightweight forward addition algorithm that we developed to recursively extract the functional dependencies between table columns that are scalable to tables with many columns. With our solution, data scientists will be provided with automatically generated, data-driven insights when exploring new data sets.

READ FULL TEXT
research
03/27/2022

StruBERT: Structure-aware BERT for Table Search and Matching

A large amount of information is stored in data tables. Users can search...
research
03/05/2022

Selecting Sub-tables for Data Exploration

We present a framework for creating small, informative sub-tables of lar...
research
04/20/2023

Comparative Analysis of Deterministic and Nondeterministic Decision Trees for Decision Tables from Closed Classes

In this paper, we consider classes of decision tables with many-valued d...
research
05/25/2017

Synthesizing Mapping Relationships Using Table Corpus

Mapping relationships, such as (country, country-code) or (company, stoc...
research
06/29/2021

Is 2NF a Stable Normal Form?

Traditionally, it was accepted that a relational database can be normali...
research
08/05/2021

Handling Inconsistencies in Tables with Nulls and Functional Dependencies

In this paper we address the problem of handling inconsistencies in tabl...
research
06/01/2017

One button machine for automating feature engineering in relational databases

Feature engineering is one of the most important and time consuming task...

Please sign up or login with your details

Forgot password? Click here to reset