Table Enrichment System for Machine Learning

04/18/2022
by   Yuyang Dong, et al.
0

Data scientists are constantly facing the problem of how to improve prediction accuracy with insufficient tabular data. We propose a table enrichment system that enriches a query table by adding external attributes (columns) from data lakes and improves the accuracy of machine learning predictive models. Our system has four stages, join row search, task-related table selection, row and column alignment, and feature selection and evaluation, to efficiently create an enriched table for a given query table and a specified machine learning task. We demonstrate our system with a web UI to show the use cases of table enrichment.

READ FULL TEXT
research
09/05/2019

Table-to-Text Generation with Effective Hierarchical Encoder on Three Dimensions (Row, Column and Time)

Although Seq2Seq models for table-to-text generation have achieved remar...
research
06/20/2023

RoTaR: Efficient Row-Based Table Representation Learning via Teacher-Student Training

We propose RoTaR, a row-based table representation learning method, to a...
research
03/11/2011

SPPAM - Statistical PreProcessing AlgorithM

Most machine learning tools work with a single table where each row is a...
research
01/07/2021

An Algorithm for the Discovery of Independence from Data

For years, independence has been considered as an important concept in m...
research
06/14/2019

Comparing Machine Learning Approaches for Table Recognition in Historical Register Books

We present in this paper experiments on Table Recognition in hand-writte...
research
08/11/2021

Retrieval Interaction Machine for Tabular Data Prediction

Prediction over tabular data is an essential task in many data science a...
research
10/28/2021

Generating Table Vector Representations

High-quality Web tables are rich sources of information that can be used...

Please sign up or login with your details

Forgot password? Click here to reset