DIALITE: Discover, Align and Integrate Open Data Tables

04/17/2023
by Aamod Khatiwada, et al.

We demonstrate DIALITE, a novel table discovery pipeline that lets users discover, integrate, and analyze open data tables. DIALITE has three main stages. First, users discover tables from open data platforms using state-of-the-art table discovery techniques. Second, DIALITE integrates the discovered tables into a single integrated table. Finally, users analyze the integration result by applying different downstream tasks to it. The pipeline is flexible: users can easily add and compare additional discovery and integration algorithms.
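The three stages and the pluggable design can be illustrated with a minimal sketch. This is not DIALITE's actual interface, and the abstract does not name specific algorithms: every function name here (`discover_by_schema_overlap`, `integrate_by_outer_union`, `null_fraction`, `run_pipeline`) is hypothetical, column-name Jaccard overlap stands in for a real discovery technique, and a plain outer union stands in for a real integration algorithm. The toy tables are placeholders, not real open data.

```python
from typing import Callable, Dict, List, Optional, Set, Tuple

Row = Dict[str, Optional[str]]
Table = List[Row]


def columns(table: Table) -> Set[str]:
    """Collect every column name that appears in any row."""
    cols: Set[str] = set()
    for row in table:
        cols.update(row)
    return cols


def discover_by_schema_overlap(query: Table, lake: Dict[str, Table], k: int = 2) -> List[str]:
    """Stand-in discovery step: rank lake tables by Jaccard overlap of column names."""
    q = columns(query)

    def score(name: str) -> float:
        c = columns(lake[name])
        union = q | c
        return len(q & c) / len(union) if union else 0.0

    ranked = sorted(lake, key=lambda name: (-score(name), name))
    # Keep the top-k tables that share at least one column with the query.
    return [name for name in ranked[:k] if score(name) > 0]


def integrate_by_outer_union(tables: List[Table]) -> Table:
    """Stand-in integration step: outer union, padding missing columns with None."""
    all_cols = sorted(set().union(*(columns(t) for t in tables)))
    return [{c: row.get(c) for c in all_cols} for t in tables for row in t]


def null_fraction(table: Table) -> float:
    """Stand-in downstream task: fraction of cells that are missing."""
    cells = [v for row in table for v in row.values()]
    return sum(v is None for v in cells) / len(cells) if cells else 0.0


def run_pipeline(
    query: Table,
    lake: Dict[str, Table],
    discover: Callable[[Table, Dict[str, Table]], List[str]],
    integrate: Callable[[List[Table]], Table],
    analyze: Callable[[Table], float],
) -> Tuple[Table, float]:
    """Discover related tables, integrate them with the query table, then analyze."""
    names = discover(query, lake)
    integrated = integrate([query] + [lake[n] for n in names])
    return integrated, analyze(integrated)


# Toy placeholder data (hypothetical).
query = [{"city": "A", "park": "P1"}]
lake = {
    "parks": [{"city": "A", "park": "P2", "acres": "100"}],
    "budget": [{"dept": "D1", "amount": "10"}],
}

integrated, missing = run_pipeline(
    query, lake, discover_by_schema_overlap, integrate_by_outer_union, null_fraction
)
```

Because each stage is passed in as a function, a different discovery or integration algorithm can be swapped in and compared by calling `run_pipeline` again with the same query table and data lake.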


