Tab2KG: Semantic Table Interpretation with Lightweight Semantic Profiles

02/02/2023
by   Simon Gottschalk, et al.
0

Tabular data plays an essential role in many data analytics and machine learning tasks. Typically, tabular data does not possess any machine-readable semantics. In this context, semantic table interpretation is crucial for making data analytics workflows more robust and explainable. This article proposes Tab2KG - a novel method that targets at the interpretation of tables with previously unseen data and automatically infers their semantics to transform them into semantic data graphs. We introduce original lightweight semantic profiles that enrich a domain ontology's concepts and relations and represent domain and table characteristics. We propose a one-shot learning approach that relies on these profiles to map a tabular dataset containing previously unseen instances to a domain ontology. In contrast to the existing semantic table interpretation approaches, Tab2KG relies on the semantic profiles only and does not require any instance lookup. This property makes Tab2KG particularly suitable in the data analytics context, in which data tables typically contain new instances. Our experimental evaluation on several real-world datasets from different application domains demonstrates that Tab2KG outperforms state-of-the-art semantic table interpretation baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/12/2019

Simple-ML: Towards a Framework for Semantic Data Analytics Workflows

In this paper we present the Simple-ML framework that we develop to supp...
research
02/17/2018

TabVec: Table Vectors for Classification of Web Tables

There are hundreds of millions of tables in Web pages that contain usefu...
research
12/29/2022

WarpGate: A Semantic Join Discovery System for Cloud Data Warehouses

Data discovery is a major challenge in enterprise data analysis: users o...
research
10/28/2021

Generating Table Vector Representations

High-quality Web tables are rich sources of information that can be used...
research
11/09/2019

Table-to-Text Natural Language Generation with Unseen Schemas

Traditional table-to-text natural language generation (NLG) tasks focus ...
research
11/20/2020

Dataset Discovery in Data Lakes

Data analytics stands to benefit from the increasing availability of dat...
research
09/11/2021

Making Table Understanding Work in Practice

Understanding the semantics of tables at scale is crucial for tasks like...

Please sign up or login with your details

Forgot password? Click here to reset