ColNet: Embedding the Semantics of Web Tables for Column Type Prediction

11/04/2018
by   Jiaoyan Chen, et al.
0

Automatically annotating column types with knowledge base (KB) concepts is a critical task to gain a basic understanding of web tables. Current methods rely on either table metadata like column name or entity correspondences of cells in the KB, and may fail to deal with growing web tables with incomplete meta information. In this paper we propose a neural network based column type annotation framework named ColNet which is able to integrate KB reasoning and lookup with machine learning and can automatically train Convolutional Neural Networks for prediction. The prediction model not only considers the contextual semantics within a cell using word representation, but also embeds the semantic of a column by learning locality features from multiple cells. The method is evaluated with DBPedia and two different web table datasets, T2Dv2 from the general Web and Limaye from Wikipedia pages, and achieves higher performance than the state-of-the-art approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2019

Learning Semantic Annotations for Tabular Data

The usefulness of tabular data such as web tables critically depends on ...
research
07/05/2022

Entity Linking in Tabular Data Needs the Right Attention

Understanding the semantic meaning of tabular data requires Entity Linki...
research
10/05/2020

TabEAno: Table to Knowledge Graph Entity Annotation

In the Open Data era, a large number of table resources have been made a...
research
09/20/2019

Automatic Table completion using Knowledge Base

Table is a popular data format to organize and present relational inform...
research
09/11/2021

Making Table Understanding Work in Practice

Understanding the semantics of tables at scale is crucial for tasks like...
research
10/03/2022

Russian Web Tables: A Public Corpus of Web Tables for Russian Language Based on Wikipedia

Corpora that contain tabular data such as WebTables are a vital resource...
research
12/15/2020

Semantic Annotation for Tabular Data

Detecting semantic concept of columns in tabular data is of particular i...

Please sign up or login with your details

Forgot password? Click here to reset