Relation Extraction from Tables using Artificially Generated Metadata

08/24/2021
by   Gaurav Singh, et al.
0

Relation Extraction (RE) from tables is the task of identifying relations between pairs of columns of a table. Generally, RE models for this task require labelled tables for training. These labelled tables can also be generated artificially from a Knowledge Graph (KG), which makes the cost to acquire them much lower in comparison to manual annotations. However, unlike real tables, these synthetic tables lack associated metadata, such as, column-headers, captions, etc; this is because synthetic tables are created out of KGs that do not store such metadata. Meanwhile, previous works have shown that metadata is important for accurate RE from tables. To address this issue, we propose methods to artificially create some of this metadata for synthetic tables. Afterward, we experiment with a BERT-based model, in line with recently published works, that takes as input a combination of proposed artificial metadata and table content. Our empirical results show that this leads to an improvement of 9%-45% in F1 score, in absolute terms, over 2 tabular datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/05/2019

TableNet: An Approach for Determining Fine-grained Relations for Wikipedia Tables

Wikipedia tables represent an important resource, where information is o...
research
07/03/2022

DiSCoMaT: Distantly Supervised Composition Extraction from Tables in Materials Science Articles

A crucial component in the curation of KB for a scientific domain is inf...
research
12/17/2018

Optimizing Organizations for Navigating Data Lakes

Navigation is known to be an effective complement to search. In addition...
research
08/18/2020

An Annotated Corpus of Webtables for Information Extraction Tasks

Information Extraction is a well-researched area of Natural Language Pro...
research
06/26/2020

TURL: Table Understanding through Representation Learning

Relational tables on the Web store a vast amount of knowledge. Owing to ...
research
05/23/2023

Schema-Driven Information Extraction from Heterogeneous Tables

In this paper, we explore the question of whether language models (LLMs)...
research
06/08/2022

STable: Table Generation Framework for Encoder-Decoder Models

The output structure of database-like tables, consisting of values struc...

Please sign up or login with your details

Forgot password? Click here to reset