Novel Entity Discovery from Web Tables

02/01/2020
by   Shuo Zhang, et al.
0

When working with any sort of knowledge base (KB) one has to make sure it is as complete and also as up-to-date as possible. Both tasks are non-trivial as they require recall-oriented efforts to determine which entities and relationships are missing from the KB. As such they require a significant amount of labor. Tables on the Web, on the other hand, are abundant and have the distinct potential to assist with these tasks. In particular, we can leverage the content in such tables to discover new entities, properties, and relationships. Because web tables typically only contain raw textual content we first need to determine which cells refer to which known entities—a task we dub table-to-KB matching. This first task aims to infer table semantics by linking table cells and heading columns to elements of a KB. Then second task builds upon these linked entities and properties to not only identify novel ones in the same table but also to bootstrap their type and additional relationships. We refer to this process as novel entity discovery and, to the best of our knowledge, it is the first endeavor on mining the unlinked cells in web tables. Our method identifies not only out-of-KB (“novel”) information but also novel aliases for in-KB (“known”) entities. When evaluated using three purpose-built test collections, we find that our proposed approaches obtain a marked improvement in terms of precision over our baselines whilst keeping recall stable.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/29/2017

EntiTables: Smart Assistance for Entity-Focused Tables

Tables are among the most powerful and practical tools for organizing an...
research
04/30/2023

S2abEL: A Dataset for Entity Linking from Scientific Tables

Entity linking (EL) is the task of linking a textual mention to its corr...
research
02/01/2020

Web Table Extraction, Retrieval and Augmentation: A Survey

Tables are a powerful and popular tool for organizing and manipulating d...
research
10/05/2020

TabEAno: Table to Knowledge Graph Entity Annotation

In the Open Data era, a large number of table resources have been made a...
research
09/08/2019

Auto-completion for Data Cells in Relational Tables

We address the task of auto-completing data cells in relational tables. ...
research
05/16/2018

SmartTable: A Spreadsheet Program with Intelligent Assistance

We introduce SmartTable, an online spreadsheet application that is equip...
research
06/08/2022

STable: Table Generation Framework for Encoder-Decoder Models

The output structure of database-like tables, consisting of values struc...

Please sign up or login with your details

Forgot password? Click here to reset