Semantic Table Retrieval using Keyword and Table Queries

05/13/2021
by Shuo Zhang, et al.

Tables on the Web contain a vast amount of knowledge in a structured form. To tap into this valuable resource, we address the problem of table retrieval: answering an information need with a ranked list of tables. We investigate this problem in two different variants, based on how the information need is expressed: as a keyword query or as an existing table ("query-by-table"). The main novel contribution of this work is a semantic table retrieval framework for matching information needs (keyword or table queries) against tables. Specifically, we (i) represent queries and tables in multiple semantic spaces (both discrete sparse and continuous dense vector representations) and (ii) introduce various similarity measures for matching those semantic representations. We consider all possible combinations of semantic representations and similarity measures and use these as features in a supervised learning model. Using two purpose-built test collections based on Wikipedia tables, we demonstrate significant and substantial improvements over state-of-the-art baselines.
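
To make the feature construction concrete, the following is a minimal sketch of how matching features for a query-table pair might be computed. It assumes each semantic space yields a list of dense vectors for the query and for the table. The specific measures shown (cosine between centroids, and max/average over pairwise cosines) are illustrative assumptions, as are all function names and the space names "word" and "entity"; the abstract does not specify them.

```python
import math
from itertools import product


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def match_features(query_spaces, table_spaces):
    """Build one feature per (semantic space, similarity measure) combination.

    Both arguments map a hypothetical space name to a list of vectors.
    Spaces with no vectors on either side get 0.0 for every measure.
    """
    features = {}
    for space in sorted(query_spaces):
        q_vecs = query_spaces[space]
        t_vecs = table_spaces.get(space, [])
        if not q_vecs or not t_vecs:
            for measure in ("centroid", "max", "avg"):
                features[f"{space}_{measure}"] = 0.0
            continue
        # Assumed measure 1: compare aggregated (centroid) representations.
        features[f"{space}_centroid"] = cosine(centroid(q_vecs), centroid(t_vecs))
        # Assumed measures 2 and 3: aggregate over all pairwise similarities.
        pairs = [cosine(q, t) for q, t in product(q_vecs, t_vecs)]
        features[f"{space}_max"] = max(pairs)
        features[f"{space}_avg"] = sum(pairs) / len(pairs)
    return features


if __name__ == "__main__":
    # Toy vectors standing in for real embeddings.
    query = {"word": [[1.0, 0.0], [0.0, 1.0]], "entity": [[1.0, 0.0, 0.0]]}
    table = {"word": [[1.0, 0.0]], "entity": [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]}
    for name, value in match_features(query, table).items():
        print(name, round(value, 4))
```

The resulting dictionary could then be turned into a feature vector for any supervised ranking model; which learner to use is left open here.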

