TableQnA: Answering List Intent Queries With Web Tables

01/10/2020
by   Kaushik Chakrabarti, et al.
4

The web contains a vast corpus of HTML tables. They can be used to provide direct answers to many web queries. We focus on answering two classes of queries with those tables: those seeking lists of entities (e.g., `cities in california') and those seeking superlative entities (e.g., `largest city in california'). The main challenge is to achieve high precision with significant coverage. Existing approaches train machine learning models to select the answer from the candidates; they rely on textual match features between the query and the content of the table along with features capturing table quality/importance. These features alone are inadequate for achieving the above goals. Our main insight is that we can improve precision by (i) first extracting intent (structured information) from the query for the above query classes and (ii) then performing structure-aware matching (instead of just textual matching) between the extracted intent and the candidates to select the answer. We model (i) as a sequence tagging task. We leverage state-of-the-art deep neural network models with word embeddings. The model requires large scale training data which is expensive to obtain via manual labeling; we therefore develop a novel method to automatically generate the training data. For (ii), we develop novel features to compute structure-aware match and train a machine learning model. Our experiments on real-life web search queries show that (i) our intent extractor for list and superlative intent queries has significantly higher precision and coverage compared with baseline approaches and (ii) our table answer selector significantly outperforms the state-of-the-art baseline approach. This technology has been used in production by Microsoft's Bing search engine since 2016.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/10/2020

Open Domain Question Answering Using Web Tables

Tables extracted from web documents can be used to directly answer many ...
research
05/13/2021

Semantic Table Retrieval using Keyword and Table Queries

Tables on the Web contain a vast amount of knowledge in a structured for...
research
06/08/2017

Content-Based Table Retrieval for Web Queries

Understanding the connections between unstructured text and semi-structu...
research
02/16/2018

Ad Hoc Table Retrieval using Semantic Similarity

We introduce and address the problem of ad hoc table retrieval: answerin...
research
10/09/2021

Lookup or Exploratory: What is Your Search Intent?

Search query specificity is broadly divided into two categories - Explor...
research
03/21/2023

Improving Content Retrievability in Search with Controllable Query Generation

An important goal of online platforms is to enable content discovery, i....
research
11/13/2018

Text Assisted Insight Ranking Using Context-Aware Memory Network

Extracting valuable facts or informative summaries from multi-dimensiona...

Please sign up or login with your details

Forgot password? Click here to reset