A Graph Representation of Semi-structured Data for Web Question Answering

10/14/2020
by   Xingyao Zhang, et al.
0

The abundant semi-structured data on the Web, such as HTML-based tables and lists, provide commercial search engines a rich information source for question answering (QA). Different from plain text passages in Web documents, Web tables and lists have inherent structures, which carry semantic correlations among various elements in tables and lists. Many existing studies treat tables and lists as flat documents with pieces of text and do not make good use of semantic information hidden in structures. In this paper, we propose a novel graph representation of Web tables and lists based on a systematic categorization of the components in semi-structured data as well as their relations. We also develop pre-training and reasoning techniques on the graph model for the QA task. Extensive experiments on several real datasets collected from a commercial engine verify the effectiveness of our approach. Our method improves F1 score by 3.90 points over the state-of-the-art baselines.

READ FULL TEXT
research
01/10/2020

Open Domain Question Answering Using Web Tables

Tables extracted from web documents can be used to directly answer many ...
research
12/29/2020

Unified Open-Domain Question Answering with Structured and Unstructured Knowledge

We study open-domain question answering (ODQA) with structured, unstruct...
research
06/13/2020

Mining Implicit Relevance Feedback from User Behavior for Web Question Answering

Training and refreshing a web-scale Question Answering (QA) system for a...
research
06/13/2020

Mining Implicit Relevance Feedback from User Behavior forWeb Question Answering

Training and refreshing a web-scale Question Answering (QA) system for a...
research
08/20/2018

FedMark: A Marketplace for Federated Data on the Web

The Web of Data (WoD) has experienced a phenomenal growth in the past. T...
research
03/15/2023

Generating contingency tables with fixed marginal probabilities and dependence structures described by loglinear models

We present a method to generate contingency tables that follow loglinear...
research
05/03/2023

Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents with Semantic-Oriented Hierarchical Graphs

Discrete reasoning over table-text documents (e.g., financial reports) g...

Please sign up or login with your details

Forgot password? Click here to reset