Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering

08/05/2021
by   Alexander Hanbo Li, et al.
0

The current state-of-the-art generative models for open-domain question answering (ODQA) have focused on generating direct answers from unstructured textual information. However, a large amount of world's knowledge is stored in structured databases, and need to be accessed using query languages such as SQL. Furthermore, query languages can answer questions that require complex reasoning, as well as offering full explainability. In this paper, we propose a hybrid framework that takes both textual and tabular evidence as input and generates either direct answers or SQL queries depending on which form could better answer the question. The generated SQL queries can then be executed on the associated databases to obtain the final answers. To the best of our knowledge, this is the first paper that applies Text2SQL to ODQA tasks. Empirically, we demonstrate that on several ODQA datasets, the hybrid methods consistently outperforms the baseline models that only take homogeneous input by a large margin. Specifically we achieve state-of-the-art performance on OpenSQuAD dataset using a T5-base model. In a detailed analysis, we demonstrate that the being able to generate structural SQL queries can always bring gains, especially for those questions that requires complex reasoning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/12/2023

Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table

Despite recent interest in open domain question answering (ODQA) over ta...
research
05/03/2022

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

This paper develops the first question answering dataset (DrugEHRQA) con...
research
08/16/2018

PUG: A Framework and Practical Implementation for Why & Why-Not Provenance (extended version)

Explaining why an answer is (or is not) returned by a query is important...
research
07/02/2020

Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

Generative models for open domain question answering have proven to be c...
research
12/25/2020

Brain-inspired Search Engine Assistant based on Knowledge Graph

Search engines can quickly response a hyperlink list according to query ...
research
11/15/2018

End-to-End Learning for Answering Structured Queries Directly over Text

Structured queries expressed in languages (such as SQL, SPARQL, or XQuer...
research
06/06/2023

ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory

Large language models (LLMs) with memory are computationally universal. ...

Please sign up or login with your details

Forgot password? Click here to reset