When to Read Documents or QA History: On Unified and Selective Open-domain QA

06/07/2023
by Kyungjae Lee, et al.

This paper studies open-domain question answering, which aims to answer a diverse range of questions by leveraging knowledge resources. Two types of sources, QA-pair corpora and document corpora, have been actively used, and they have complementary strengths. The former is highly precise when a paraphrase of the given question q was seen and answered during training, and is often posed as a retrieval problem, while the latter generalizes better to unseen questions. A natural follow-up is to leverage both models, but naive pipelining or integration approaches have failed to bring additional gains over either model alone. Our distinction is to interpret the problem as one of calibration: we estimate the confidence of predicted answers and use it as an indicator to decide when to use a document corpus or a QA-pair corpus. The effectiveness of our method was validated on widely adopted benchmarks such as Natural Questions and TriviaQA.
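To make the selective idea concrete, the following is a minimal sketch of confidence-based routing between the two kinds of models. It is not the authors' implementation: the abstract does not specify how confidence is estimated or how the decision is made, so the threshold rule, the `Prediction` type, the `select_answer` function, and both toy models are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# A model here is any callable mapping a question to (answer, confidence).
# Confidence is assumed to be a calibrated estimate in [0, 1] that the
# answer is correct; how to obtain it is outside this sketch.
Model = Callable[[str], Tuple[str, float]]


@dataclass
class Prediction:
    answer: str
    confidence: float
    source: str  # "qa_pair" or "document"


def select_answer(
    question: str,
    qa_pair_model: Model,
    document_model: Model,
    threshold: float = 0.5,  # assumed value; would be tuned on held-out data
) -> Prediction:
    """Use the QA-pair model when it is confident, else read documents."""
    qa_answer, qa_conf = qa_pair_model(question)
    if qa_conf >= threshold:
        return Prediction(qa_answer, qa_conf, "qa_pair")
    doc_answer, doc_conf = document_model(question)
    return Prediction(doc_answer, doc_conf, "document")


# Toy stand-ins (hypothetical): the QA-pair model is confident only for a
# question it has "seen"; the document model always returns a moderate guess.
def toy_qa_pair_model(question: str) -> Tuple[str, float]:
    seen = {"who wrote hamlet?": ("William Shakespeare", 0.95)}
    return seen.get(question.lower(), ("", 0.05))


def toy_document_model(question: str) -> Tuple[str, float]:
    return ("answer read from retrieved passages", 0.60)
```

Calling `select_answer("Who wrote Hamlet?", toy_qa_pair_model, toy_document_model)` routes to the QA-pair model, while a question absent from the toy QA history falls through to the document model. A fixed threshold is the simplest possible decision rule and only works if the confidence scores are comparable and well calibrated, which is the part the paper itself addresses.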

