End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering

06/09/2021
by   Devendra Singh Sachan, et al.
0

We present an end-to-end differentiable training method for retrieval-augmented open-domain question answering systems that combine information from multiple retrieved documents when generating answers. We model retrieval decisions as latent variables over sets of relevant documents. Since marginalizing over sets of retrieved documents is computationally hard, we approximate this using an expectation-maximization algorithm. We iteratively estimate the value of our latent variable (the set of relevant documents for a given question) and then use this estimate to update the retriever and reader parameters. We hypothesize that such end-to-end training allows training signals to flow to the reader and then to the retriever better than staged-wise training. This results in a retriever that is able to select more relevant documents for a question and a reader that is trained on more accurate documents to generate an answer. Experiments on three benchmark datasets demonstrate that our proposed method outperforms all existing approaches of comparable size by 2-3 state-of-the-art results. Our results also demonstrate the feasibility of learning to retrieve to improve answer generation without explicit supervision of retrieval decisions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2023

Generator-Retriever-Generator: A Novel Approach to Open-domain Question Answering

Open-domain question answering (QA) tasks usually require the retrieval ...
research
09/16/2020

DDRQA: Dynamic Document Reranking for Open-domain Multi-hop Question Answering

Open-domain multi-hop question answering (QA) requires to retrieve multi...
research
06/01/2021

End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

Answering complex questions from long documents requires aggregating mul...
research
08/20/2018

Adaptive Document Retrieval for Deep Question Answering

State-of-the-art systems in deep question answering proceed as follows: ...
research
01/02/2021

End-to-End Training of Neural Retrievers for Open-Domain Question Answering

Recent work on training neural retrievers for open-domain question answe...
research
09/23/2022

Variational Open-Domain Question Answering

We introduce the Variational Open-Domain (VOD) framework for end-to-end ...
research
08/27/2021

Query-Focused Extractive Summarisation for Finding Ideal Answers to Biomedical and COVID-19 Questions

This paper presents Macquarie University's participation to the BioASQ S...

Please sign up or login with your details

Forgot password? Click here to reset