Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation

09/29/2020
by   Yinfei Yang, et al.
0

Neural models that independently project questions and answers into a shared embedding space allow for efficient continuous space retrieval from large corpora. Independently computing embeddings for questions and answers results in late fusion of information related to matching questions to their answers. While critical for efficient retrieval, late fusion underperforms models that make use of early fusion (e.g., a BERT based classifier with cross-attention between question-answer pairs). We present a supervised data mining method using an accurate early fusion model to improve the training of an efficient late fusion retrieval model. We first train an accurate classification model with cross-attention between questions and answers. The accurate cross-attention model is then used to annotate additional passages in order to generate weighted training examples for a neural retrieval model. The resulting retrieval model with additional data significantly outperforms retrieval models directly trained with gold annotations on Precision at N (P@N) and Mean Reciprocal Rank (MRR).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/22/2019

ANTIQUE: A Non-Factoid Question Answering Benchmark

Considering the widespread use of mobile and voice search, answer passag...
research
06/07/2022

Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval

Dual-Encoders is a promising mechanism for answer retrieval in question ...
research
10/16/2020

RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering

In open-domain question answering, dense passage retrieval has become a ...
research
05/31/2023

Attention-Based Methods For Audio Question Answering

Audio question answering (AQA) is the task of producing natural language...
research
11/21/2018

Overcoming low-utility facets for complex answer retrieval

Many questions cannot be answered simply; their answers must include num...
research
10/21/2022

An Analysis of Fusion Functions for Hybrid Retrieval

We study hybrid search in text retrieval where lexical and semantic sear...
research
09/30/2019

On Incorporating Semantic Prior Knowlegde in Deep Learning Through Embedding-Space Constraints

The knowledge that humans hold about a problem often extends far beyond ...

Please sign up or login with your details

Forgot password? Click here to reset