Attention-guided Generative Models for Extractive Question Answering

10/12/2021
by   Peng Xu, et al.
13

We propose a novel method for applying Transformer models to extractive question answering (QA) tasks. Recently, pretrained generative sequence-to-sequence (seq2seq) models have achieved great success in question answering. Contributing to the success of these models are internal attention mechanisms such as cross-attention. We propose a simple strategy to obtain an extractive answer span from the generative model by leveraging the decoder cross-attention patterns. Viewing cross-attention as an architectural prior, we apply joint training to further improve QA performance. Empirical results show that on open-domain question answering datasets like NaturalQuestions and TriviaQA, our method approaches state-of-the-art performance on both generative and extractive inference, all while using much fewer parameters. Furthermore, this strategy allows us to perform hallucination-free inference while conferring significant improvements to the model's ability to rerank relevant passages.

READ FULL TEXT

page 2

page 3

page 5

page 6

research
06/07/2017

Question Answering and Question Generation as Dual Tasks

We study the problem of joint question answering (QA) and question gener...
research
11/18/2022

FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering

Generative models have recently started to outperform extractive models ...
research
12/19/2022

Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks

Generative models have been widely applied to solve extractive tasks, wh...
research
09/04/2017

A Unified Query-based Generative Model for Question Generation and Question Answering

We propose a query-based generative model for solving both tasks of ques...
research
01/23/2018

Assertion-based QA with Question-Aware Open Information Extraction

We present assertion based question answering (ABQA), an open domain que...
research
07/02/2020

Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

Generative models for open domain question answering have proven to be c...
research
03/14/2022

Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering

While both extractive and generative readers have been successfully appl...

Please sign up or login with your details

Forgot password? Click here to reset