Towards Confident Machine Reading Comprehension

01/20/2021
by   Rishav Chakravarti, et al.
0

There has been considerable progress on academic benchmarks for the Reading Comprehension (RC) task with State-of-the-Art models closing the gap with human performance on extractive question answering. Datasets such as SQuAD 2.0 NQ have also introduced an auxiliary task requiring models to predict when a question has no answer in the text. However, in production settings, it is also necessary to provide confidence estimates for the performance of the underlying RC model at both answer extraction and "answerability" detection. We propose a novel post-prediction confidence estimation model, which we call Mr.C (short for Mr. Confident), that can be trained to improve a system's ability to refrain from making incorrect predictions with improvements of up to 4 points as measured by Area Under the Curve (AUC) scores. Mr.C can benefit from a novel white-box feature that leverages the underlying RC model's gradients. Performance prediction is particularly important in cases of domain shift (as measured by training RC models on SQUAD 2.0 and evaluating on NQ), where Mr.C not only improves AUC, but also traditional answerability prediction (as measured by a 5 point improvement in F1).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2017

Stochastic Answer Networks for Machine Reading Comprehension

We propose a simple yet robust stochastic answer network (SAN) that simu...
research
09/09/2019

Span Selection Pre-training for Question Answering

BERT (Bidirectional Encoder Representations from Transformers) and relat...
research
03/20/2021

Self-Supervised Test-Time Learning for Reading Comprehension

Recent work on unsupervised question answering has shown that models can...
research
06/24/2019

EQuANt (Enhanced Question Answer Network)

Machine Reading Comprehension (MRC) is an important topic in the domain ...
research
11/05/2020

Context-Aware Answer Extraction in Question Answering

Extractive QA models have shown very promising performance in predicting...
research
10/07/2020

Acyclic Comprehension is equal to Stratified Comprehension

A new criterion of comprehension is defined, initially termed by myself ...

Please sign up or login with your details

Forgot password? Click here to reset