An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering

12/04/2019
by   Shayne Longpre, et al.
0

To produce a domain-agnostic question answering model for the Machine Reading Question Answering (MRQA) 2019 Shared Task, we investigate the relative benefits of large pre-trained language models, various data sampling strategies, as well as query and context paraphrases generated by back-translation. We find a simple negative sampling technique to be particularly effective, even though it is typically used for datasets that include unanswerable questions, such as SQuAD 2.0. When applied in conjunction with per-domain sampling, our XLNet (Yang et al., 2019)-based submission achieved the second best Exact Match and F1 in the MRQA leaderboard competition.

READ FULL TEXT
research
09/18/2019

Pre-trained Language Model for Biomedical Question Answering

The recent success of question answering systems is largely attributed t...
research
04/10/2022

Data Augmentation for Biomedical Factoid Question Answering

We study the effect of seven data augmentation (da) methods in factoid q...
research
10/26/2021

Transferring Domain-Agnostic Knowledge in Video Question Answering

Video question answering (VideoQA) is designed to answer a given questio...
research
10/23/2020

Neural Passage Retrieval with Improved Negative Contrast

In this paper we explore the effects of negative sampling in dual encode...
research
05/14/2019

Multi-step Retriever-Reader Interaction for Scalable Open-domain Question Answering

This paper introduces a new framework for open-domain question answering...
research
01/06/2021

EfficientQA : a RoBERTa Based Phrase-Indexed Question-Answering System

State-of-the-art extractive question answering models achieve superhuman...
research
05/10/2021

ReadTwice: Reading Very Large Documents with Memories

Knowledge-intensive tasks such as question answering often require assim...

Please sign up or login with your details

Forgot password? Click here to reset