Single-dataset Experts for Multi-dataset Question Answering

09/28/2021
by   Dan Friedman, et al.
0

Many datasets have been created for training reading comprehension models, and a natural question is whether we can combine them to build models that (1) perform better on all of the training datasets and (2) generalize and transfer better to new datasets. Prior work has addressed this goal by training one network simultaneously on multiple datasets, which works well on average but is prone to over- or under-fitting different sub-distributions and might transfer worse compared to source models with more overlap with the target dataset. Our approach is to model multi-dataset question answering with a collection of single-dataset experts, by training a collection of lightweight, dataset-specific adapter modules (Houlsby et al., 2019) that share an underlying Transformer model. We find that these Multi-Adapter Dataset Experts (MADE) outperform all our baselines in terms of in-distribution accuracy, and simple methods based on parameter-averaging lead to better zero-shot generalization and few-shot transfer performance, offering a strong and versatile starting point for building new reading comprehension systems.

READ FULL TEXT
research
05/31/2019

MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension

A large number of reading comprehension (RC) datasets has been created r...
research
11/02/2019

How to Pre-Train Your Model? Comparison of Different Pre-Training Models for Biomedical Question Answering

Using deep learning models on small scale datasets would result in overf...
research
05/13/2020

BIOMRC: A Dataset for Biomedical Machine Reading Comprehension

We introduce BIOMRC, a large-scale cloze-style biomedical MRC dataset. C...
research
10/22/2019

MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension

We present the results of the Machine Reading for Question Answering (MR...
research
06/30/2021

Zero-Shot Estimation of Base Models' Weights in Ensemble of Machine Reading Comprehension Systems for Robust Generalization

One of the main challenges of the machine reading comprehension (MRC) mo...
research
10/04/2022

Modular Approach to Machine Reading Comprehension: Mixture of Task-Aware Experts

In this work we present a Mixture of Task-Aware Experts Network for Mach...
research
05/15/2022

Not to Overfit or Underfit? A Study of Domain Generalization in Question Answering

Machine learning models are prone to overfitting their source (training)...

Please sign up or login with your details

Forgot password? Click here to reset