Unsupervised Multi-hop Question Answering by Question Generation

10/23/2020
by   Liangming Pan, et al.
0

Obtaining training data for Multi-hop Question Answering (QA) is extremely time-consuming and resource-intensive. To address this, we propose the problem of unsupervised multi-hop QA, assuming that no human-labeled multi-hop question-answer pairs are available. We propose MQA-QG, an unsupervised question answering framework that can generate human-like multi-hop training pairs from both homogeneous and heterogeneous data sources. Our model generates questions by first selecting or generating relevant information from each data source and then integrating the multiple information to form a multi-hop question. We find that we can train a competent multi-hop QA model with only generated data. The F1 gap between the unsupervised and fully-supervised models is less than 20 in both the HotpotQA and the HybridQA dataset. Further experiments reveal that an unsupervised pretraining with the QA data generated by our model would greatly reduce the demand for human-annotated training data for multi-hop QA.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/22/2020

Unsupervised Question Decomposition for Question Answering

We aim to improve question answering (QA) by decomposing hard questions ...
research
04/24/2020

Template-Based Question Generation from Retrieved Sentences for Improved Unsupervised Question Answering

Question Answering (QA) is in increasing demand as the amount of informa...
research
11/17/2019

Quick and (not so) Dirty: Unsupervised Selection of Justification Sentences for Multi-hop Question Answering

We propose an unsupervised strategy for the selection of justification s...
research
06/12/2019

Unsupervised Question Answering by Cloze Translation

Obtaining training data for Question Answering (QA) is time-consuming an...
research
01/03/2023

PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora

Supervised Question Answering systems (QA systems) rely on domain-specif...
research
10/12/2022

Improving Question Answering with Generation of NQ-like Questions

Question Answering (QA) systems require a large amount of annotated data...
research
06/01/2023

TimelineQA: A Benchmark for Question Answering over Timelines

Lifelogs are descriptions of experiences that a person had during their ...

Please sign up or login with your details

Forgot password? Click here to reset