DaNetQA: a yes/no Question Answering Dataset for the Russian Language

by   Taisia Glushkova, et al.

DaNetQA, a new question-answering corpus, follows (Clark et. al, 2019) design: it comprises natural yes/no questions. Each question is paired with a paragraph from Wikipedia and an answer, derived from the paragraph. The task is to take both the question and a paragraph as input and come up with a yes/no answer, i.e. to produce a binary output. In this paper, we present a reproducible approach to DaNetQA creation and investigate transfer learning methods for task and language transferring. For task transferring we leverage three similar sentence modelling tasks: 1) a corpus of paraphrases, Paraphraser, 2) an NLI task, for which we use the Russian part of XNLI, 3) another question answering task, SberQUAD. For language transferring we use English to Russian translation together with multilingual language fine-tuning.


page 1

page 2

page 3

page 4


ISS-MULT: Intelligent Sample Selection for Multi-Task Learning in Question Answering

Transferring knowledge from a source domain to another domain is useful,...

Improving Question Answering with External Knowledge

Prior background knowledge is essential for human reading and understand...

StyleKQC: A Style-Variant Paraphrase Corpus for Korean Questions and Commands

Paraphrasing is often performed with less concern for controlled style c...

ELQA: A Corpus of Questions and Answers about the English Language

We introduce a community-sourced dataset for English Language Question A...

When to Fold'em: How to answer Unanswerable questions

We present 3 different question-answering models trained on the SQuAD2.0...

Transferring Domain-Agnostic Knowledge in Video Question Answering

Video question answering (VideoQA) is designed to answer a given questio...

Fine-tuning Multi-hop Question Answering with Hierarchical Graph Network

In this paper, we present a two stage model for multi-hop question answe...