DaNetQA: a yes/no Question Answering Dataset for the Russian Language

10/06/2020
by   Taisia Glushkova, et al.

DaNetQA, a new question-answering corpus, follows the design of Clark et al. (2019): it comprises natural yes/no questions. Each question is paired with a paragraph from Wikipedia and an answer derived from that paragraph. The task is to take both the question and the paragraph as input and produce a binary yes/no answer. In this paper, we present a reproducible approach to creating DaNetQA and investigate transfer learning methods for task and language transfer. For task transfer we leverage three similar sentence modelling tasks: 1) a corpus of paraphrases, Paraphraser, 2) an NLI task, for which we use the Russian part of XNLI, and 3) another question answering task, SberQUAD. For language transfer we use English-to-Russian translation together with multilingual language fine-tuning.
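The input/output format of the task can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' code: the names `YesNoExample`, `accuracy`, `always_yes`, and `toy_examples` are hypothetical, the toy examples are invented for illustration and are not taken from DaNetQA, and accuracy is assumed here as a natural metric for a binary task.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class YesNoExample:
    """One (question, paragraph, answer) triple; the layout is assumed."""
    question: str
    passage: str
    label: bool  # True means "yes", False means "no"


# A predictor maps (question, passage) to a binary answer.
Predictor = Callable[[str, str], bool]


def accuracy(predict: Predictor, examples: Sequence[YesNoExample]) -> float:
    """Fraction of examples for which the predictor returns the gold label."""
    if not examples:
        raise ValueError("examples must not be empty")
    correct = sum(predict(ex.question, ex.passage) == ex.label for ex in examples)
    return correct / len(examples)


def always_yes(question: str, passage: str) -> bool:
    """Trivial baseline that ignores its input and always answers 'yes'."""
    return True


# Invented toy data, for illustration only (not from the dataset).
toy_examples = [
    YesNoExample("Is Moscow the capital of Russia?",
                 "Moscow is the capital and largest city of Russia.", True),
    YesNoExample("Is the Volga the longest river in the world?",
                 "The Volga is the longest river in Europe.", False),
    YesNoExample("Is Lake Baikal located in Russia?",
                 "Lake Baikal is a rift lake in southern Siberia, Russia.", True),
]

baseline_accuracy = accuracy(always_yes, toy_examples)
```

A constant baseline like `always_yes` is a useful sanity check for any yes/no dataset, since its accuracy equals the share of "yes" labels in the evaluated examples.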

