SelQA: A New Benchmark for Selection-based Question Answering

06/27/2016
by Tomasz Jurczyk, et al.

This paper presents SelQA, a new dataset for selection-based question answering. The dataset consists of questions generated through crowdsourcing and sentence-length answers drawn from the ten most prevalent topics in the English Wikipedia. We introduce a corpus annotation scheme that supports the creation of large, diverse, and challenging datasets by explicitly aiming to reduce word co-occurrences between questions and answers. The scheme is composed of a series of crowdsourcing tasks designed to use crowdsourcing more effectively when creating question answering datasets in various domains. Several systems are compared on the tasks of answer sentence selection and answer triggering, providing strong baseline results for future work to improve upon.
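
To make the two tasks and the role of word co-occurrence concrete, here is a minimal sketch of a word-overlap baseline. It is not one of the systems compared in the paper; the function names (`tokens`, `overlap`, `select_answer`), the `threshold` parameter, and the example question and sentences are all hypothetical and made up for illustration. The sketch assumes that answer sentence selection means picking the best candidate sentence for a question, and that answer triggering additionally allows returning no answer when no candidate is good enough.

```python
import re


def tokens(text):
    """Lowercase a string and return its set of alphanumeric word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap(question, sentence):
    """Fraction of the question's distinct words that also occur in the sentence."""
    q = tokens(question)
    if not q:
        return 0.0
    return len(q & tokens(sentence)) / len(q)


def select_answer(question, candidates, threshold=None):
    """Return the index of the candidate with the highest word overlap.

    Without a threshold this is plain answer sentence selection.
    With a threshold (an assumption of this sketch) it becomes a crude
    form of answer triggering: None is returned when even the best
    candidate scores below the threshold.
    """
    if not candidates:
        return None
    scores = [overlap(question, c) for c in candidates]
    best = max(range(len(scores)), key=scores.__getitem__)
    if threshold is not None and scores[best] < threshold:
        return None
    return best


# Made-up example: the sentence that answers the question shares no
# words with it, so the overlap baseline picks the wrong sentence.
question = "Who wrote the novel Dracula?"
candidates = [
    "Dracula is an 1897 Gothic horror novel.",
    "Bram Stoker was an Irish author.",
]
picked = select_answer(question, candidates)
```

In this example the baseline selects the first sentence, although the second one names the author. Questions with few words in common with their answers defeat this kind of surface matching, which is one way to read the abstract's goal of reducing word co-occurrences to make the dataset more challenging.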


