MilkQA: a Dataset of Consumer Questions for the Task of Answer Selection

01/10/2018
by   Marcelo Criscuolo, et al.
0

We introduce MilkQA, a question answering dataset from the dairy domain dedicated to the study of consumer questions. The dataset contains 2,657 pairs of questions and answers, written in the Portuguese language and originally collected by the Brazilian Agricultural Research Corporation (Embrapa). All questions were motivated by real situations and written by thousands of authors with very different backgrounds and levels of literacy, while answers were elaborated by specialists from Embrapa's customer service. Our dataset was filtered and anonymized by three human annotators. Consumer questions are a challenging kind of question that is usually employed as a form of seeking information. Although several question answering datasets are available, most of such resources are not suitable for research on answer selection models for consumer questions. We aim to fill this gap by making MilkQA publicly available. We study the behavior of four answer selection models on MilkQA: two baseline models and two convolutional neural network archictetures. Our results show that MilkQA poses real challenges to computational models, particularly due to linguistic characteristics of its questions and to their unusually longer lengths. Only one of the experimented models gives reasonable results, at the cost of high computational requirements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/25/2021

PerCQA: Persian Community Question Answering Dataset

Community Question Answering (CQA) forums provide answers for many real-...
research
05/07/2021

A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers

Readers of academic research papers often read with the goal of answerin...
research
11/08/2019

The TechQA Dataset

We introduce TechQA, a domain-adaptation question answering dataset for ...
research
10/26/2018

Finding Answers from the Word of God: Domain Adaptation for Neural Networks in Biblical Question Answering

Question answering (QA) has significantly benefitted from deep learning ...
research
02/06/2018

Question-Answer Selection in User to User Marketplace Conversations

Sellers in user to user marketplaces can be inundated with questions fro...
research
08/21/2019

How Good is Artificial Intelligence at Automatically Answering Consumer Questions Related to Alzheimer's Disease?

Alzheimer's Disease (AD) is the most common type of dementia, comprising...
research
08/10/2020

Question Identification in Arabic Language Using Emotional Based Features

With the growth of content on social media networks, enterprises and ser...

Please sign up or login with your details

Forgot password? Click here to reset