Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge

02/05/2021
by   Sumithra Bhakthavatsalam, et al.
0

We present the ARC-DA dataset, a direct-answer ("open response", "freeform") version of the ARC (AI2 Reasoning Challenge) multiple-choice dataset. While ARC has been influential in the community, its multiple-choice format is unrepresentative of real-world questions, and multiple choice formats can be particularly susceptible to artifacts. The ARC-DA dataset addresses these concerns by converting questions to direct-answer format using a combination of crowdsourcing and expert review. The resulting dataset contains 2985 questions with a total of 8436 valid answers (questions typically have more than one valid answer). ARC-DA is one of the first DA datasets of natural questions that often require reasoning, and where appropriate question decompositions are not evident from the questions themselves. We describe the conversion approach taken, appropriate evaluation metrics, and several strong models. Although high, the best scores (81 considerable room for improvement. In addition, the dataset provides a natural setting for new research on explanation, as many questions require reasoning to construct answers. We hope the dataset spurs further advances in complex question-answering by the community. ARC-DA is available at https://allenai.org/data/arc-da

READ FULL TEXT
research
12/25/2021

PerCQA: Persian Community Question Answering Dataset

Community Question Answering (CQA) forums provide answers for many real-...
research
01/01/2023

Chatbots as Problem Solvers: Playing Twenty Questions with Role Reversals

New chat AI applications like ChatGPT offer an advanced understanding of...
research
10/12/2022

OpenCQA: Open-ended Question Answering with Charts

Charts are very popular to analyze data and convey important insights. P...
research
07/19/2017

Crowdsourcing Multiple Choice Science Questions

We present a novel method for obtaining high-quality, domain-targeted mu...
research
07/06/2021

SOCluster- Towards Intent-based Clustering of Stack Overflow Questions using Graph-Based Approach

Stack Overflow (SO) platform has a huge dataset of questions and answers...
research
08/05/2017

e-QRAQ: A Multi-turn Reasoning Dataset and Simulator with Explanations

In this paper we present a new dataset and user simulator e-QRAQ (explai...

Please sign up or login with your details

Forgot password? Click here to reset