A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset

06/01/2018
by   Michael Boratko, et al.
2

The recent work of Clark et al. introduces the AI2 Reasoning Challenge (ARC) and the associated ARC dataset that partitions open domain, complex science questions into an Easy Set and a Challenge Set. That paper includes an analysis of 100 questions with respect to the types of knowledge and reasoning required to answer them; however, it does not include clear definitions of these types, nor does it offer information about the quality of the labels. We propose a comprehensive set of definitions of knowledge and reasoning types necessary for answering the questions in the ARC dataset. Using ten annotators and a sophisticated annotation interface, we analyze the distribution of labels across the Challenge Set and statistics related to them. Additionally, we demonstrate that although naive information retrieval methods return sentences that are irrelevant to answering the query, sufficient supporting text is often present in the (ARC) corpus. Evaluating with human-selected relevant sentences improves the performance of a neural machine comprehension model by 42 points.

READ FULL TEXT
research
03/14/2018

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

We present a new question set, text corpus, and baselines assembled to e...
research
03/18/2018

The Web as a Knowledge-base for Answering Complex Questions

Answering complex questions is a time-consuming activity for humans that...
research
05/31/2018

KG^2: Learning to Reason Science Exam Questions with Contextual Knowledge Graph Embeddings

The AI2 Reasoning Challenge (ARC), a new benchmark dataset for question ...
research
11/01/2021

Discourse Comprehension: A Question Answering Framework to Represent Sentence Connections

While there has been substantial progress in text comprehension through ...
research
04/20/2023

Why Does ChatGPT Fall Short in Answering Questions Faithfully?

Recent advancements in Large Language Models, such as ChatGPT, have demo...
research
04/26/2020

Challenge Closed-book Science Exam: A Meta-learning Based Question Answering System

Prior work in standardized science exams requires support from large tex...
research
12/31/2019

What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge

Open-domain question answering (QA) is known to involve several underlyi...

Please sign up or login with your details

Forgot password? Click here to reset