IIRC: A Dataset of Incomplete Information Reading Comprehension Questions

11/13/2020
by   James Ferguson, et al.
0

Humans often have to read multiple documents to address their information needs. However, most existing reading comprehension (RC) tasks only focus on questions for which the contexts provide all the information required to answer them, thus not evaluating a system's performance at identifying a potential lack of sufficient information and locating sources for that information. To fill this gap, we present a dataset, IIRC, with more than 13K questions over paragraphs from English Wikipedia that provide only partial information to answer them, with the missing information occurring in one or more linked documents. The questions were written by crowd workers who did not have access to any of the linked documents, leading to questions that have little lexical overlap with the contexts where the answers appear. This process also gave many questions without answers, and those that require discrete reasoning, increasing the difficulty of the task. We follow recent modeling work on various reading comprehension datasets to construct a baseline model for this dataset, finding that it achieves 31.1 performance is 88.4 leaderboard can be found at https://allennlp.org/iirc.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2017

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

We present TriviaQA, a challenging reading comprehension dataset contain...
research
08/14/2018

How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks

Many recent papers address reading comprehension, where examples consist...
research
09/17/2016

ReasoNet: Learning to Stop Reading in Machine Comprehension

Teaching a computer to read and answer general questions pertaining to a...
research
03/08/2023

Class Cardinality Comparison as a Fermi Problem

Questions on class cardinality comparisons are quite tricky to answer an...
research
08/16/2019

Reasoning Over Paragraph Effects in Situations

A key component of successfully reading a passage of text is the ability...
research
04/21/2018

DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension

We propose DuoRC, a novel dataset for Reading Comprehension (RC) that mo...
research
04/18/2021

Learning with Instance Bundles for Reading Comprehension

When training most modern reading comprehension models, all the question...

Please sign up or login with your details

Forgot password? Click here to reset