Challenges in Information Seeking QA:Unanswerable Questions and Paragraph Retrieval

10/22/2020
by   Akari Asai, et al.
7

Recent progress in pretrained language model "solved" many reading comprehension benchmark datasets. Yet information-seeking Question Answering (QA) datasets, where questions are written without the evidence document, remain unsolved. We analyze two such datasets (Natural Questions and TyDi QA) to identify remaining headrooms: paragraph selection and answerability classification, i.e. determining whether the paired evidence document contains the answer to the query or not. In other words, given a gold paragraph and knowing whether it contains an answer or not, models easily outperform a single annotator in both datasets. After identifying unanswerability as a bottleneck, we further inspect what makes questions unanswerable. Our study points to avenues for future research, both for dataset creation and model development.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2021

A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers

Readers of academic research papers often read with the goal of answerin...
research
11/30/2022

CREPE: Open-Domain Question Answering with False Presuppositions

Information seeking users often pose questions with false presupposition...
research
09/12/2019

Finding Generalizable Evidence by Learning to Convince Q A Models

We propose a system that finds the strongest supporting evidence for a g...
research
03/01/2023

DIFFQG: Generating Questions to Summarize Factual Changes

Identifying the difference between two versions of the same article is u...
research
03/23/2018

Datasheets for Datasets

Currently there is no standard way to identify how a dataset was created...
research
08/27/2019

Interactive Machine Comprehension with Information Seeking Agents

Existing machine reading comprehension (MRC) models do not scale effecti...
research
10/06/2020

PolicyQA: A Reading Comprehension Dataset for Privacy Policies

Privacy policy documents are long and verbose. A question answering (QA)...

Please sign up or login with your details

Forgot password? Click here to reset