Quasar: Datasets for Question Answering by Search and Reading

07/12/2017
by   Bhuwan Dhingra, et al.
0

We present two new large-scale datasets aimed at evaluating systems designed to comprehend a natural language query and extract its answer from a large corpus of text. The Quasar-S dataset consists of 37000 cloze-style (fill-in-the-gap) queries constructed from definitions of software entity tags on the popular website Stack Overflow. The posts and comments on the website serve as the background corpus for answering the cloze questions. The Quasar-T dataset consists of 43000 open-domain trivia questions and their answers obtained from various internet sources. ClueWeb09 serves as the background corpus for extracting these answers. We pose these datasets as a challenge for two related subtasks of factoid Question Answering: (1) searching for relevant pieces of text that include the correct answer to a query, and (2) reading the retrieved text to answer the query. We also describe a retrieval system for extracting relevant sentences and documents from the corpus given a query, and include these in the release for researchers wishing to only focus on (2). We evaluate several baselines on both datasets, ranging from simple heuristics to powerful neural models, and show that these lag behind human performance by 16.4 https://github.com/bdhingra/quasar .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/23/2023

Retrieving Supporting Evidence for LLMs Generated Answers

Current large language models (LLMs) can exhibit near-human levels of pe...
research
09/29/2017

A Neural Comprehensive Ranker (NCR) for Open-Domain Question Answering

This paper proposes a novel neural machine reading model for open-domain...
research
06/02/2020

Open-Domain Question Answering with Pre-Constructed Question Spaces

Open-domain question answering aims at solving the task of locating the ...
research
09/20/2023

Retrieving Supporting Evidence for Generative Question Answering

Current large language models (LLMs) can exhibit near-human levels of pe...
research
08/25/2018

Dr. Tux: A Question Answering System for Ubuntu users

Various forums and question answering (Q&A) sites are available online t...
research
03/09/2021

Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering

Multimodal IR, spanning text corpus, knowledge graph and images, called ...
research
03/26/2022

MQDD: Pre-training of Multimodal Question Duplicity Detection for Software Engineering Domain

This work proposes a new pipeline for leveraging data collected on the S...

Please sign up or login with your details

Forgot password? Click here to reset