Large Scale Question Answering using Tourism Data

09/08/2019
by Danish Contractor, et al.

Real-world question answering can be significantly more complex than most existing QA datasets reflect. Questions posed by users on websites such as online travel forums may span multiple sentences, and not everything mentioned in a question is necessarily relevant to finding its answer. Such questions typically have a huge candidate answer space and require complex reasoning over large knowledge corpora. We introduce the novel task of answering entity-seeking recommendation questions using a collection of reviews that describe candidate answer entities. We harvest a QA dataset that contains 48,147 paragraph-sized real user questions from travelers seeking recommendations for hotels, attractions and restaurants. Each candidate answer is associated with a collection of unstructured reviews. This dataset is challenging because commonly used neural architectures for QA are prohibitively expensive for a task of this scale. As a solution, we design a scalable cluster-select-rerank approach. It first clusters the text for each entity to identify exemplar sentences describing that entity. It then uses a scalable neural information retrieval (IR) module to select a subset of potential entities from the large candidate set. Finally, a reranker uses a deeper attention-based architecture to pick the best answers from the selected entities. This strategy performs better than a pure IR or a pure attention-based reasoning approach, yielding nearly 10
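The three stages of the cluster-select-rerank idea can be illustrated with a toy, non-neural sketch. This is not the authors' implementation. Every function name, the similarity threshold, and the sample reviews below are hypothetical. Greedy leader clustering over bag-of-words vectors stands in for the clustering step, cosine similarity stands in for the neural IR module, and a token-coverage score stands in for the attention-based reranker.

```python
import math
import re
from collections import Counter


def tokens(text):
    """Lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def vec(text):
    """Bag-of-words vector (assumption: stands in for a learned encoding)."""
    return Counter(tokens(text))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def exemplar_sentences(sentences, threshold=0.3):
    """Stage 1 (cluster): greedy leader clustering. A sentence that is not
    similar to any existing leader starts a new cluster; leaders serve as
    the exemplars for the entity."""
    leaders = []
    for s in sentences:
        v = vec(s)
        if all(cosine(v, vec(lead)) < threshold for lead in leaders):
            leaders.append(s)
    return leaders


def select(question, exemplars_by_entity, n=2):
    """Stage 2 (select): cheap scoring of every entity, keep the top n."""
    q = vec(question)
    score = {
        e: max((cosine(q, vec(s)) for s in sents), default=0.0)
        for e, sents in exemplars_by_entity.items()
    }
    return sorted(score, key=score.get, reverse=True)[:n]


def rerank(question, shortlist, reviews):
    """Stage 3 (rerank): a costlier score, applied to the shortlist only.
    Here: the fraction of question tokens found in the entity's reviews."""
    q = set(tokens(question))

    def coverage(e):
        seen = set(tokens(" ".join(reviews[e])))
        return len(q & seen) / len(q) if q else 0.0

    return sorted(shortlist, key=coverage, reverse=True)


def answer(question, reviews, n=2):
    exemplars = {e: exemplar_sentences(s) for e, s in reviews.items()}
    return rerank(question, select(question, exemplars, n), reviews)


# Hypothetical data, for illustration only.
reviews = {
    "Hotel A": ["Quiet rooms near the beach.",
                "The rooms are quiet and close to the beach.",
                "Breakfast was great."],
    "Hotel B": ["Loud nightclub next door.", "Great for parties."],
    "Cafe C": ["Best vegetarian breakfast in town.", "Friendly staff."],
}
question = "We want a quiet hotel room near the beach with a good breakfast."
print(answer(question, reviews))
```

The point of the structure is cost: the cheap score runs over every candidate entity, while the more expensive score runs only over the small shortlist that survives selection.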

research
11/30/2022

CREPE: Open-Domain Question Answering with False Presuppositions

Information seeking users often pose questions with false presupposition...
research
05/31/2022

Neural Retriever and Go Beyond: A Thesis Proposal

Information Retriever (IR) aims to find the relevant documents (e.g. sni...
research
01/23/2019

A Question-Entailment Approach to Question Answering

One of the challenges in large-scale information retrieval (IR) is to de...
research
09/28/2020

Joint Spatio-Textual Reasoning for Answering Tourism Questions

Our goal is to answer real-world tourism questions that seek Points-of-I...
research
12/28/2021

The University of Texas at Dallas HLTRI's Participation in EPIC-QA: Searching for Entailed Questions Revealing Novel Answer Nuggets

The Epidemic Question Answering (EPIC-QA) track at the Text Analysis Con...
research
09/10/2019

WIQA: A dataset for "What if..." reasoning over procedural text

We introduce WIQA, the first large-scale dataset of "What if..." questio...
research
03/30/2023

QUADRo: Dataset and Models for QUestion-Answer Database Retrieval

An effective paradigm for building Automated Question Answering systems ...
