DeepAI AI Chat
Log In Sign Up

Tradeoffs in Sentence Selection Techniques for Open-Domain Question Answering

by   Shih-Ting Lin, et al.

Current methods in open-domain question answering (QA) usually employ a pipeline of first retrieving relevant documents, then applying strong reading comprehension (RC) models to that retrieved text. However, modern RC models are complex and expensive to run, so techniques to prune the space of retrieved text are critical to allow this approach to scale. In this paper, we focus on approaches which apply an intermediate sentence selection step to address this issue, and investigate the best practices for this approach. We describe two groups of models for sentence selection: QA-based approaches, which run a full-fledged QA system to identify answer candidates, and retrieval-based models, which find parts of each passage specifically related to each question. We examine trade-offs between processing speed and task performance in these two approaches, and demonstrate an ensemble module that represents a hybrid of the two. From experiments on Open-SQuAD and TriviaQA, we show that very lightweight QA models can do well at this task, but retrieval-based models are faster still. An ensemble module we describe balances between the two and generalizes well cross-domain.


page 1

page 2

page 3

page 4


Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering

This paper presents a general approach for open-domain question answerin...

Investigating Information Inconsistency in Multilingual Open-Domain Question Answering

Retrieval based open-domain QA systems use retrieved documents and answe...

MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models

Retrieval question answering (ReQA) is the task of retrieving a sentence...

XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-based Textual Knowledge Source

Question answering (QA) is a natural language understanding task within ...

Modeling Context in Answer Sentence Selection Systems on a Latency Budget

Answer Sentence Selection (AS2) is an efficient approach for the design ...

Applying Deep Learning to Answer Selection: A Study and An Open Task

We apply a general deep learning framework to address the non-factoid qu...

InsertGNN: Can Graph Neural Networks Outperform Humans in TOEFL Sentence Insertion Problem?

Sentence insertion is a delicate but fundamental NLP problem. Current ap...