Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index

06/13/2019
by   Minjoon Seo, et al.
0

Existing open-domain question answering (QA) models are not suitable for real-time usage because they need to process several long documents on-demand for every input query. In this paper, we introduce the query-agnostic indexable representation of document phrases that can drastically speed up open-domain QA and also allows us to reach long-tail targets. In particular, our dense-sparse phrase encoding effectively captures syntactic, semantic, and lexical information of the phrases and eliminates the pipeline filtering of context documents. Leveraging optimization strategies, our model can be trained in a single 4-GPU server and serve entire Wikipedia (up to 60 billion phrases) under 2TB with CPUs only. Our experiments on SQuAD-Open show that our model is more accurate than previous models while achieving 6000x reduced computational cost, which translates into at least 58x faster end-to-end inference benchmark on CPUs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/23/2020

Learning Dense Representations of Phrases at Scale

Open-domain question answering can be reformulated as a phrase retrieval...
research
11/07/2019

Contextualized Sparse Representation with Rectified N-Gram Attention for Open-Domain Question Answering

A sparse representation is known to be an effective means to encode prec...
research
10/13/2021

Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

Despite their recent popularity and well known advantages, dense retriev...
research
04/08/2021

Video Question Answering with Phrases via Semantic Roles

Video Question Answering (VidQA) evaluation metrics have been limited to...
research
09/16/2021

Phrase Retrieval Learns Passage Retrieval, Too

Dense retrieval methods have shown great promise over sparse retrieval m...
research
09/10/2020

Accelerating Real-Time Question Answering via Question Generation

Existing approaches to real-time question answering (RTQA) rely on learn...
research
01/06/2021

EfficientQA : a RoBERTa Based Phrase-Indexed Question-Answering System

State-of-the-art extractive question answering models achieve superhuman...

Please sign up or login with your details

Forgot password? Click here to reset