Enhanced vectors for top-k document retrieval in Question Answering

10/08/2022
by   Mohammed Hammad, et al.
0

Modern day applications, especially information retrieval webapps that involve "search" as their use cases are gradually moving towards "answering" modules. Conversational chatbots which have been proved to be more engaging to users, use Question Answering as their core. Since, precise answering is computationally expensive, several approaches have been developed to prefetch the most relevant documents/passages from the database that contain the answer. We propose a different approach that retrieves the evidence documents efficiently and accurately, making sure that the relevant document for a given user query is not missed. We do so by assigning each document (or passage in our case), a unique identifier and using them to create dense vectors which can be efficiently indexed. More precisely, we use the identifier to predict randomly sampled context window words of the relevant question corresponding to the passage along with the words of passage itself. This naturally embeds the passage identifier into the vector space in such a way that the embedding is closer to the question without compromising he information content. This approach enables efficient creation of real-time query vectors in  4 milliseconds.

READ FULL TEXT
research
04/17/2019

Document Expansion by Query Prediction

One technique to improve the retrieval effectiveness of a search engine ...
research
10/23/2017

Content Based Document Recommender using Deep Learning

With the recent advancements in information technology there has been a ...
research
07/31/2023

Olio: A Semantic Search Interface for Data Repositories

Search and information retrieval systems are becoming more expressive in...
research
01/03/2019

Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering

End-to-end neural models have made significant progress in question answ...
research
10/20/2020

Extracting Procedural Knowledge from Technical Documents

Procedures are an important knowledge component of documents that can be...
research
01/18/2021

Tip of the Tongue Known-Item Retrieval: A Case Study in Movie Identification

While current information retrieval systems are effective for known-item...
research
05/01/2023

CHIC: Corporate Document for Visual question Answering

The massive use of digital documents due to the substantial trend of pap...

Please sign up or login with your details

Forgot password? Click here to reset