Neural Retriever and Go Beyond: A Thesis Proposal

05/31/2022
by Man Luo, et al.

Information Retrieval (IR) aims to find the documents (e.g. snippets, passages, and articles) relevant to a given query at large scale. IR plays an important role in many tasks, such as open-domain question answering and dialogue systems, where external knowledge is needed. In the past, search algorithms based on term matching were widely used. Recently, neural-based algorithms (termed neural retrievers), which can mitigate the limitations of traditional methods, have gained more attention. Despite their success, neural retrievers still face many challenges, e.g. suffering from a small amount of training data and failing to answer simple entity-centric questions. Furthermore, most existing neural retrievers are developed for pure-text queries, which prevents them from handling multi-modal queries (i.e. queries composed of a textual description and images). This proposal has two goals. First, we introduce methods to address the above-mentioned issues of neural retrievers from three angles: new model architectures, IR-oriented pretraining tasks, and the generation of large-scale training data. Second, we identify future research directions and propose potential corresponding solutions.


