Knowledge Refinement via Interaction Between Search Engines and Large Language Models

05/12/2023
by   Jiazhan Feng, et al.
0

Information retrieval (IR) plays a crucial role in locating relevant resources from vast amounts of data, and its applications have evolved from traditional knowledge bases to modern search engines (SEs). The emergence of large language models (LLMs) has further revolutionized the IR field by enabling users to interact with search systems in natural language. In this paper, we explore the advantages and disadvantages of LLMs and SEs, highlighting their respective strengths in understanding user-issued queries and retrieving up-to-date information. To leverage the benefits of both paradigms while circumventing their limitations, we propose InteR, a novel framework that facilitates knowledge refinement through interaction between SEs and LLMs. InteR allows SEs to expand knowledge in queries using LLM-generated knowledge collections and enables LLMs to enhance prompt formulation using SE-retrieved documents. This iterative refinement process augments the inputs of SEs and LLMs, leading to more accurate retrieval. Experiments on large-scale retrieval benchmarks involving web search and low-resource retrieval tasks demonstrate that InteR achieves overall superior zero-shot retrieval performance compared to state-of-the-art methods, even those using relevance judgment. Source code is available at https://github.com/Cyril-JZ/InteR

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2023

CONVERSER: Few-Shot Conversational Dense Retrieval with Synthetic Data Generation

Conversational search provides a natural interface for information retri...
research
04/19/2023

Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agent

Large Language Models (LLMs) have demonstrated a remarkable ability to g...
research
08/14/2023

Large Language Models for Information Retrieval: A Survey

As a primary means of information acquisition, information retrieval (IR...
research
02/28/2023

Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face

We present Spacerini, a modular framework for seamless building and depl...
research
05/16/2023

Large Language Models are Built-in Autoregressive Search Engines

Document retrieval is a key stage of standard Web search engines. Existi...
research
05/19/2022

PLAID: An Efficient Engine for Late Interaction Retrieval

Pre-trained language models are increasingly important components across...
research
10/30/2019

Lexical Learning as an Online Optimal Experiment: Building Efficient Search Engines through Human-Machine Collaboration

Information retrieval (IR) systems need to constantly update their knowl...

Please sign up or login with your details

Forgot password? Click here to reset