Zero-Shot Retrieval with Search Agents and Hybrid Environments

09/30/2022
by Michelle Chen Huebscher, et al.

Learning to search is the task of building artificial agents that learn to autonomously use a search box to find information. So far, it has been shown that current language models can learn symbolic query reformulation policies in combination with traditional term-based retrieval, but that they fall short of outperforming neural retrievers. We extend the previous learning-to-search setup to a hybrid environment, which accepts discrete query refinement operations after a first-pass retrieval step performed by a dual encoder. Experiments on the BEIR task show that search agents, trained via behavioral cloning, outperform the underlying search system based on a combined dual encoder retriever and cross encoder reranker. Furthermore, we find that simple heuristic Hybrid Retrieval Environments (HRE) can improve baseline performance by several nDCG points. The search agent based on HRE (HARE) produces state-of-the-art performance on both zero-shot and in-domain evaluations. We carry out an extensive qualitative analysis to shed light on the agents' policies.
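Since the results above are reported in nDCG points, a minimal sketch of the metric may help in reading them. It assumes linear gain (the relevance label itself) and a log2 rank discount, which is one common convention; the function names `dcg_at_k` and `ndcg_at_k` and the toy relevance labels are illustrative and do not come from the paper. A "point" is conventionally read as 0.01 on the 0 to 1 scale.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the top-k relevance labels.

    Assumption: linear gain (the label itself) and a log2(rank + 1)
    discount, with ranks starting at 1.
    """
    return sum(
        rel / math.log2(position + 2)  # position is 0-based, so rank + 1 = position + 2
        for position, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(ranked_relevances, k, judged_relevances=None):
    """nDCG@k: DCG of the system ranking divided by the DCG of an ideal ranking.

    ranked_relevances: labels of the returned documents, in ranked order.
    judged_relevances: labels of all judged documents for the query; if
        omitted, the ideal ranking is built from the returned documents
        only (a simplification, labeled as such).
    """
    pool = ranked_relevances if judged_relevances is None else judged_relevances
    ideal = dcg_at_k(sorted(pool, reverse=True), k)
    if ideal == 0:
        return 0.0  # no relevant documents: define the score as 0
    return dcg_at_k(ranked_relevances, k) / ideal


# Toy example: two relevant documents returned at ranks 2 and 3.
score = ndcg_at_k([0, 1, 1], k=3)
print(f"nDCG@3 = {score:.4f}")
```

Under this convention, moving a relevant document toward the top of the list raises the score, which is the effect a reranker or a query refinement step aims for.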


