Decoding a Neural Retriever's Latent Space for Query Suggestion

10/21/2022
by   Leonard Adolphs, et al.
0

Neural retrieval models have superseded classic bag-of-words methods such as BM25 as the retrieval framework of choice. However, neural systems lack the interpretability of bag-of-words models; it is not trivial to connect a query change to a change in the latent space that ultimately determines the retrieval results. To shed light on this embedding space, we learn a "query decoder" that, given a latent representation of a neural search engine, generates the corresponding query. We show that it is possible to decode a meaningful query from its latent representation and, when moving in the right direction in latent space, to decode a query that retrieves the relevant paragraph. In particular, the query decoder can be useful to understand "what should have been asked" to retrieve a particular paragraph from the collection. We employ the query decoder to generate a large synthetic dataset of query reformulations for MSMarco, leading to improved retrieval performance. On this data, we train a pseudo-relevance feedback (PRF) T5 model for the application of query suggestion that outperforms both query reformulation and PRF information retrieval baselines.

READ FULL TEXT

page 8

page 13

research
08/13/2021

GQE-PRF: Generative Query Expansion with Pseudo-Relevance Feedback

Query expansion with pseudo-relevance feedback (PRF) is a powerful appro...
research
03/23/2021

Shared Latent Space of Font Shapes and Impressions

We have specific impressions from the style of a typeface (font), sugges...
research
07/20/2022

Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval

The amount of audio data available on public websites is growing rapidly...
research
05/25/2022

Refining Query Representations for Dense Retrieval at Test Time

Dense retrieval uses a contrastive learning framework to learn dense rep...
research
03/12/2018

Gradient Augmented Information Retrieval with Autoencoders and Semantic Hashing

This paper will explore the use of autoencoders for semantic hashing in ...
research
06/15/2019

Relevance Feedback with Latent Variables in Riemann spaces

In this paper we develop and evaluate two methods for relevance feedback...
research
10/21/2018

3D shape retrieval basing on representatives of classes

In this paper, we present an improvement of our proposed technique for 3...

Please sign up or login with your details

Forgot password? Click here to reset