Exposing Query Identification for Search Transparency

10/14/2021
by Ruohan Li, et al.

Search systems control the exposure of ranked content to searchers. In many cases, creators value not only the exposure of their content but also an understanding of the specific searches in which that content is surfaced. The problem of identifying which queries expose a given piece of content in the ranking results is an important and relatively under-explored search transparency challenge. Exposing queries are useful for quantifying various issues of search bias, privacy, data protection, security, and search engine optimization. Exact identification of exposing queries in a given system is computationally expensive, especially in dynamic contexts such as web search. In search of a more lightweight solution, we explore the feasibility of approximate exposing query identification (EQI) as a retrieval task by reversing the roles of queries and documents in two classes of search systems: dense dual-encoder models and traditional BM25 models. We then propose how this approach can be improved through metric learning over the retrieval embedding space. We further derive an evaluation metric to measure the quality of a ranking of exposing queries, and conduct an empirical analysis focusing on various practical aspects of approximate EQI.
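To make the contrast between exact and approximate EQI concrete, the following is a minimal sketch, not the authors' implementation. It assumes a dot-product dual-encoder whose query and document embeddings are already available as NumPy arrays; the function names, the random embeddings, and the cutoffs `k` and `n` are all hypothetical choices made for illustration. Exact identification runs every query through the ranker, whereas the reversed formulation treats the document embedding as the "query" and retrieves its nearest queries.

```python
import numpy as np

def exact_exposing_queries(Q, D, doc_idx, k):
    """Brute force: score every query against every document and keep
    the queries whose top-k ranking contains the target document."""
    scores = Q @ D.T                              # shape (n_queries, n_docs)
    topk = np.argsort(-scores, axis=1)[:, :k]     # top-k doc ids per query
    return np.flatnonzero((topk == doc_idx).any(axis=1))

def approximate_exposing_queries(Q, D, doc_idx, n):
    """Reversed retrieval: rank queries by similarity to the document
    embedding and return the n closest ones as candidates."""
    sims = Q @ D[doc_idx]                         # shape (n_queries,)
    return np.argsort(-sims)[:n]

# Hypothetical random embeddings standing in for encoder outputs.
rng = np.random.default_rng(0)
Q = rng.normal(size=(200, 16))   # 200 logged queries (assumed)
D = rng.normal(size=(50, 16))    # 50 documents (assumed)

exact = exact_exposing_queries(Q, D, doc_idx=3, k=5)
approx = approximate_exposing_queries(Q, D, doc_idx=3, n=20)

# Fraction of true exposing queries recovered by the approximation.
recall = len(set(exact) & set(approx)) / max(len(exact), 1)
print(f"exact: {len(exact)} queries, recall of approximation: {recall:.2f}")
```

The approximation can miss exposing queries because whether a document enters a query's top-k depends on the competing documents, not only on the query-document similarity. In this sketch that gap is what a learned transformation of the embedding space would aim to reduce.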
