Improving Query Safety at Pinterest

06/20/2020
by Abhijit Mahabal, et al.

Query recommendation in search engines is a double-edged sword, with undeniable benefits but also potential for harm. Identifying unsafe queries is necessary to protect users from inappropriate query suggestions. However, identifying such queries is non-trivial because of the linguistic diversity resulting from large vocabularies, social-group-specific slang, and typos, and because the inappropriateness of a term depends on its context. Here we formulate the problem as query-set expansion: given a small and potentially biased seed set, the aim is to identify a diverse set of semantically related queries. We present PinSets, a system for query-set expansion that applies a simple yet powerful mechanism to search user sessions, expanding a tiny seed set into thousands of related queries at nearly perfect precision, deep into the tail, along with explanations that are easy to interpret. PinSets owes its high-quality expansion to a hybrid of textual and behavioral techniques (i.e., treating queries both as compositional and as black boxes). Experiments show that, for the domain of drugs-related queries, PinSets expands 20 seed queries into 15,670 positive training examples at over 99% precision. The generated expansions have diverse vocabulary and correctly handle words with ambiguous safety. PinSets decreased unsafe query suggestions at Pinterest by 90%.
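To make the idea of session-based query-set expansion concrete, here is a toy sketch in Python. It is not the PinSets algorithm, which the abstract does not specify. It only illustrates the general pattern of scoring candidate queries by how often they share a session with a seed query (the behavioral, black-box view) and reporting tokens shared with the seeds as a human-readable explanation (the textual, compositional view). The function name `expand_seed_set`, the thresholds `min_sessions` and `min_ratio`, and the sample sessions are all hypothetical.

```python
from collections import defaultdict


def expand_seed_set(sessions, seeds, min_sessions=2, min_ratio=0.6):
    """Toy seed-set expansion over search sessions (illustrative only).

    sessions: iterable of lists of query strings, one list per user session.
    seeds: iterable of known-unsafe seed queries.
    Returns {query: explanation dict} for queries that pass both thresholds.
    """
    seeds = set(seeds)
    # Textual view: the tokens that make up the seed queries.
    seed_tokens = {tok for q in seeds for tok in q.split()}

    total = defaultdict(int)      # number of sessions containing the query
    with_seed = defaultdict(int)  # ...that also contain at least one seed
    for session in sessions:
        queries = set(session)
        has_seed = bool(queries & seeds)
        for q in queries:
            total[q] += 1
            if has_seed:
                with_seed[q] += 1

    expansions = {}
    for q, n in total.items():
        if q in seeds or n < min_sessions:
            continue  # skip seeds themselves and queries with too little evidence
        ratio = with_seed[q] / n
        if ratio >= min_ratio:
            expansions[q] = {
                "seed_session_ratio": ratio,
                "sessions_seen": n,
                "shared_tokens": sorted(seed_tokens & set(q.split())),
            }
    return expansions


# Hypothetical sample data: five tiny sessions and one seed query.
sample_sessions = [
    ["weed brownies", "edible recipes", "cannabis butter"],
    ["cannabis butter", "edible recipes"],
    ["brownie recipes", "edible recipes", "chocolate cake"],
    ["garden weed killer", "lawn care"],
    ["weed brownies", "cannabis butter"],
]
result = expand_seed_set(sample_sessions, seeds=["weed brownies"])
for query, why in result.items():
    print(query, why)
```

In this sample, "garden weed killer" shares the token "weed" with the seed but never appears in a session with it, so the behavioral signal keeps it out of the expansion. That is one simple way a term with ambiguous safety could be handled. A real system would need far more data and a more careful scoring rule.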

