Integrity and Junkiness Failure Handling for Embedding-based Retrieval: A Case Study in Social Network Search

04/18/2023
by   Wenping Wang, et al.
0

Embedding based retrieval has seen its usage in a variety of search applications like e-commerce, social networking search etc. While the approach has demonstrated its efficacy in tasks like semantic matching and contextual search, it is plagued by the problem of uncontrollable relevance. In this paper, we conduct an analysis of embedding-based retrieval launched in early 2021 on our social network search engine, and define two main categories of failures introduced by it, integrity and junkiness. The former refers to issues such as hate speech and offensive content that can severely harm user experience, while the latter includes irrelevant results like fuzzy text matching or language mismatches. Efficient methods during model inference are further proposed to resolve the issue, including indexing treatments and targeted user cohort treatments, etc. Though being simple, we show the methods have good offline NDCG and online A/B tests metrics gain in practice. We analyze the reasons for the improvements, pointing out that our methods are only preliminary attempts to this important but challenging problem. We put forward potential future directions to explore.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/31/2018

Violence originated from Facebook: A case study in Bangladesh

Facebook as in social network is a great innovation of modern times. Amo...
research
05/09/2020

A Social Search Model for Large Scale Social Networks

With the rise of social networks, information on the internet is no long...
research
06/17/2021

Embedding-based Product Retrieval in Taobao Search

Nowadays, the product search service of e-commerce platforms has become ...
research
07/01/2021

SearchGCN: Powering Embedding Retrieval by Graph Convolution Networks for E-Commerce Search

Graph convolution networks (GCN), which recently becomes new state-of-th...
research
06/20/2020

Embedding-based Retrieval in Facebook Search

Search in social networks such as Facebook poses different challenges th...
research
11/08/2017

An Analysis of Privacy-Aware Personalization Signals by Using Online Evaluation Methods

Personalization despite being an effective solution to the problem infor...
research
09/17/2019

BUDA.ART: A Multimodal Content-Based Analysis and Retrieval System for Buddha Statues

We introduce BUDA.ART, a system designed to assist researchers in Art Hi...

Please sign up or login with your details

Forgot password? Click here to reset