piRank: A Probabilistic Intent Based Ranking Framework for Facebook Search

03/27/2022
by   Zhen Liao, et al.
0

While numerous studies have been conducted in the literature exploring different types of machine learning approaches for search ranking, most of them are focused on specific pre-defined problems but only a few of them have studied the ranking framework which can be applied in a commercial search engine in a scalable way. In the meantime, existing ranking models are often optimized for normalized discounted cumulative gains (NDCG) or online click-through rate (CTR), and both types of machine learning models are built based on the assumption that high-quality training data can be easily obtained and well applied to unseen cases. In practice at Facebook search, we observed that our training data for ML models have certain issues. First, tail query intents are hardly covered in our human rating dataset. Second, search click logs are often noisy and hard to clean up due to various reasons. To address the above issues, in this paper, we propose a probabilistic intent based ranking framework (short for piRank), which can: 1) provide a scalable framework to address various ranking issues for different query intents in a divide-and-conquer way; 2) improve system development agility including iteration speed and system debuggability; 3) combine both machine learning and empirical-based algorithmic methods in a systematic way. We conducted extensive experiments and studies on top of Facebook search engine system and validated the effectiveness of this new ranking architecture.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/26/2018

Demystifying Core Ranking in Pinterest Image Search

Pinterest Image Search Engine helps hundreds of millions of users discov...
research
12/23/2021

Customising Ranking Models for Enterprise Search on Bilingual Click-Through Dataset

In this work, we provide the details about the process of establishing a...
research
06/07/2020

SERank: Optimize Sequencewise Learning to Rank Using Squeeze-and-Excitation Network

Learning-to-rank (LTR) is a set of supervised machine learning algorithm...
research
05/24/2021

Pre-trained Language Model based Ranking in Baidu Search

As the heart of a search engine, the ranking system plays a crucial role...
research
08/06/2020

DeText: A Deep Text Ranking Framework with BERT

Ranking is the most important component in a search system. Mostsearch s...
research
04/22/2018

Query Focused Variable Centroid Vectors for Passage Re-ranking in Semantic Search

In this paper, we propose a new approach for passage re-ranking. We show...
research
02/21/2023

HierCat: Hierarchical Query Categorization from Weakly Supervised Data at Facebook Marketplace

Query categorization at customer-to-customer e-commerce platforms like F...

Please sign up or login with your details

Forgot password? Click here to reset