Controllable Multi-Objective Re-ranking with Policy Hypernetworks

06/08/2023
by   Sirui Chen, et al.
Multi-stage ranking pipelines are widely used in modern recommender systems, where the final stage aims to return a ranked list of items that balances several requirements, such as user preference, diversity, and novelty. Linear scalarization is arguably the most widely used technique for merging multiple requirements into a single optimization objective: the requirements are summed with given preference weights. Existing final-stage ranking methods often adopt a static model in which the preference weights are fixed during offline training and kept unchanged during online serving. Whenever the preference weights need to change, the model has to be retrained, which is costly in both time and resources. Meanwhile, the most appropriate weights may vary greatly across target user groups or time periods (e.g., during holiday promotions). In this paper, we propose a framework called controllable multi-objective re-ranking (CMR), which incorporates a hypernetwork to generate the parameters of a re-ranking model according to different preference weights. In this way, CMR can adapt the preference weights to changes in the environment online, without retraining the models. Moreover, we classify practical business-oriented tasks into four main categories and seamlessly incorporate them into a newly proposed re-ranking model based on an Actor-Evaluator framework, which serves as a reliable real-world testbed for CMR. Offline experiments on a dataset collected from the Taobao App showed that CMR improved several popular re-ranking models when they were used as its underlying models. Online A/B tests also demonstrated the effectiveness and trustworthiness of CMR.
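The two ideas in the abstract, linear scalarization and a hypernetwork conditioned on preference weights, can be illustrated with a toy sketch. This is not the CMR architecture, which the abstract does not specify. The names `scalarize` and `ToyHyperScorer`, the linear item scorer, and the single linear hypernetwork layer are all assumptions made for illustration.

```python
import numpy as np


def scalarize(objective_scores, weights):
    """Linear scalarization: weighted sum of per-objective scores.

    objective_scores: array of shape (n_items, n_objectives)
    weights:          array of shape (n_objectives,)
    Returns one combined score per item.
    """
    return np.asarray(objective_scores) @ np.asarray(weights)


class ToyHyperScorer:
    """Hypothetical toy hypernetwork (an assumption, not the paper's model).

    A single linear layer maps the preference weights to the parameter
    vector of a linear item scorer, so changing the weights at serving
    time changes the ranking without retraining anything.
    """

    def __init__(self, n_objectives, n_features, seed=0):
        rng = np.random.default_rng(seed)
        # Hypernetwork parameters; in practice these would be learned offline.
        self.W = rng.normal(scale=0.1, size=(n_objectives, n_features))
        self.b = np.zeros(n_features)

    def generate_params(self, weights):
        # Produce the scorer's parameters from the preference weights.
        return np.asarray(weights) @ self.W + self.b

    def rank(self, item_features, weights):
        # Score items with the generated parameters, best item first.
        theta = self.generate_params(weights)
        scores = np.asarray(item_features) @ theta
        return np.argsort(-scores)


# Two items, two objectives (say, relevance and diversity).
scores = np.array([[1.0, 0.0],   # item 0: strong on objective 0 only
                   [0.0, 1.0]])  # item 1: strong on objective 1 only
combined = scalarize(scores, [0.8, 0.2])
```

In a static model, switching from weights `[0.8, 0.2]` to `[0.2, 0.8]` would mean retraining with a new scalarized objective. In the sketch, the same `ToyHyperScorer` instance is simply called with the new weights, which is the property the abstract attributes to CMR.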

research
05/23/2023

Rethinking the Role of Pre-ranking in Large-scale E-Commerce Searching System

E-commerce search systems such as Taobao Search, the largest e-commerce ...
research
05/22/2022

Ada-Ranker: A Data Distribution Adaptive Ranking Paradigm for Sequential Recommendation

A large-scale recommender system usually consists of recall and ranking ...
research
02/21/2018

Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making

In multi-objective decision planning and learning, much attention is pai...
research
07/07/2022

Contrastive Information Transfer for Pre-Ranking Systems

Real-world search and recommender systems usually adopt a multi-stage ran...
research
01/27/2017

Modelling Preference Data with the Wallenius Distribution

The Wallenius distribution is a generalisation of the Hypergeometric dis...
research
12/10/2018

Top-N-Rank: A Scalable List-wise Ranking Method for Recommender Systems

We propose Top-N-Rank, a novel family of list-wise Learning-to-Rank mode...
research
05/30/2023

Who Would be Interested in Services? An Entity Graph Learning System for User Targeting

With the growing popularity of various mobile devices, user targeting ha...
