Validation Set Evaluation can be Wrong: An Evaluator-Generator Approach for Maximizing Online Performance of Ranking in E-commerce

03/25/2020
by   Guangda Huzhang, et al.
0

Learning-to-rank (LTR) has become a key technology in E-commerce applications. Previous LTR approaches followed the supervised learning paradigm so that learned models should match the labeled data point-wisely or pair-wisely. However, we have noticed that global context information, including the total order of items in the displayed webpage, can play an important role in interactions with the customers. Therefore, to approach the best global ordering, the exploration in a large combinatorial space of items is necessary, which requires evaluating orders that may not appear in the labeled data. In this scenario, we first show that the classical data-based metrics can be inconsistent with online performance, or even misleading. We then propose to learn an evaluator and search the best model guided by the evaluator, which forms the evaluator-generator framework for training the group-wise LTR model. The evaluator is learned from the labeled data, and is enhanced by incorporating the order context information. The generator is trained with the supervision of the evaluator by reinforcement learning to generate the best order in the combinatorial space. Our experiments in one of the world's largest retail platforms disclose that the learned evaluator is a much better indicator than classical data-based metrics. Moreover, our LTR model achieves a significant improvement (2%) from the current industrial-level pair-wise models in terms of both Conversion Rate (CR) and Gross Merchandise Volume (GMV) in online A/B tests.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2020

Beyond the Ground-Truth: An Evaluator-Generator Framework for Group-wise Learning-to-Rank in E-Commerce

Learning-to-rank (LTR) has become a key technology in E-commerce applica...
research
05/25/2020

Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce

The slate re-ranking problem considers the mutual influences between ite...
research
07/19/2021

Learning-To-Ensemble by Contextual Rank Aggregation in E-Commerce

Ensemble models in E-commerce combine predictions from multiple sub-mode...
research
12/30/2021

A General Traffic Shaping Protocol in E-Commerce

To approach different business objectives, online traffic shaping algori...
research
03/20/2023

Learning Multi-Stage Multi-Grained Semantic Embeddings for E-Commerce Search

Retrieving relevant items that match users' queries from billion-scale c...
research
06/14/2018

Transfer Learning for Context-Aware Question Matching in Information-seeking Conversations in E-commerce

Building multi-turn information-seeking conversation systems is an impor...
research
08/14/2017

Optimizing Gross Merchandise Volume via DNN-MAB Dynamic Ranking Paradigm

With the transition from people's traditional `brick-and-mortar' shoppin...

Please sign up or login with your details

Forgot password? Click here to reset