Log In Sign Up

Embedding-based Product Retrieval in Taobao Search

by   Sen Li, et al.

Nowadays, the product search service of e-commerce platforms has become a vital shopping channel in people's life. The retrieval phase of products determines the search system's quality and gradually attracts researchers' attention. Retrieving the most relevant products from a large-scale corpus while preserving personalized user characteristics remains an open question. Recent approaches in this domain have mainly focused on embedding-based retrieval (EBR) systems. However, after a long period of practice on Taobao, we find that the performance of the EBR system is dramatically degraded due to its: (1) low relevance with a given query and (2) discrepancy between the training and inference phases. Therefore, we propose a novel and practical embedding-based product retrieval model, named Multi-Grained Deep Semantic Product Retrieval (MGDSPR). Specifically, we first identify the inconsistency between the training and inference stages, and then use the softmax cross-entropy loss as the training objective, which achieves better performance and faster convergence. Two efficient methods are further proposed to improve retrieval relevance, including smoothing noisy training data and generating relevance-improving hard negative samples without requiring extra knowledge and training procedures. We evaluate MGDSPR on Taobao Product Search with significant metrics gains observed in offline experiments and online A/B tests. MGDSPR has been successfully deployed to the existing multi-channel retrieval system in Taobao Search. We also introduce the online deployment scheme and share practical lessons of our retrieval system to contribute to the community.


page 1

page 2

page 3

page 4


Modeling Product Search Relevance in e-Commerce

With the rapid growth of e-Commerce, online product search has emerged a...

Learning a Product Relevance Model from Click-Through Data in E-Commerce

The search engine plays a fundamental role in online e-commerce systems,...

A Zero Attention Model for Personalized Product Search

Product search is one of the most popular methods for people to discover...

Uni-Retriever: Towards Learning The Unified Embedding Based Retriever in Bing Sponsored Search

Embedding based retrieval (EBR) is a fundamental building block in many ...

Embracing Structure in Data for Billion-Scale Semantic Product Search

We present principled approaches to train and deploy dyadic neural embed...

Practical User Feedback-driven Internal Search Using Online Learning to Rank

We present a system, Spoke, for creating and searching internal knowledg...

Embedding-based Retrieval in Facebook Search

Search in social networks such as Facebook poses different challenges th...