End-to-End Neural Ranking for eCommerce Product Search: an application of task models and textual embeddings

06/19/2018
by   Eliot Brenner, et al.
0

We consider the problem of retrieving and ranking items in an eCommerce catalog, often called SKUs, in order of relevance to a user-issued query. The input data for the ranking are the texts of the queries and textual fields of the SKUs indexed in the catalog. We review the ways in which this problem both resembles and differs from the problems of IR in the context of web search. The differences between the product-search problem and the IR problem of web search necessitate a different approach in terms of both models and datasets. We first review the recent state-of-the-art models for web search IR, distinguishing between two distinct types of model which we call the distributed type and the local-interaction type. The different types of relevance models developed for IR have complementary advantages and disadvantages when applied to eCommerce product search. Further, we explain why the conventional methods for dataset construction employed in the IR literature fail to produce data which suffices for training or evaluation of models for eCommerce product search. We explain how our own approach, applying task modeling techniques to the click-through logs of an eCommerce site, enables the construction of a large-scale dataset for training and robust benchmarking of relevance models. Our experiments consist of applying several of the models from the IR literature to our own dataset. Empirically, we have established that, when applied to our dataset, certain models of local-interaction type reduce ranking errors by one-third compared to the baseline tf-idf. Applied to our dataset, the distributed models fail to outperform the baseline. As a basis for a deployed system, the distributed models have several advantages, computationally, over the local-interaction models. This motivates an ongoing program of work, which we outline at the conclusion of the paper.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/14/2021

TripClick: The Log Files of a Large Health Web Search Engine

Click logs are valuable resources for a variety of information retrieval...
research
12/23/2021

Customising Ranking Models for Enterprise Search on Bilingual Click-Through Dataset

In this work, we provide the details about the process of establishing a...
research
10/16/2017

DeepRank: A New Deep Architecture for Relevance Ranking in Information Retrieval

This paper concerns a deep learning approach to relevance ranking in inf...
research
12/29/2020

Meta Adaptive Neural Ranking with Contrastive Synthetic Supervision

Neural Information Retrieval (Neu-IR) models have shown their effectiven...
research
04/15/2019

An Axiomatic Approach to Regularizing Neural Ranking Models

Axiomatic information retrieval (IR) seeks a set of principle properties...
research
12/22/2017

Ranking Triples using Entity Links in a Large Web Crawl - The Chicory Triple Scorer at WSDM Cup 2017

This paper describes the participation of team Chicory in the Triple Ran...
research
07/20/2020

A Comparison of Supervised Learning to Match Methods for Product Search

The vocabulary gap is a core challenge in information retrieval (IR). In...

Please sign up or login with your details

Forgot password? Click here to reset