Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search

06/14/2022
by Chandan K. Reddy, et al.

Improving the quality of search results can significantly enhance users' experience and engagement with search engines. Despite several recent advances in machine learning and data mining, correctly classifying items for a particular user search query remains a long-standing challenge with considerable room for improvement. This paper introduces the "Shopping Queries Dataset", a large dataset of difficult Amazon search queries and results, publicly released with the aim of fostering research on improving the quality of search results. The dataset contains around 130 thousand unique queries and 2.6 million manually labeled (query, product) relevance judgements. The dataset is multilingual, with queries in English, Japanese, and Spanish. The Shopping Queries Dataset is being used in one of the KDDCup'22 challenges. In this paper, we describe the dataset and present three evaluation tasks along with baseline results: (i) ranking the results list, (ii) classifying product results into relevance categories, and (iii) identifying substitute products for a given query. We anticipate that this data will become the gold standard for future research on product search.
