Amazon-M2: A Multilingual Multi-locale Shopping Session Dataset for Recommendation and Text Generation

by Wei Jin, et al.

Modeling customer shopping intentions is a crucial task for e-commerce, as it directly impacts user experience and engagement. Thus, accurately understanding customer preferences is essential for providing personalized recommendations. Session-based recommendation, which uses customer session data to predict the next interaction, has become increasingly popular. However, existing session datasets are limited in item attributes, user diversity, and scale, so they cannot comprehensively capture the spectrum of user behaviors and preferences. To bridge this gap, we present the Amazon Multilingual Multi-locale Shopping Session Dataset, Amazon-M2. It is the first multilingual dataset consisting of millions of user sessions from six different locales, where the major languages of products are English, German, Japanese, French, Italian, and Spanish. The dataset can help enhance personalization and the understanding of user preferences, which can benefit various existing tasks as well as enable new ones. To test the potential of the dataset, we introduce three tasks in this work: (1) next-product recommendation, (2) next-product recommendation with domain shifts, and (3) next-product title generation. With these tasks, we benchmark a range of algorithms on the proposed dataset, drawing new insights for further research and practice. In addition, based on the proposed dataset and tasks, we hosted a competition at KDD Cup 2023 that attracted thousands of users and submissions. The winning solutions and the associated workshop can be accessed at our website.




SIGIR 2021 E-Commerce Workshop Data Challenge

The 2021 SIGIR workshop on eCommerce is hosting the Coveo Data Challenge...

Learning Similarity among Users for Personalized Session-Based Recommendation from hierarchical structure of User-Session-Item

The task of the session-based recommendation is to predict the next inte...

Transformers with multi-modal features and post-fusion context for e-commerce session-based recommendation

Session-based recommendation is an important task for e-commerce service...

Price DOES Matter! Modeling Price and Interest Preferences in Session-based Recommendation

Session-based recommendation aims to predict items that an anonymous use...

Reusable Self-Attention-based Recommender System for Fashion

A large number of empirical studies on applying self-attention models in...

Session-based k-NNs with Semantic Suggestions for Next-item Prediction

One of the most critical problems in e-commerce domain is the informatio...

Learning to Personalize Recommendation based on Customers' Shopping Intents

Understanding the customers' high level shopping intent, such as their d...
