Theoretical Understandings of Product Embedding for E-commerce Machine Learning

02/24/2021
by Da Xu, et al.

Product embeddings have been heavily investigated in the past few years, serving as the cornerstone for a broad range of machine learning applications in e-commerce. Despite their empirical success, little is known about how and why they work from a theoretical standpoint. Analogous results from natural language processing (NLP) often rely on domain-specific properties that do not transfer to the e-commerce setting, and the downstream tasks often focus on different aspects of the embeddings. We take an e-commerce-oriented view of product embeddings and present a complete theoretical view from both the representation learning and the learning theory perspectives. We prove that product embeddings trained by the widely adopted skip-gram negative sampling algorithm and its variants are a sufficient dimension reduction with respect to a critical product relatedness measure. The generalization performance in the downstream machine learning task is controlled by the alignment between the embeddings and the product relatedness measure. Following the theoretical discoveries, we conduct exploratory experiments that support our theoretical insights for product embeddings.
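For readers unfamiliar with skip-gram negative sampling, the following is a minimal sketch of the standard word2vec-style per-pair loss, applied here to products instead of words. It is not taken from the paper: the function names (`sgns_loss`, `dot`, `sigmoid`) and the toy vectors are hypothetical, and the paper's exact formulation and variants may differ.

```python
import math

def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)

def dot(a, b):
    # Inner product of two equal-length vectors.
    return sum(x * y for x, y in zip(a, b))

def sgns_loss(target_vec, context_vec, negative_vecs):
    """Loss for one observed (target, context) product pair.

    Assumption: the generic skip-gram negative sampling objective,
    -log s(u.v) - sum_k log s(-u.n_k), where s is the sigmoid,
    u is the target embedding, v is the embedding of a co-occurring
    product, and n_k are embeddings of randomly sampled products.
    """
    loss = -math.log(sigmoid(dot(target_vec, context_vec)))
    for neg in negative_vecs:
        loss -= math.log(sigmoid(-dot(target_vec, neg)))
    return loss

# Toy 2-d embeddings (illustrative values only).
aligned = sgns_loss([1.0, 0.0], [1.0, 0.0], [[-1.0, 0.0]])
misaligned = sgns_loss([1.0, 0.0], [-1.0, 0.0], [[1.0, 0.0]])
```

In this sketch the loss is lower when the target embedding points toward its observed context product and away from the sampled negative, which is the mechanism through which training ties inner products of embeddings to co-occurrence statistics.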


research
07/01/2022

e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce

Understanding vision and language representations of product content is ...
research
07/02/2022

GUIM – General User and Item Embedding with Mixture of Representation in E-commerce

Our goal is to build general representation (embedding) for each user an...
research
11/28/2019

Product Knowledge Graph Embedding for E-commerce

In this paper, we propose a new product knowledge graph (PKG) embedding ...
research
12/07/2022

Learning-To-Embed: Adopting Transformer based models for E-commerce Products Representation Learning

Learning low-dimensional representation for large number of products pre...
research
01/31/2020

Scalable bundling via dense product embeddings

Bundling, the practice of jointly selling two or more products at a disc...
research
05/24/2021

One4all User Representation for Recommender Systems in E-commerce

General-purpose representation learning through large-scale pre-training...
research
11/04/2022

Continuous Prompt Tuning Based Textual Entailment Model for E-commerce Entity Typing

The explosion of e-commerce has caused the need for processing and analy...
