Extending CLIP for Category-to-image Retrieval in E-commerce

12/21/2021
by   Mariya Hendriksen, et al.
9

E-commerce provides rich multimodal data that is barely leveraged in practice. One aspect of this data is a category tree that is being used in search and recommendation. However, in practice, during a user's session there is often a mismatch between a textual and a visual representation of a given category. Motivated by the problem, we introduce the task of category-to-image retrieval in e-commerce and propose a model for the task, CLIP-ITA. The model leverages information from multiple modalities (textual, visual, and attribute modality) to create product representations. We explore how adding information from multiple modalities (textual, visual, and attribute modality) impacts the model's performance. In particular, we observe that CLIP-ITA significantly outperforms a comparable model that leverages only the visual modality and a comparable model that leverages the visual and attribute modality.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2022

Multi-Modal Attribute Extraction for E-Commerce

To improve users' experience as they navigate the myriad of options offe...
research
07/21/2022

Unimodal vs. Multimodal Siamese Networks for Outfit Completion

The popularity of online fashion shopping continues to grow. The ability...
research
06/01/2023

PV2TEA: Patching Visual Modality to Textual-Established Information Extraction

Information extraction, e.g., attribute value extraction, has been exten...
research
12/14/2021

ACE-BERT: Adversarial Cross-modal Enhanced BERT for E-commerce Retrieval

Nowadays on E-commerce platforms, products are presented to the customer...
research
02/10/2023

Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval

Same-style products retrieval plays an important role in e-commerce plat...
research
12/01/2018

Towards Traversing the Continuous Spectrum of Image Retrieval

Image retrieval is one of the most popular tasks in computer vision. How...
research
10/10/2022

Visually Similar Products Retrieval for Shopsy

Visual search is of great assistance in reseller commerce, especially fo...

Please sign up or login with your details

Forgot password? Click here to reset