Weakly-Supervised Conditional Embedding for Referred Visual Search

06/05/2023
by   Simon Lepage, et al.
0

This paper presents a new approach to image similarity search in the context of fashion, a domain with inherent ambiguity due to the multiple ways in which images can be considered similar. We introduce the concept of Referred Visual Search (RVS), where users provide additional information to define the desired similarity. We present a new dataset, LAION-RVS-Fashion, consisting of 272K fashion products with 842K images extracted from LAION, designed explicitly for this task. We then propose an innovative method for learning conditional embeddings using weakly-supervised training, achieving a 6 at one (R@1) against a gallery with 2M distractors, compared to classical approaches based on explicit attention and filtering. The proposed method demonstrates robustness, maintaining similar R@1 when dealing with 2.5 times as many distractors as the baseline methods. We believe this is a step forward in the emerging field of Referred Visual Search both in terms of accessible data and approach. Code, data and models are available at https://www.github.com/Simon-Lepage/CondViT-LRVSF .

READ FULL TEXT

page 2

page 5

page 9

page 14

page 16

page 17

page 18

research
10/18/2019

Diversity in Fashion Recommendation using Semantic Parsing

Developing recommendation system for fashion images is challenging due t...
research
09/27/2019

A weakly supervised adaptive triplet loss for deep metric learning

We address the problem of distance metric learning in visual similarity ...
research
06/19/2021

Exploring Visual Context for Weakly Supervised Person Search

Person search has recently emerged as a challenging task that jointly ad...
research
05/28/2019

SizeNet: Weakly Supervised Learning of Visual Size and Fit in Fashion Images

Finding clothes that fit is a hot topic in the e-commerce fashion indust...
research
11/29/2014

Multiple Instance Reinforcement Learning for Efficient Weakly-Supervised Detection in Images

State-of-the-art visual recognition and detection systems increasingly r...
research
07/19/2023

Generative Prompt Model for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) remains challenging when le...
research
12/03/2022

StegaNeRF: Embedding Invisible Information within Neural Radiance Fields

Recent advances in neural rendering imply a future of widespread visual ...

Please sign up or login with your details

Forgot password? Click here to reset