MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval

10/27/2022
by   Chen Bao, et al.
0

Instance-level image retrieval in fashion is a challenging issue owing to its increasing importance in real-scenario visual fashion search. Cross-domain fashion retrieval aims to match the unconstrained customer images as queries for photographs provided by retailers; however, it is a difficult task due to a wide range of consumer-to-shop (C2S) domain discrepancies and also considering that clothing image is vulnerable to various non-rigid deformations. To this end, we propose a novel multi-scale and multi-granularity feature learning network (MMFL-Net), which can jointly learn global-local aggregation feature representations of clothing images in a unified framework, aiming to train a cross-domain model for C2S fashion visual similarity. First, a new semantic-spatial feature fusion part is designed to bridge the semantic-spatial gap by applying top-down and bottom-up bidirectional multi-scale feature fusion. Next, a multi-branch deep network architecture is introduced to capture global salient, part-informed, and local detailed information, and extracting robust and discrimination feature embedding by integrating the similarity learning of coarse-to-fine embedding with the multiple granularities. Finally, the improved trihard loss, center loss, and multi-task classification loss are adopted for our MMFL-Net, which can jointly optimize intra-class and inter-class distance and thus explicitly improve intra-class compactness and inter-class discriminability between its visual representations for feature learning. Furthermore, our proposed model also combines the multi-task attribute recognition and classification module with multi-label semantic attributes and product ID labels. Experimental results demonstrate that our proposed MMFL-Net achieves significant improvement over the state-of-the-art methods on the two datasets, DeepFashion-C2S and Street2Shop.

READ FULL TEXT

page 2

page 6

page 19

page 20

page 21

page 23

research
07/23/2021

Learning Discriminative Representations for Multi-Label Image Recognition

Multi-label recognition is a fundamental, and yet is a challenging task ...
research
04/05/2019

Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval

With the increasing number of online stores, there is a pressing need fo...
research
09/03/2020

Tasks Integrated Networks: Joint Detection and Retrieval for Image Search

The traditional object retrieval task aims to learn a discriminative fea...
research
11/10/2019

Semi-Heterogeneous Three-Way Joint Embedding Network for Sketch-Based Image Retrieval

Sketch-based image retrieval (SBIR) is a challenging task due to the lar...
research
11/12/2022

Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning

In this study, we propose a technology called the Fashion Intelligence S...
research
11/24/2017

Cross-Domain Self-supervised Multi-task Feature Learning using Synthetic Imagery

In human learning, it is common to use multiple sources of information j...
research
05/29/2015

Cross-domain Image Retrieval with a Dual Attribute-aware Ranking Network

We address the problem of cross-domain image retrieval, considering the ...

Please sign up or login with your details

Forgot password? Click here to reset