Fine-grained Apparel Classification and Retrieval without rich annotations

11/06/2018
by   Aniket Bhatnagar, et al.
8

The ability to correctly classify and retrieve apparel images has a variety of applications important to e-commerce, online advertising and internet search. In this work, we propose a robust framework for fine-grained apparel classification, in-shop and cross-domain retrieval which eliminates the requirement of rich annotations like bounding boxes and human-joints or clothing landmarks, and training of bounding box/ key-landmark detector for the same. Factors such as subtle appearance differences, variations in human poses, different shooting angles, apparel deformations, and self-occlusion add to the challenges in classification and retrieval of apparel items. Cross-domain retrieval is even harder due to the presence of large variation between online shopping images, usually taken in ideal lighting, pose, positive angle and clean background as compared with street photos captured by users in complicated conditions with poor lighting and cluttered scenes. Our framework uses compact bilinear CNN with tensor sketch algorithm to generate embeddings that capture local pairwise feature interactions in a translationally invariant manner. For apparel classification, we pass the feature embeddings through a softmax classifier, while, the in-shop and cross-domain retrieval pipelines use a triplet-loss based optimization approach, such that squared Euclidean distance between embeddings measures the dissimilarity between the images. Unlike previous works that relied on bounding box, key clothing landmarks or human joint detectors to assist the final deep classifier, proposed framework can be trained directly on the provided category labels or generated triplets for triplet loss optimization. Lastly, Experimental results on the DeepFashion fine-grained categorization, and in-shop and consumer-to-shop retrieval datasets provide a comparative analysis with previous work performed in the domain.

READ FULL TEXT

page 10

page 11

page 12

research
05/29/2015

Cross-domain Image Retrieval with a Dual Attribute-aware Ranking Network

We address the problem of cross-domain image retrieval, considering the ...
research
07/15/2014

Part-based R-CNNs for Fine-grained Category Detection

Semantic part localization can facilitate fine-grained categorization by...
research
03/02/2017

BoxCars: Improving Fine-Grained Recognition of Vehicles using 3D Bounding Boxes in Traffic Surveillance

In this paper, we focus on fine-grained recognition of vehicles mainly i...
research
05/20/2016

Fine-Grained Classification of Pedestrians in Video: Benchmark and State of the Art

A video dataset that is designed to study fine-grained categorisation of...
research
04/05/2019

Snap and Find: Deep Discrete Cross-domain Garment Image Retrieval

With the increasing number of online stores, there is a pressing need fo...
research
01/23/2019

DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images

Understanding fashion images has been advanced by benchmarks with rich a...

Please sign up or login with your details

Forgot password? Click here to reset