Clothing Retrieval with Visual Attention Model

10/31/2017
by   Zhonghao Wang, et al.
0

Clothing retrieval is a challenging problem in computer vision. With the advance of Convolutional Neural Networks (CNNs), the accuracy of clothing retrieval has been significantly improved. FashionNet[1], a recent study, proposes to employ a set of artificial features in the form of landmarks for clothing retrieval, which are shown to be helpful for retrieval. However, the landmark detection module is trained with strong supervision which requires considerable efforts to obtain. In this paper, we propose a self-learning Visual Attention Model (VAM) to extract attention maps from clothing images. The VAM is further connected to a global network to form an end-to-end network structure through Impdrop connection which randomly Dropout on the feature maps with the probabilities given by the attention map. Extensive experiments on several widely used benchmark clothing retrieval data sets have demonstrated the promise of the proposed method. We also show that compared to the trivial Product connection, the Impdrop connection makes the network structure more robust when training sets of limited size are used.

READ FULL TEXT

page 2

page 4

research
08/01/2022

A Rotation Meanout Network with Invariance for Dermoscopy Image Classification and Retrieval

The computer-aided diagnosis (CAD) system can provide a reference basis ...
research
11/05/2016

What Is the Best Practice for CNNs Applied to Visual Instance Retrieval?

Previous work has shown that feature maps of deep convolutional neural n...
research
10/27/2022

Improved Projection Learning for Lower Dimensional Feature Maps

The requirement to repeatedly move large feature maps off- and on-chip d...
research
03/29/2022

Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding

Visual grounding focuses on establishing fine-grained alignment between ...
research
12/19/2021

Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection

Keypoint detection plays an important role in a wide range of applicatio...
research
03/10/2020

Channel Attention with Embedding Gaussian Process: A Probabilistic Methodology

Channel attention mechanisms, as the key components of some modern convo...
research
01/20/2017

A Large-scale Dataset and Benchmark for Similar Trademark Retrieval

Trademark retrieval (TR) has become an important yet challenging problem...

Please sign up or login with your details

Forgot password? Click here to reset