Learning a Deep Embedding Model for Zero-Shot Learning

11/15/2016
by   Li Zhang, et al.
0

Zero-shot learning (ZSL) models rely on learning a joint embedding space where both textual/semantic description of object classes and visual representation of object images can be projected to for nearest neighbour search. Despite the success of deep neural networks that learn an end-to-end model between text and images in other vision problems such as image captioning, very few deep ZSL model exists and they show little advantage over ZSL models that utilise deep feature representations but do not learn an end-to-end embedding. In this paper we argue that the key to make deep ZSL models succeed is to choose the right embedding space. Instead of embedding into a semantic space or an intermediate space, we propose to use the visual space as the embedding space. This is because that in this space, the subsequent nearest neighbour search would suffer much less from the hubness problem and thus become more effective. This model design also provides a natural mechanism for multiple semantic modalities (e.g., attributes and sentence descriptions) to be fused and optimised jointly in an end-to-end manner. Extensive experiments on four benchmarks show that our model significantly outperforms the existing models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2016

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning

Leveraging class semantic descriptions and examples of known objects, ze...
research
08/30/2018

Towards Effective Deep Embedding for Zero-Shot Learning

Zero-shot learning (ZSL) attempts to recognize visual samples of unseen ...
research
06/01/2015

Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions

One of the main challenges in Zero-Shot Learning of visual categories is...
research
03/17/2017

Learning Robust Visual-Semantic Embeddings

Many of the existing methods for learning joint embedding of images and ...
research
06/30/2019

Visual Space Optimization for Zero-shot Learning

Zero-shot learning, which aims to recognize new categories that are not ...
research
03/18/2018

Discriminative Learning of Latent Features for Zero-Shot Recognition

Zero-shot learning (ZSL) aims to recognize unseen image categories by le...
research
03/08/2020

Unifying Specialist Image Embedding into Universal Image Embedding

Deep image embedding provides a way to measure the semantic similarity o...

Please sign up or login with your details

Forgot password? Click here to reset