Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Network

12/05/2017
by Long Chen, et al.

We propose a novel framework, the Semantics-Preserving Adversarial Embedding Network (SP-AEN), for zero-shot visual recognition (ZSL), where both the test images and their classes are unseen during training. SP-AEN tackles a problem inherent to the prevailing family of embedding-based ZSL methods, namely semantic loss: semantics that are non-discriminative for the training classes are discarded during training, even though they may be informative for the test classes. Specifically, SP-AEN prevents semantic loss by introducing an independent visual-to-semantic space embedder, which disentangles the semantic space into two subspaces for two arguably conflicting objectives: classification and reconstruction. Through adversarial learning of the two subspaces, SP-AEN transfers semantics from the reconstructive subspace to the discriminative one, improving zero-shot recognition of unseen classes. Compared to prior work, SP-AEN not only improves classification but also generates photo-realistic images, demonstrating the effectiveness of semantic preservation. On four benchmarks (CUB, AWA, SUN, and aPY), SP-AEN considerably outperforms other state-of-the-art methods, with absolute gains of 12.2%, 9.3%, 4.0%, and 3.6%, respectively, in harmonic mean.
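The abstract reports its gains in harmonic mean but does not say what is being averaged. A minimal sketch, assuming the common generalized zero-shot convention of taking the harmonic mean of the accuracy on seen classes and the accuracy on unseen classes; the function name and the example accuracies are illustrative and are not taken from the paper:

```python
def harmonic_mean(acc_seen: float, acc_unseen: float) -> float:
    """Harmonic mean of two accuracies, each given as a fraction in [0, 1].

    Assumption: the two inputs are seen-class and unseen-class accuracy,
    which the abstract itself does not spell out.
    """
    total = acc_seen + acc_unseen
    if total == 0:
        # Both accuracies are zero; define the mean as zero to avoid 0/0.
        return 0.0
    return 2 * acc_seen * acc_unseen / total


# Hypothetical accuracies, chosen only to show how the metric behaves.
balanced = harmonic_mean(0.5, 0.5)
skewed = harmonic_mean(0.8, 0.2)
print(f"balanced: {balanced:.2f}, skewed: {skewed:.2f}")
```

Both hypothetical pairs have the same arithmetic mean, yet the skewed pair scores lower. Under this assumption, the metric rewards models that do well on unseen classes without sacrificing seen ones.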


