Stacked Semantic-Guided Attention Model for Fine-Grained Zero-Shot Learning

05/21/2018
by   Yunlong Yu, et al.
0

Zero-Shot Learning (ZSL) is achieved via aligning the semantic relationships between the global image feature vector and the corresponding class semantic descriptions. However, using the global features to represent fine-grained images may lead to sub-optimal results since they neglect the discriminative differences of local regions. Besides, different regions contain distinct discriminative information. The important regions should contribute more to the prediction. To this end, we propose a novel stacked semantics-guided attention (S2GA) model to obtain semantic relevant features by using individual class semantic features to progressively guide the visual features to generate an attention map for weighting the importance of different local regions. Feeding both the integrated visual features and the class semantic features into a multi-class classification architecture, the proposed framework can be trained end-to-end. Extensive experimental results on CUB and NABird datasets show that the proposed approach has a consistent improvement on both fine-grained zero-shot classification and retrieval tasks.

READ FULL TEXT

page 1

page 7

research
03/01/2019

Learning where to look: Semantic-Guided Multi-Attention Localization for Zero-Shot Learning

Zero-shot learning extends the conventional object classification to the...
research
07/04/2017

Zero-Shot Fine-Grained Classification by Deep Feature Learning with Semantics

Fine-grained image classification, which aims to distinguish images with...
research
05/22/2017

Semantic Softmax Loss for Zero-Shot Learning

A typical pipeline for Zero-Shot Learning (ZSL) is to integrate the visu...
research
07/27/2020

Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches

Zero-shot learning (ZSL) is commonly used to address the very pervasive ...
research
06/07/2022

SHRED: 3D Shape Region Decomposition with Learned Local Operations

We present SHRED, a method for 3D SHape REgion Decomposition. SHRED take...
research
11/03/2021

An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot Learning

Zero-Shot Learning (ZSL) aims to transfer learned knowledge from observe...
research
06/14/2023

Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation

Zero-Shot Learning (ZSL), which aims at automatically recognizing unseen...

Please sign up or login with your details

Forgot password? Click here to reset