Balancing Generalization and Specialization in Zero-shot Learning

01/06/2022
by   Yun Li, et al.
1

Zero-Shot Learning (ZSL) aims to transfer classification capability from seen to unseen classes. Recent methods have proved that generalization and specialization are two essential abilities to achieve good performance in ZSL. However, they all focus on only one of the abilities, resulting in models that are either too general with the degraded classifying ability or too specialized to generalize to unseen classes. In this paper, we propose an end-to-end network with balanced generalization and specialization abilities, termed as BGSNet, to take advantage of both abilities, and balance them at instance- and dataset-level. Specifically, BGSNet consists of two branches: the Generalization Network (GNet), which applies episodic meta-learning to learn generalized knowledge, and the Balanced Specialization Network (BSNet), which adopts multiple attentive extractors to extract discriminative features and fulfill the instance-level balance. A novel self-adjusting diversity loss is designed to optimize BSNet with less redundancy and more diversity. We further propose a differentiable dataset-level balance and update the weights in a linear annealing schedule to simulate network pruning and thus obtain the optimal structure for BSNet at a low cost with dataset-level balance achieved. Experiments on four benchmark datasets demonstrate our model's effectiveness. Sufficient component ablations prove the necessity of integrating generalization and specialization abilities.

READ FULL TEXT

page 5

page 6

research
03/06/2019

Transfer feature generating networks with semantic classes structure for zero-shot learning

Suffering from the generating feature inconsistence of seen classes trai...
research
04/01/2020

Generalized Zero-Shot Learning Via Over-Complete Distribution

A well trained and generalized deep neural network (DNN) should be robus...
research
04/22/2021

Attribute-Modulated Generative Meta Learning for Zero-Shot Classification

Zero-shot learning (ZSL) aims to transfer knowledge from seen classes to...
research
08/30/2019

TGG: Transferable Graph Generation for Zero-shot and Few-shot Learning

Zero-shot and few-shot learning aim to improve generalization to unseen ...
research
11/20/2019

Heterogeneous Graph-based Knowledge Transfer for Generalized Zero-shot Learning

Generalized zero-shot learning (GZSL) tackles the problem of learning to...
research
02/03/2023

Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation

We propose a meta-ability decoupling (MAD) paradigm, which brings togeth...
research
02/21/2018

Generalization in Machine Learning via Analytical Learning Theory

This paper introduces a novel measure-theoretic learning theory to analy...

Please sign up or login with your details

Forgot password? Click here to reset