Weakly-supervised Compositional FeatureAggregation for Few-shot Recognition

06/11/2019
by   Ping Hu, et al.
2

Learning from a few examples is a challenging task for machine learning. While recent progress has been made for this problem, most of the existing methods ignore the compositionality in visual concept representation (e.g. objects are built from parts or composed of semantic attributes), which is key to the human ability to easily learn from a small number of examples. To enhance the few-shot learning models with compositionality, in this paper we present the simple yet powerful Compositional Feature Aggregation (CFA) module as a weakly-supervised regularization for deep networks. Given the deep feature maps extracted from the input, our CFA module first disentangles the feature space into disjoint semantic subspaces that model different attributes, and then bilinearly aggregates the local features within each of these subspaces. CFA explicitly regularizes the representation with both semantic and spatial compositionality to produce discriminative representations for few-shot recognition tasks. Moreover, our method does not need any supervision for attributes and object parts during training, thus can be conveniently plugged into existing models for end-to-end optimization while keeping the model size and computation cost nearly the same. Extensive experiments on few-shot image classification and action recognition tasks demonstrate that our method provides substantial improvements over recent state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/02/2020

Few-shot Learning with Weakly-supervised Object Localization

Few-shot learning (FSL) aims to learn novel visual categories from very ...
research
12/21/2018

Learning Compositional Representations for Few-Shot Recognition

One of the key limitations of modern deep learning based approaches lies...
research
05/12/2020

Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

Few-shot learning (FSL) aims at recognizing novel classes given only few...
research
03/18/2018

Discriminative Learning of Latent Features for Zero-Shot Recognition

Zero-shot learning (ZSL) aims to recognize unseen image categories by le...
research
12/13/2021

Shaping Visual Representations with Attributes for Few-Shot Learning

Few-shot recognition aims to recognize novel categories under low-data r...
research
06/13/2022

Compositional Mixture Representations for Vision and Text

Learning a common representation space between vision and language allow...
research
10/20/2014

Supervised mid-level features for word image representation

This paper addresses the problem of learning word image representations:...

Please sign up or login with your details

Forgot password? Click here to reset