Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

05/12/2020
by   Yixiong Zou, et al.
11

Few-shot learning (FSL) aims at recognizing novel classes given only few training samples, which still remains a great challenge for deep learning. However, humans can easily recognize novel classes with only few samples. A key component of such ability is the compositional recognition that human can perform, which has been well studied in cognitive science but is not well explored in FSL. Inspired by such capability of humans, to imitate humans' ability of learning visual primitives and composing primitives to recognize novel classes, we propose an approach to FSL to learn a feature representation composed of important primitives, which is jointly trained with two parts, i.e. primitive discovery and primitive enhancing. In primitive discovery, we focus on learning primitives related to object parts by self-supervision from the order of image splits, avoiding extra laborious annotations and alleviating the effect of semantic gaps. In primitive enhancing, inspired by current studies on the interpretability of deep networks, we provide our composition view for the FSL baseline model. To modify this model for effective composition, inspired by both mathematical deduction and biological studies (the Hebbian Learning rule and the Winner-Take-All mechanism), we propose a soft composition mechanism by enlarging the activation of important primitives while reducing that of others, so as to enhance the influence of important primitives and better utilize these primitives to compose novel classes. Extensive experiments on public benchmarks are conducted on both the few-shot image classification and video recognition tasks. Our method achieves the state-of-the-art performance on all these datasets and shows better interpretability.

READ FULL TEXT

page 7

page 8

research
08/20/2022

Learning Primitive-aware Discriminative Representations for FSL

Few-shot learning (FSL) aims to learn a classifier that can be easily ad...
research
08/22/2022

Reference-Limited Compositional Zero-Shot Learning

Compositional zero-shot learning (CZSL) refers to recognizing unseen com...
research
06/11/2019

Weakly-supervised Compositional FeatureAggregation for Few-shot Recognition

Learning from a few examples is a challenging task for machine learning....
research
01/14/2023

Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)

Pioneering data profiling systems such as Metanome and OpenClean brought...
research
01/28/2021

COMPAS: Representation Learning with Compositional Part Sharing for Few-Shot Classification

Few-shot image classification consists of two consecutive learning proce...
research
03/31/2022

Do Vision-Language Pretrained Models Learn Primitive Concepts?

Vision-language pretrained models have achieved impressive performance o...
research
02/14/2022

HAKE: A Knowledge Engine Foundation for Human Activity Understanding

Human activity understanding is of widespread interest in artificial int...

Please sign up or login with your details

Forgot password? Click here to reset