Multi-level Second-order Few-shot Learning

01/15/2022
by   Hongguang Zhang, et al.

We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or unsupervised few-shot image classification and few-shot action recognition. We leverage so-called power-normalized second-order base learner streams, combine them with features that express multiple levels of visual abstraction, and use self-supervised discriminating mechanisms. As Second-order Pooling (SoP) is popular in image recognition, we employ its basic element-wise variant in our pipeline. The goal of the multi-level feature design is to extract feature representations from different layers of a CNN, realizing several levels of visual abstraction to achieve robust few-shot learning. As SoP can handle convolutional feature maps of varying spatial sizes, we also feed image inputs at multiple spatial scales into MlSo. To exploit the discriminative information in the multi-level and multi-scale features, we develop a Feature Matching (FM) module that reweights their respective branches. We also introduce a self-supervised step, which is a discriminator of the spatial level and the scale of abstraction. Our pipeline is trained in an end-to-end manner. With a simple architecture, we demonstrate respectable results on standard datasets such as Omniglot, mini-ImageNet, tiered-ImageNet, and Open MIC, on fine-grained datasets such as CUB Birds, Stanford Dogs, and Cars, and on action recognition datasets such as HMDB51, UCF101, and mini-MIT.
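The claim that SoP can handle feature maps of varying spatial sizes can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a feature map is given as a list of N feature vectors with d channels each, that SoP averages their outer products, and that power normalization is an element-wise signed power with an assumed exponent `gamma`. The abstract does not specify the exact operator, and the function names are hypothetical.

```python
import math


def second_order_pool(features):
    """Average the outer products of N feature vectors of length d.

    The result is a d x d matrix, so its size does not depend on N.
    """
    n = len(features)
    d = len(features[0])
    pooled = [[0.0] * d for _ in range(d)]
    for vec in features:
        for i in range(d):
            for j in range(d):
                pooled[i][j] += vec[i] * vec[j] / n
    return pooled


def power_normalize(matrix, gamma=0.5):
    """Element-wise signed power sign(x) * |x|**gamma (gamma is an assumed value)."""
    return [[math.copysign(abs(x) ** gamma, x) for x in row] for row in matrix]


# Two toy "feature maps" with d = 2 channels but different numbers of positions.
small_map = [[1.0, 2.0], [3.0, 4.0]]
large_map = [[1.0, 2.0], [3.0, 4.0], [1.0, 2.0], [3.0, 4.0]]

desc_small = power_normalize(second_order_pool(small_map))
desc_large = power_normalize(second_order_pool(large_map))
```

Both descriptors are 2 x 2 matrices even though the inputs have different numbers of spatial positions, which is the property that lets second-order streams consume inputs at several scales.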

research
01/06/2020

Few-shot Learning with Multi-scale Self-supervision

Learning concepts from the limited number of datapoints is a challenging...
research
12/27/2020

Power Normalizations in Fine-grained Image, Few-shot Image and Graph Classification

Power Normalizations (PN) are useful non-linear operators which tackle f...
research
11/10/2018

Power Normalizing Second-order Similarity Network for Few-shot Learning

Second- and higher-order statistics of data points have played an import...
research
08/07/2020

Multi-Level Temporal Pyramid Network for Action Detection

Currently, one-stage frameworks have been widely applied for temporal ac...
research
04/16/2019

Object-Oriented Dynamics Learning through Multi-Level Abstraction

Object-based approaches for learning action-conditioned dynamics has dem...
research
07/15/2021

Multi-Level Contrastive Learning for Few-Shot Problems

Contrastive learning is a discriminative approach that aims at grouping ...
research
03/23/2017

Is Second-order Information Helpful for Large-scale Visual Recognition?

By stacking layers of convolution and nonlinearity, convolutional networ...
