Few-Shot Learning by Dimensionality Reduction in Gradient Space

06/07/2022
by   Martin Gauch, et al.
5

We introduce SubGD, a novel few-shot learning method which is based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace. In experimental and theoretical analyses, we show that models confined to a suitable predefined subspace generalize well for few-shot learning. A suitable subspace fulfills three criteria across the given tasks: it (a) allows to reduce the training error by gradient flow, (b) leads to models that generalize well, and (c) can be identified by stochastic gradient descent. SubGD identifies these subspaces from an eigendecomposition of the auto-correlation matrix of update directions across different tasks. Demonstrably, we can identify low-dimensional suitable subspaces for few-shot learning of dynamical systems, which have varying properties described by one or few parameters of the analytical system description. Such systems are ubiquitous among real-world applications in science and engineering. We experimentally corroborate the advantages of SubGD on three distinct dynamical systems problem settings, significantly outperforming popular few-shot learning methods both in terms of sample efficiency and performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2022

On the Subspace Structure of Gradient-Based Meta-Learning

In this work we provide an analysis of the distribution of the post-adap...
research
05/28/2021

Privately Learning Subspaces

Private data analysis suffers a costly curse of dimensionality. However,...
research
05/31/2022

HyperMAML: Few-Shot Adaptation of Deep Models with Hypernetworks

The aim of Few-Shot learning methods is to train models which can easily...
research
12/08/2022

A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces

Many machine learning problems encode their data as a matrix with a poss...
research
05/31/2019

Subspace Networks for Few-shot Classification

We propose subspace networks for the problem of few-shot classification,...
research
02/07/2022

Grassmann Stein Variational Gradient Descent

Stein variational gradient descent (SVGD) is a deterministic particle in...
research
08/31/2023

Few-shot Diagnosis of Chest x-rays Using an Ensemble of Random Discriminative Subspaces

Due to the scarcity of annotated data in the medical domain, few-shot le...

Please sign up or login with your details

Forgot password? Click here to reset