Stateless Neural Meta-Learning using Second-Order Gradients

04/21/2021
by Mike Huisman, et al.

Deep learning typically requires large data sets and substantial compute power for each new problem that is learned. Meta-learning can be used to learn a good prior that facilitates quick learning, thereby relaxing these requirements so that new tasks can be learned more quickly; two popular approaches are MAML and the meta-learner LSTM. In this work, we compare the two and formally show that the meta-learner LSTM subsumes MAML. Combining this insight with recent empirical findings, we construct a new algorithm (dubbed TURTLE) which is simpler than the meta-learner LSTM yet more expressive than MAML. TURTLE outperforms both techniques at few-shot sine wave regression and at image classification on miniImageNet and CUB without any additional hyperparameter tuning, at a computational cost comparable to that of second-order MAML. The key to TURTLE's success lies in the use of second-order gradients, which also significantly increases the performance of the meta-learner LSTM, by 1-6% accuracy.
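To make the term "second-order gradients" concrete, the following is a minimal sketch of a MAML-style meta-gradient on a toy problem. It is not the TURTLE algorithm or the authors' code. The one-parameter model y = w * x, the data points, the step size alpha, and all function names are assumptions chosen for illustration. After one inner gradient step on a support set, the exact meta-gradient of the query loss includes the factor (1 - alpha * Hessian); the first-order approximation drops it.

```python
# Toy illustration (assumed setup, not from the paper): one-parameter
# model y = w * x with mean squared error, one inner gradient step.

def loss(w, data):
    """Mean squared error of the model y = w * x on (x, y) pairs."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

def grad(w, data):
    """First derivative of the loss with respect to w."""
    return sum(2 * x * (w * x - y) for x, y in data) / len(data)

def hessian(data):
    """Second derivative of the loss; constant in w for this quadratic."""
    return sum(2 * x * x for x, _ in data) / len(data)

def meta_gradients(w, support, query, alpha):
    """Return (first_order, second_order) meta-gradients w.r.t. w."""
    # Inner loop: adapt w with one gradient step on the support set.
    w_adapted = w - alpha * grad(w, support)
    # Gradient of the query loss at the adapted parameter.
    g_query = grad(w_adapted, query)
    # First-order approximation: treat d(w_adapted)/dw as 1.
    first_order = g_query
    # Exact meta-gradient: chain rule through the inner update,
    # d(w_adapted)/dw = 1 - alpha * (second derivative of support loss).
    second_order = g_query * (1.0 - alpha * hessian(support))
    return first_order, second_order

support = [(1.0, 2.0), (2.0, 4.0)]  # hypothetical support set
query = [(3.0, 6.0)]                # hypothetical query set
fo, so = meta_gradients(0.5, support, query, alpha=0.1)
print("first-order:", fo, "second-order:", so)
```

In this toy case the two estimates differ only by a scalar factor; in a multi-parameter network the corresponding factor is a matrix involving the Hessian of the support loss, which is where the extra cost of second-order methods comes from.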


research
10/27/2021

Accelerating Gradient-based Meta Learner

Meta Learning has been in focus in recent years due to the meta-learner ...
research
09/15/2021

Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD

We propose a new computationally-efficient first-order algorithm for Mod...
research
10/31/2021

Can we learn gradients by Hamiltonian Neural Networks?

In this work, we propose a meta-learner based on ODE neural networks tha...
research
06/09/2021

Meta-Interpretive Learning as Metarule Specialisation

In Meta-Interpretive Learning (MIL) the metarules, second-order datalog ...
research
06/19/2021

EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization

Gradient-based meta-learning and hyperparameter optimization have seen s...
research
10/19/2021

BAMLD: Bayesian Active Meta-Learning by Disagreement

Data-efficient learning algorithms are essential in many practical appli...
research
04/22/2023

Constructing a meta-learner for unsupervised anomaly detection

Unsupervised anomaly detection (AD) is critical for a wide range of prac...
