Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD

09/15/2021
by Chen Fan, et al.

We propose a new computationally efficient first-order algorithm for Model-Agnostic Meta-Learning (MAML). The key enabling technique is to interpret MAML as a bilevel optimization (BLO) problem and to use sign-based SGD (signSGD) as the lower-level optimizer of BLO. We show that MAML, through the lens of signSGD-oriented BLO, naturally yields an alternating optimization scheme that requires only first-order gradients of the learned meta-model. We term the resulting MAML algorithm Sign-MAML. Compared to the conventional first-order MAML (FO-MAML) algorithm, Sign-MAML is theoretically grounded, as it does not rely on assuming that second-order derivatives are absent during meta-training. In practice, we show that Sign-MAML outperforms FO-MAML on various few-shot image classification tasks and, compared to MAML, achieves a much more graceful tradeoff between classification accuracy and computational efficiency.
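
The alternating scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy setting that does not appear in the source: linear regression tasks with a squared loss, a single signSGD step in the lower level, and plain gradient descent in the upper level. All function names, task data, and step sizes are hypothetical. The sketch relies on one property of the sign function: its derivative is zero almost everywhere, so the adapted weights depend on the meta-weights through an identity Jacobian, and the meta-gradient reduces to a first-order gradient evaluated at the adapted weights.

```python
import numpy as np

def loss_and_grad(w, X, y):
    """Mean squared error of a linear model and its gradient w.r.t. w."""
    residual = X @ w - y
    loss = float(np.mean(residual ** 2))
    grad = 2.0 * X.T @ residual / len(y)
    return loss, grad

def sign_inner_adapt(w, X, y, alpha=0.01, steps=1):
    """Lower level: adapt the meta-weights to one task with signSGD steps."""
    w_task = w.copy()
    for _ in range(steps):
        _, g = loss_and_grad(w_task, X, y)
        w_task = w_task - alpha * np.sign(g)  # only the gradient sign is used
    return w_task

def sign_maml_meta_grad(w, tasks, alpha=0.01, steps=1):
    """Upper level: first-order meta-gradient averaged over tasks.

    Because sign(.) has zero derivative almost everywhere, d(w_task)/d(w)
    is the identity, so no second-order terms are needed here.
    """
    g_meta = np.zeros_like(w)
    for Xs, ys, Xq, yq in tasks:
        w_task = sign_inner_adapt(w, Xs, ys, alpha, steps)
        _, g = loss_and_grad(w_task, Xq, yq)  # query-loss gradient at w_task
        g_meta += g
    return g_meta / len(tasks)

def make_tasks(rng, n_tasks=8, n=20):
    """Hypothetical toy tasks: noisy variants of one linear map."""
    base = np.array([2.0, -1.0, 0.5])
    tasks = []
    for _ in range(n_tasks):
        w_true = base + 0.1 * rng.standard_normal(base.size)
        Xs = rng.standard_normal((n, base.size))  # support (train) inputs
        Xq = rng.standard_normal((n, base.size))  # query (validation) inputs
        tasks.append((Xs, Xs @ w_true, Xq, Xq @ w_true))
    return tasks

def run_demo(seed=0, meta_steps=200, beta=0.05, alpha=0.01):
    """Meta-train from zeros; return query loss after adaptation, before and after."""
    rng = np.random.default_rng(seed)
    tasks = make_tasks(rng)
    w = np.zeros(3)

    def adapted_query_loss(w_meta):
        losses = [
            loss_and_grad(sign_inner_adapt(w_meta, Xs, ys, alpha), Xq, yq)[0]
            for Xs, ys, Xq, yq in tasks
        ]
        return float(np.mean(losses))

    before = adapted_query_loss(w)
    for _ in range(meta_steps):
        w = w - beta * sign_maml_meta_grad(w, tasks, alpha)
    return before, adapted_query_loss(w)

if __name__ == "__main__":
    before, after = run_demo()
    print(f"adapted query loss before meta-training: {before:.4f}")
    print(f"adapted query loss after meta-training:  {after:.4f}")
```

In this sketch the only difference from a plain gradient inner step is the `np.sign` call, which moves every coordinate by exactly `alpha` and removes the Hessian term from the chain rule. A practical version for image classification would replace the linear model with a neural network and would likely use an automatic differentiation library.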

research
04/21/2021

Stateless Neural Meta-Learning using Second-Order Gradients

Deep learning typically requires large data sets and much compute power ...
research
06/08/2023

EMO: Episodic Memory Optimization for Few-Shot Meta-Learning

Few-shot meta-learning presents a challenge for gradient descent optimiz...
research
06/19/2021

EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization

Gradient-based meta-learning and hyperparameter optimization have seen s...
research
10/16/2019

Model-Agnostic Meta-Learning using Runge-Kutta Methods

Meta-learning has emerged as an important framework for learning new tas...
research
09/25/2019

ES-MAML: Simple Hessian-Free Meta Learning

We introduce ES-MAML, a new framework for solving the model agnostic met...
research
10/31/2019

Hierarchical Expert Networks for Meta-Learning

The goal of meta-learning is to train a model on a variety of learning t...
research
12/30/2019

A Consistently Oriented Basis for Eigenanalysis

Repeated application of machine-learning, eigen-centric methods to an ev...
