Recasting Gradient-Based Meta-Learning as Hierarchical Bayes

by Erin Grant, et al.

Meta-learning allows an intelligent agent to leverage prior learning episodes as a basis for quickly improving performance on a novel task. Bayesian hierarchical modeling provides a theoretical framework for formalizing meta-learning as inference for a set of parameters that are shared across tasks. Here, we reformulate the model-agnostic meta-learning algorithm (MAML) of Finn et al. (2017) as a method for probabilistic inference in a hierarchical Bayesian model. In contrast to prior methods for meta-learning via hierarchical Bayes, MAML is naturally applicable to complex function approximators through its use of a scalable gradient descent procedure for posterior inference. Furthermore, the identification of MAML as hierarchical Bayes provides a way to understand the algorithm's operation as a meta-learning procedure, as well as an opportunity to make use of computational strategies for efficient inference. We use this opportunity to propose an improvement to the MAML algorithm that makes use of techniques from approximate inference and curvature estimation.
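The gradient-based procedure the abstract refers to can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a one-parameter linear model y = theta * x with squared-error loss, a single inner gradient step, and hypothetical helper names (`make_task`, `adapt`, `meta_gradient`, `meta_train`) introduced only for this example. Read loosely through the hierarchical view, `theta` plays the role of the parameters shared across tasks, and the adapted value `phi` is a task-specific point estimate obtained by a truncated gradient descent started at `theta`.

```python
# Toy MAML-style sketch in pure Python (illustrative assumptions throughout).

def loss(theta, xs, ys):
    """Mean squared error of the model y = theta * x."""
    return sum((theta * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def grad(theta, xs, ys):
    """Derivative of the mean squared error with respect to theta."""
    return sum(2 * x * (theta * x - y) for x, y in zip(xs, ys)) / len(xs)

def hessian(xs):
    """Second derivative of the loss; constant in theta for this model."""
    return sum(2 * x * x for x in xs) / len(xs)

def adapt(theta, xs, ys, alpha):
    """Inner loop: one gradient step from the shared initialization."""
    return theta - alpha * grad(theta, xs, ys)

def meta_gradient(theta, task, alpha):
    """Derivative of the validation loss at phi with respect to theta.

    Chain rule: dL_val/dtheta = dL_val/dphi * dphi/dtheta,
    where dphi/dtheta = 1 - alpha * hessian(train inputs).
    """
    (xs_tr, ys_tr), (xs_val, ys_val) = task
    phi = adapt(theta, xs_tr, ys_tr, alpha)
    return grad(phi, xs_val, ys_val) * (1 - alpha * hessian(xs_tr))

def meta_train(tasks, theta=0.0, alpha=0.1, beta=0.05, steps=200):
    """Outer loop: gradient descent on the average post-adaptation loss."""
    for _ in range(steps):
        g = sum(meta_gradient(theta, t, alpha) for t in tasks) / len(tasks)
        theta -= beta * g
    return theta

def make_task(slope, xs=(1.0, 2.0)):
    """Hypothetical task: noiseless data from y = slope * x.

    For simplicity the same points serve as train and validation split.
    """
    xs = list(xs)
    ys = [slope * x for x in xs]
    return (xs, ys), (xs, ys)

tasks = [make_task(1.0), make_task(3.0)]
theta = meta_train(tasks)
print("meta-learned initialization:", theta)
```

In this toy setting the meta-learned initialization settles between the two task slopes, so a single inner step moves it toward either task. The step size `alpha` and the number of inner steps control how far each task-specific estimate may move away from the shared parameters.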


