Meta-Curvature

02/09/2019
by Eunbyung Park, et al.

We propose meta-curvature, a method that learns curvature information for better generalization and fast model adaptation. Building on model-agnostic meta-learning (MAML), we learn to transform the gradients in the inner optimization so that the transformed gradients generalize better to a new task. To train large-scale neural networks, we decompose the curvature matrix into smaller matrices and capture the dependencies among the model's parameters with a series of tensor products. We demonstrate the method on both few-shot image classification and few-shot reinforcement learning tasks. Experimental results show consistent improvements on classification tasks and promising results on reinforcement learning tasks. We also observe faster convergence of the meta-training process. Finally, we present an analysis that explains the better generalization performance obtained with the meta-trained curvature.
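The decomposition mentioned in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: it assumes one small square matrix per axis of a layer's gradient tensor, applied as mode products, and checks that this matches a single dense Kronecker-product matrix acting on the flattened gradient. The names `mode_product`, `transform_gradient`, and `inner_step`, the tensor shape, and the learning rate are all hypothetical, and the outer loop that would meta-train the factors is omitted.

```python
import numpy as np

def mode_product(tensor, matrix, mode):
    """Multiply `matrix` into `tensor` along axis `mode` (n-mode product)."""
    moved = np.moveaxis(tensor, mode, 0)            # bring the target axis to the front
    out = np.tensordot(matrix, moved, axes=(1, 0))  # contract matrix columns with that axis
    return np.moveaxis(out, 0, mode)                # restore the original axis order

def transform_gradient(grad, factors):
    """Apply one small square matrix per axis of the gradient tensor."""
    for mode, matrix in enumerate(factors):
        grad = mode_product(grad, matrix, mode)
    return grad

def inner_step(params, grad, factors, lr=0.1):
    """One inner-loop update using the transformed gradient (lr is an arbitrary choice)."""
    return params - lr * transform_gradient(grad, factors)

rng = np.random.default_rng(0)
shape = (4, 3, 2)  # hypothetical layer shape, e.g. (out channels, in channels, filter size)
grad = rng.normal(size=shape)
params = rng.normal(size=shape)

# Stand-ins for the meta-learned factors: small perturbations of the identity.
factors = [np.eye(n) + 0.1 * rng.normal(size=(n, n)) for n in shape]

# Equivalent dense form: one 24 x 24 matrix acting on the flattened gradient.
dense = np.kron(np.kron(factors[0], factors[1]), factors[2])
full = (dense @ grad.reshape(-1)).reshape(shape)
factored = transform_gradient(grad, factors)
max_abs_diff = float(np.max(np.abs(full - factored)))

dense_param_count = dense.size                       # 24 * 24 entries
factored_param_count = sum(m.size for m in factors)  # 16 + 9 + 4 entries
updated = inner_step(params, grad, factors)
```

With identity factors, `inner_step` reduces to a plain gradient step, so under these assumptions the factored form can be read as a learned preconditioner that needs far fewer entries than the dense matrix it represents.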


research
03/09/2017

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

We propose an algorithm for meta-learning that is model-agnostic, in the...
research
11/28/2019

A Generalization Theory based on Independent and Task-Identically Distributed Assumption

Existing generalization theories analyze the generalization performance ...
research
10/30/2019

Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation

Model-agnostic meta-learners aim to acquire meta-learned parameters from...
research
05/25/2022

Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization

We tackle real-world problems with complex structures beyond the pixel-b...
research
06/25/2022

p-Meta: Towards On-device Deep Model Adaptation

Data collected by IoT devices are often private and have a large diversi...
research
10/31/2021

Can we learn gradients by Hamiltonian Neural Networks?

In this work, we propose a meta-learner based on ODE neural networks tha...
research
10/09/2020

Hindsight Experience Replay with Kronecker Product Approximate Curvature

Hindsight Experience Replay (HER) is one of the efficient algorithm to s...
