Global Convergence of MAML for LQR

05/31/2020
by   Igor Molybog, et al.
15

The paper studies the performance of the Model-Agnostic Meta-Learning (MAML) algorithm as an optimization method. The goal is to determine the global convergence of MAML on sequential decision-making tasks possessing a common structure. We prove that the benign landscape of a single task leads to the global convergence of MAML in the single-task scenario and in the scenario of multiple structurally connected tasks. We also show that there is a two-task scenario that does not possess this global convergence property even for identical tasks. We analyze the landscape of the MAML objective on LQR tasks to determine what type of similarities in their structures enables the algorithm to converge to the globally optimal solution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/24/2018

Optimization of Weighted Individual Energy Efficiencies in Interference Network

This paper studies the maximization of the weighted sum energy efficienc...
research
04/21/2018

Global Convergence Analysis of the Flower Pollination Algorithm: A Discrete-Time Markov Chain Approach

Flower pollination algorithm is a recent metaheuristic algorithm for sol...
research
02/04/2014

A Survey of Multi-Objective Sequential Decision-Making

Sequential decision-making problems with multiple objectives arise natur...
research
10/04/2022

Are All Losses Created Equal: A Neural Collapse Perspective

While cross entropy (CE) is the most commonly used loss to train deep ne...
research
09/27/2019

Improving Federated Learning Personalization via Model Agnostic Meta Learning

Federated Learning (FL) refers to learning a high quality global model b...
research
11/19/2020

On the convergence of an improved discrete simulated annealing via landscape modification

In this paper, we propose new Metropolis-Hastings and simulated annealin...
research
02/26/2017

Iterative Local Voting for Collective Decision-making in Continuous Spaces

Many societal decision problems lie in high-dimensional continuous space...

Please sign up or login with your details

Forgot password? Click here to reset