On Implicit Bias in Overparameterized Bilevel Optimization

12/28/2022
by   Paul Vicol, et al.
0

Many problems in machine learning involve bilevel optimization (BLO), including hyperparameter optimization, meta-learning, and dataset distillation. Bilevel problems consist of two nested sub-problems, called the outer and inner problems, respectively. In practice, often at least one of these sub-problems is overparameterized. In this case, there are many ways to choose among optima that achieve equivalent objective values. Inspired by recent studies of the implicit bias induced by optimization algorithms in single-level optimization, we investigate the implicit bias of gradient-based algorithms for bilevel optimization. We delineate two standard BLO methods – cold-start and warm-start – and show that the converged solution or long-run behavior depends to a large degree on these and other algorithmic choices, such as the hypergradient approximation. We also show that the inner solutions obtained by warm-start BLO can encode a surprising amount of information about the outer objective, even when the outer parameters are low-dimensional. We believe that implicit bias deserves as central a role in the study of bilevel optimization as it has attained in the study of single-level neural net optimization.

READ FULL TEXT

page 12

page 31

research
12/18/2017

A Bridge Between Hyperparameter Optimization and Larning-to-learn

We consider a class of a nested optimization problems involving inner an...
research
07/17/2023

Convex Bi-Level Optimization Problems with Non-smooth Outer Objective Function

In this paper, we propose the Bi-Sub-Gradient (Bi-SG) method, which is a...
research
06/23/2020

On the Global Optimality of Model-Agnostic Meta-Learning

Model-agnostic meta-learning (MAML) formulates meta-learning as a bileve...
research
11/29/2021

Amortized Implicit Differentiation for Stochastic Bilevel Optimization

We study a class of algorithms for solving bilevel optimization problems...
research
06/04/2023

A Generalized Alternating Method for Bilevel Optimization under the Polyak-Łojasiewicz Condition

Bilevel optimization has recently regained interest owing to its applica...
research
05/26/2022

Fair Representation Learning through Implicit Path Alignment

We consider a fair representation learning perspective, where optimal pr...
research
08/26/2022

On the Implicit Bias in Deep-Learning Algorithms

Gradient-based deep-learning algorithms exhibit remarkable performance i...

Please sign up or login with your details

Forgot password? Click here to reset