Debiasing a First-order Heuristic for Approximate Bi-level Optimization

06/04/2021
by Valerii Likhosherstov, et al.

Approximate bi-level optimization (ABLO) consists of (outer-level) optimization problems, involving numerical (inner-level) optimization loops. While ABLO has many applications across deep learning, it suffers from time and memory complexity proportional to the length r of its inner optimization loop. To address this complexity, an earlier first-order method (FOM) was proposed as a heuristic that omits second derivative terms, yielding significant speed gains and requiring only constant memory. Despite FOM's popularity, there is a lack of theoretical understanding of its convergence properties. We contribute by theoretically characterizing FOM's gradient bias under mild assumptions. We further demonstrate a rich family of examples where FOM-based SGD does not converge to a stationary point of the ABLO objective. We address this concern by proposing an unbiased FOM (UFOM) enjoying constant memory complexity as a function of r. We characterize the introduced time-variance tradeoff, demonstrate convergence bounds, and find an optimal UFOM for a given ABLO problem. Finally, we propose an efficient adaptive UFOM scheme.
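
To make the notion of gradient bias concrete, here is a minimal sketch on a toy problem that is not taken from the paper. Everything in it is an assumption chosen for illustration: the scalar outer parameter theta, the quadratic inner loss 0.5*a*(phi - c)**2, the outer loss 0.5*(phi_r - t)**2, the step size alpha, and the function names. The inner loop starts at theta and runs r gradient steps. The exact outer gradient multiplies the outer residual by d(phi_r)/d(theta) = (1 - alpha*a)**r, which contains the inner second derivative a. The first-order-style estimate drops that factor, which is the kind of omission the abstract describes.

```python
def inner_loop(theta, a, c, alpha, r):
    """Run r gradient steps on the toy inner loss 0.5*a*(phi - c)**2, starting at theta."""
    phi = theta
    for _ in range(r):
        phi -= alpha * a * (phi - c)
    return phi


def outer_objective(theta, tasks, alpha, r):
    """Average toy outer loss 0.5*(phi_r - t)**2 over tasks given as (a, c, t) tuples."""
    total = 0.0
    for a, c, t in tasks:
        phi_r = inner_loop(theta, a, c, alpha, r)
        total += 0.5 * (phi_r - t) ** 2
    return total / len(tasks)


def outer_gradients(theta, tasks, alpha, r):
    """Return (exact, first_order) gradients of outer_objective with respect to theta."""
    exact, first_order = 0.0, 0.0
    for a, c, t in tasks:
        phi_r = inner_loop(theta, a, c, alpha, r)
        residual = phi_r - t  # derivative of the outer loss w.r.t. phi_r
        # Each inner step has Jacobian (1 - alpha * a); 'a' is the inner Hessian.
        jacobian = (1.0 - alpha * a) ** r
        exact += residual * jacobian
        # First-order heuristic: treat the Jacobian as 1 (second derivative omitted).
        first_order += residual
    return exact / len(tasks), first_order / len(tasks)


# Hypothetical tasks (a, c, t) and settings, chosen only for illustration.
tasks = [(1.0, 0.0, 1.0), (3.0, 2.0, -1.0)]
exact, first_order = outer_gradients(0.5, tasks, alpha=0.1, r=5)
print("exact gradient:      ", exact)
print("first-order estimate:", first_order)
```

In this sketch the two estimates differ because each task scales its residual by a different factor (1 - alpha*a)**r, so the two quantities generally vanish at different values of theta. Note also that the exact gradient here is available in closed form only because the toy inner loss is quadratic; in general it requires differentiating through all r inner steps.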

research
06/05/2020

UFO-BLO: Unbiased First-Order Bilevel Optimization

Bilevel optimization (BLO) is a popular approach with many applications ...
research
07/17/2023

Convex Bi-Level Optimization Problems with Non-smooth Outer Objective Function

In this paper, we propose the Bi-Sub-Gradient (Bi-SG) method, which is a...
research
02/02/2023

Convergent First-Order Methods for Bi-level Optimization and Stackelberg Games

We propose an algorithm to solve a class of bi-level optimization proble...
research
05/26/2022

Fair Representation Learning through Implicit Path Alignment

We consider a fair representation learning perspective, where optimal pr...
research
12/09/2021

A Fully Single Loop Algorithm for Bilevel Optimization without Hessian Inverse

In this paper, we propose a new Hessian inverse free Fully Single Loop A...
research
05/27/2022

Will Bilevel Optimizers Benefit from Loops

Bilevel optimization has arisen as a powerful tool for solving a variety...
research
05/16/2019

Efficient Optimization of Loops and Limits with Randomized Telescoping Sums

We consider optimization problems in which the objective requires an inn...
