A Fully Single Loop Algorithm for Bilevel Optimization without Hessian Inverse

12/09/2021
by   Junyi Li, et al.
3

In this paper, we propose a new Hessian inverse free Fully Single Loop Algorithm (FSLA) for bilevel optimization problems. Classic algorithms for bilevel optimization admit a double loop structure which is computationally expensive. Recently, several single loop algorithms have been proposed with optimizing the inner and outer variable alternatively. However, these algorithms not yet achieve fully single loop. As they overlook the loop needed to evaluate the hyper-gradient for a given inner and outer state. In order to develop a fully single loop algorithm, we first study the structure of the hyper-gradient and identify a general approximation formulation of hyper-gradient computation that encompasses several previous common approaches, e.g. back-propagation through time, conjugate gradient, etc. Based on this formulation, we introduce a new state variable to maintain the historical hyper-gradient information. Combining our new formulation with the alternative update of the inner and outer variables, we propose an efficient fully single loop algorithm. We theoretically show that the error generated by the new state can be bounded and our algorithm converges with the rate of O(ϵ^-2). Finally, we verify the efficacy our algorithm empirically through multiple bilevel optimization based machine learning tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2022

A conditional gradient homotopy method with applications to Semidefinite Programming

We propose a new homotopy-based conditional gradient method for solving ...
research
09/01/2020

Improved Bilevel Model: Fast and Optimal Algorithm with Theoretical Guarantee

Due to the hierarchical structure of many machine learning problems, bil...
research
11/23/2020

BiOpt: Bi-Level Optimization for Few-Shot Segmentation

Few-shot segmentation is a challenging task that aims to segment objects...
research
06/21/2021

BiAdam: Fast Adaptive Bilevel Optimization Methods

Bilevel optimization recently has attracted increased interest in machin...
research
01/18/2021

Randomised preconditioning for the forcing formulation of weak constraint 4D-Var

There is growing awareness that errors in the model equations cannot be ...
research
06/04/2021

Debiasing a First-order Heuristic for Approximate Bi-level Optimization

Approximate bi-level optimization (ABLO) consists of (outer-level) optim...
research
05/27/2022

Will Bilevel Optimizers Benefit from Loops

Bilevel optimization has arisen as a powerful tool for solving a variety...

Please sign up or login with your details

Forgot password? Click here to reset