A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization

02/15/2021
by Prashant Khanduri, et al.

This paper proposes a new algorithm, the Momentum-assisted Single-timescale Stochastic Approximation (MSTSA), for tackling unconstrained bilevel optimization problems. We focus on bilevel problems where the lower-level subproblem is strongly convex. Unlike prior works, which rely on two-timescale or double-loop techniques that track the optimal solution to the lower-level subproblem, we design a stochastic momentum-assisted gradient estimator for the upper-level subproblem's updates. This estimator allows us to gradually control the error in the stochastic gradient updates caused by an inaccurate solution to the lower-level subproblem. We show that if the upper-level objective function is smooth but possibly non-convex (resp. strongly convex), MSTSA requires 𝒪(ϵ^{-2}) (resp. 𝒪(ϵ^{-1})) iterations, each using a constant number of samples, to find an ϵ-stationary (resp. ϵ-optimal) solution. This achieves the best-known guarantees for stochastic bilevel problems. We validate our theoretical results by showing the efficiency of the MSTSA algorithm on hyperparameter optimization and data hyper-cleaning problems.
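The abstract does not spell out the update rule, so the following is only a minimal sketch of what a single-timescale loop with a momentum-assisted upper-level gradient estimator might look like. Everything in it is an assumption made for illustration: the toy quadratic problem, the additive Gaussian noise model, the constant step sizes `alpha`, `beta`, the momentum weight `eta`, the recursive (variance-reduction style) form of the estimator, and the function name `run_sketch`. It is not the authors' implementation.

```python
import numpy as np


def run_sketch(num_iters=3000, alpha=0.05, beta=0.2, eta=0.1, noise=0.1, seed=0):
    """Toy single-timescale bilevel loop (illustrative assumptions throughout).

    Lower level: g(x, y) = 0.5 * ||y - A x||^2, strongly convex in y, y*(x) = A x.
    Upper level: f(x, y) = 0.5 * ||y - b||^2 + 0.5 * lam * ||x||^2.
    """
    rng = np.random.default_rng(seed)
    A = np.array([[1.0, 0.5], [0.0, 1.0], [0.5, -1.0]])
    b = np.array([1.0, -1.0, 0.5])
    lam = 0.1

    def lower_grad(x, y, xi):
        # Noisy gradient of g with respect to y.
        return (y - A @ x) + xi

    def upper_grad(x, y, xi):
        # Noisy hypergradient surrogate evaluated at the current (inexact) y.
        # For this toy problem the Hessian of g in y is the identity, so the
        # surrogate reduces to lam * x + A^T (y - b).
        return lam * x + A.T @ (y - b) + xi

    x = np.zeros(2)
    y = np.zeros(3)
    d = upper_grad(x, y, noise * rng.standard_normal(2))  # initial estimate

    for _ in range(num_iters):
        # Both variables move once per iteration with step sizes of the same
        # order (no inner loop, no separate fast/slow timescales).
        y_new = y - beta * lower_grad(x, y, noise * rng.standard_normal(3))
        x_new = x - alpha * d

        # Momentum-assisted estimator: the same sample xi is evaluated at the
        # new and the previous iterate, and the previous estimate is corrected
        # by their difference.
        xi = noise * rng.standard_normal(2)
        d = upper_grad(x_new, y_new, xi) + (1.0 - eta) * (d - upper_grad(x, y, xi))

        x, y = x_new, y_new

    # Closed-form minimizer of f(x, A x), available only because the toy
    # problem is quadratic; used here as a reference point.
    x_star = np.linalg.solve(A.T @ A + lam * np.eye(2), A.T @ b)
    return x, y, x_star, A


if __name__ == "__main__":
    x, y, x_star, A = run_sketch()
    print("distance to reference solution:", np.linalg.norm(x - x_star))
    print("lower-level tracking error:", np.linalg.norm(y - A @ x))
```

With constant step sizes the iterates settle in a noise-dominated neighborhood of the reference solution rather than converging exactly; a decaying schedule would be needed to study the rates quoted in the abstract.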

research
08/15/2023

Projection-Free Methods for Stochastic Simple Bilevel Optimization with Convex Lower-level Problem

In this paper, we study a class of stochastic bilevel optimization probl...
research
02/09/2021

A Single-Timescale Stochastic Bilevel Optimization Method

Stochastic bilevel optimization generalizes the classic stochastic optim...
research
06/17/2022

Generalized Frank-Wolfe Algorithm for Bilevel Optimization

In this paper, we study a class of bilevel optimization problems, also k...
research
01/26/2023

A Fully First-Order Method for Stochastic Bilevel Optimization

We consider stochastic unconstrained bilevel optimization problems when ...
research
02/01/2022

DoCoM-SGT: Doubly Compressed Momentum-assisted Stochastic Gradient Tracking Algorithm for Communication Efficient Decentralized Learning

This paper proposes the Doubly Compressed Momentum-assisted Stochastic G...
research
09/04/2023

On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

In this work, we study first-order algorithms for solving Bilevel Optimi...
