Stochastic Bilevel Distributed Optimization over a Network

06/30/2022
by   Hongchang Gao, et al.
0

Bilevel optimization has been applied to a wide variety of machine learning models. Numerous stochastic bilevel optimization algorithms have been developed in recent years. However, most of them restrict their focus on the single-machine setting so that they are incapable of handling the distributed data. To address this issue, under the setting where all participants compose a network and perform the peer-to-peer communication in this network, we developed two novel distributed stochastic bilevel optimization algorithms based on the gradient tracking communication mechanism and two different gradient estimators. Additionally, we show that they can achieve O(1/ϵ^2(1-λ)^2) and O(1/ϵ^3/2(1-λ)^2) convergence rate respectively to obtain the ϵ-accuracy solution, where 1-λ denotes the spectral gap of the communication network. To our knowledge, this is the first work achieving these theoretical results. Finally, we applied our algorithms to practical machine learning models, and the experimental results confirmed the efficacy of our algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2023

Stochastic Multi-Level Compositional Optimization Algorithms over Networks with Level-Independent Convergence Rate

Stochastic multi-level compositional optimization problems cover many ne...
research
04/24/2023

Can Decentralized Stochastic Minimax Optimization Algorithms Converge Linearly for Finite-Sum Nonconvex-Nonconcave Problems?

Decentralized minimax optimization has been actively studied in the past...
research
09/12/2019

Communication-Efficient Distributed Optimization in Networks with Gradient Tracking

There is a growing interest in large-scale machine learning and optimiza...
research
05/04/2022

Babel: A Framework for Developing Performant and Dependable Distributed Protocols

Prototyping and implementing distributed algorithms, particularly those ...
research
01/02/2020

Stochastic Gradient Langevin Dynamics on a Distributed Network

Langevin MCMC gradient optimization is a class of increasingly popular m...
research
02/20/2017

Hemingway: Modeling Distributed Optimization Algorithms

Distributed optimization algorithms are widely used in many industrial m...
research
07/25/2023

Achieving Linear Speedup in Decentralized Stochastic Compositional Minimax Optimization

The stochastic compositional minimax problem has attracted a surge of at...

Please sign up or login with your details

Forgot password? Click here to reset