Non-Convex Bilevel Optimization with Time-Varying Objective Functions

08/07/2023
by   Sen Lin, et al.
0

Bilevel optimization has become a powerful tool in a wide variety of machine learning problems. However, the current nonconvex bilevel optimization considers an offline dataset and static functions, which may not work well in emerging online applications with streaming data and time-varying functions. In this work, we study online bilevel optimization (OBO) where the functions can be time-varying and the agent continuously updates the decisions with online streaming data. To deal with the function variations and the unavailability of the true hypergradients in OBO, we propose a single-loop online bilevel optimizer with window averaging (SOBOW), which updates the outer-level decision based on a window average of the most recent hypergradient estimations stored in the memory. Compared to existing algorithms, SOBOW is computationally efficient and does not need to know previous functions. To handle the unique technical difficulties rooted in single-loop update and function variations for OBO, we develop a novel analytical technique that disentangles the complex couplings between decision variables, and carefully controls the hypergradient estimation error. We show that SOBOW can achieve a sublinear bilevel local regret under mild conditions. Extensive experiments across multiple domains corroborate the effectiveness of SOBOW.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2021

Delay-Tolerant Constrained OCO with Application to Network Resource Allocation

We consider online convex optimization (OCO) with multi-slot feedback de...
research
05/19/2022

Augmented Lagrangian Methods for Time-varying Constrained Online Convex Optimization

In this paper, we consider online convex optimization (OCO) with time-va...
research
01/31/2020

Centralized and distributed online learning for sparse time-varying optimization

The development of online algorithms to track time-varying systems has d...
research
10/14/2020

Boosting One-Point Derivative-Free Online Optimization via Residual Feedback

Zeroth-order optimization (ZO) typically relies on two-point feedback to...
research
03/03/2021

Periodic Updates for Constrained OCO with Application to Large-Scale Multi-Antenna Systems

In many dynamic systems, such as wireless communications, decisions on s...
research
10/30/2022

One Gradient Frank-Wolfe for Decentralized Online Convex and Submodular Optimization

Decentralized learning has been studied intensively in recent years moti...
research
10/17/2019

Optimization and Learning with Information Streams: Time-varying Algorithms and Applications

There is a growing cross-disciplinary effort in the broad domain of opti...

Please sign up or login with your details

Forgot password? Click here to reset