Gradient-based Bi-level Optimization for Deep Learning: A Survey

07/24/2022
by   Can Chen, et al.
2

Bi-level optimization, especially the gradient-based category, has been widely used in the deep learning community including hyperparameter optimization and meta-knowledge extraction. Bi-level optimization embeds one problem within another and the gradient-based category solves the outer level task by computing the hypergradient, which is much more efficient than classical methods such as the evolutionary algorithm. In this survey, we first give a formal definition of the gradient-based bi-level optimization. Secondly, we illustrate how to formulate a research problem as a bi-level optimization problem, which is of great practical use for beginners. More specifically, there are two formulations: the single-task formulation to optimize hyperparameters such as regularization parameters and the distilled data, and the multi-task formulation to extract meta knowledge such as the model initialization. With a bi-level formulation, we then discuss four bi-level optimization solvers to update the outer variable including explicit gradient update, proxy update, implicit function update, and closed-form update. Last but not least, we conclude the survey by pointing out the great potential of gradient-based bi-level optimization on science problems (AI4Science).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/16/2021

A Generic Descent Aggregation Framework for Gradient-based Bi-level Optimization

In recent years, gradient-based methods for solving bi-level optimizatio...
research
07/27/2020

Stabilizing Bi-Level Hyperparameter Optimization using Moreau-Yosida Regularization

This research proposes to use the Moreau-Yosida envelope to stabilize th...
research
07/02/2023

Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data

This paper addresses the ethical concerns arising from the use of unauth...
research
01/26/2023

Open Problems in Applied Deep Learning

This work formulates the machine learning mechanism as a bi-level optimi...
research
01/27/2021

Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond

Bi-Level Optimization (BLO) is originated from the area of economic game...
research
01/31/2022

Dynamic Origin-Destination Matrix Estimation in Urban Traffic Networks

Given the counters of vehicles that traverse the roads of a traffic netw...
research
12/18/2017

A Bridge Between Hyperparameter Optimization and Larning-to-learn

We consider a class of a nested optimization problems involving inner an...

Please sign up or login with your details

Forgot password? Click here to reset