A Continuized View on Nesterov Acceleration

02/11/2021
by   Raphaël Berthier, et al.
0

We introduce the "continuized" Nesterov acceleration, a close variant of Nesterov acceleration whose variables are indexed by a continuous time parameter. The two variables continuously mix following a linear ordinary differential equation and take gradient steps at random times. This continuized variant benefits from the best of the continuous and the discrete frameworks: as a continuous process, one can use differential calculus to analyze convergence and obtain analytical expressions for the parameters; but a discretization of the continuized process can be computed exactly with convergence rates similar to those of Nesterov original acceleration. We show that the discretization has the same structure as Nesterov acceleration, but with random parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2021

A Continuized View on Nesterov Acceleration for Stochastic Gradient Descent and Randomized Gossip

We introduce the continuized Nesterov acceleration, a close variant of N...
research
05/17/2019

A Dynamical Systems Perspective on Nesterov Acceleration

We present a dynamical system framework for understanding Nesterov's acc...
research
03/04/2015

A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights

We derive a second-order ordinary differential equation (ODE) which is t...
research
10/29/2020

Convergence of Constrained Anderson Acceleration

We prove non asymptotic linear convergence rates for the constrained And...
research
05/01/2018

Direct Runge-Kutta Discretization Achieves Acceleration

We study gradient-based optimization methods obtained by directly discre...
research
07/19/2017

Acceleration and Averaging in Stochastic Mirror Descent Dynamics

We formulate and study a general family of (continuous-time) stochastic ...
research
07/04/2020

Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion

Langevin diffusion is a powerful method for nonconvex optimization, whic...

Please sign up or login with your details

Forgot password? Click here to reset