Hessian-Free High-Resolution Nesterov Accelerationfor Sampling

06/16/2020
by   Ruilin Li, et al.
0

We propose an accelerated-gradient-based MCMC method. It relies on a modification of the Nesterov's accelerated gradient method for strongly convex functions (NAG-SC): We first reformulate NAG-SC as a Hessian-Free High-Resolution ODE, then release the high-resolution coefficient as a free hyperparameter, and finally inject appropriate noise and discretize the diffusion process. Accelerated sampling enabled by this new hyperparameter is not only experimentally demonstrated on several learning tasks, but also theoretically quantified, both at the continuous level and after discretization. For (not-necessarily-strongly-) convex and L-smooth potentials, exponential convergence in χ^2 divergence is proved, with a rate analogous to state-of-the-art results of underdamped Langevin dynamics, plus an additional acceleration. At the same time, the method also works for nonconvex potentials, for which we also establish exponential convergence as long as the potential satisfies a Poincaré inequality.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/16/2020

Hessian-Free High-Resolution Nesterov Acceleration for Sampling

We propose an accelerated-gradient-based MCMC method. It relies on a mod...
research
10/21/2018

Understanding the Acceleration Phenomenon via High-Resolution Differential Equations

Gradient-based optimization algorithms can be studied from the perspecti...
research
12/17/2021

Unadjusted Langevin algorithm for sampling a mixture of weakly smooth potentials

Discretization of continuous-time diffusion processes is a widely recogn...
research
01/27/2022

From the Ravine method to the Nesterov method and vice versa: a dynamical system perspective

We revisit the Ravine method of Gelfand and Tsetlin from a dynamical sys...
research
02/16/2023

Improved Discretization Analysis for Underdamped Langevin Monte Carlo

Underdamped Langevin Monte Carlo (ULMC) is an algorithm used to sample f...
research
10/03/2022

On Stability and Generalization of Bilevel Optimization Problem

(Stochastic) bilevel optimization is a frequently encountered problem in...
research
06/16/2023

Linear convergence of Nesterov-1983 with the strong convexity

For modern gradient-based optimization, a developmental landmark is Nest...

Please sign up or login with your details

Forgot password? Click here to reset