A More Stable Accelerated Gradient Method Inspired by Continuous-Time Perspective

12/09/2021
by   Yasong Feng, et al.
0

Nesterov's accelerated gradient method (NAG) is widely used in problems with machine learning background including deep learning, and is corresponding to a continuous-time differential equation. From this connection, the property of the differential equation and its numerical approximation can be investigated to improve the accelerated gradient method. In this work we present a new improvement of NAG in terms of stability inspired by numerical analysis. We give the precise order of NAG as a numerical approximation of its continuous-time limit and then present a new method with higher order. We show theoretically that our new method is more stable than NAG for large step size. Experiments of matrix completion and handwriting digit recognition demonstrate that the stability of our new method is better. Furthermore, better stability leads to higher computational speed in experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2015

A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights

We derive a second-order ordinary differential equation (ODE) which is t...
research
06/07/2023

Comparison of SeDuMi and SDPT3 Solvers for Stability of Continuous-time Linear System

SeDuMi and SDPT3 are two solvers for solving Semi-definite Programming (...
research
03/14/2016

A Variational Perspective on Accelerated Methods in Optimization

Accelerated gradient methods play a central role in optimization, achiev...
research
05/17/2019

A Dynamical Systems Perspective on Nesterov Acceleration

We present a dynamical system framework for understanding Nesterov's acc...
research
03/04/2018

Accelerating Natural Gradient with Higher-Order Invariance

An appealing property of the natural gradient is that it is invariant to...
research
06/01/2023

A fast and accurate computation method for reflective diffraction simulations

We present a new computation method for simulating reflection high-energ...
research
06/27/2022

Zero Stability Well Predicts Performance of Convolutional Neural Networks

The question of what kind of convolutional neural network (CNN) structur...

Please sign up or login with your details

Forgot password? Click here to reset