NEON+: Accelerated Gradient Methods for Extracting Negative Curvature for Non-Convex Optimization

12/04/2017
by Yi Xu, et al.

Accelerated gradient (AG) methods are breakthroughs in convex optimization, improving the convergence rate of the gradient descent method for smooth optimization problems. However, the analysis of AG methods for non-convex optimization is still limited. It remains an open question whether AG methods from convex optimization can accelerate the convergence of the gradient descent method for finding local minima of non-convex optimization problems. This paper provides an affirmative answer to this question. In particular, we analyze two renowned variants of AG methods (namely Polyak's Heavy Ball method and Nesterov's Accelerated Gradient method) for extracting the negative curvature from random noise, which is central to escaping saddle points. By leveraging the proposed AG methods for extracting the negative curvature, we present a new AG algorithm with double loops for non-convex optimization [this is in contrast to a single-loop AG algorithm proposed in a recent manuscript (AGNON), which directly analyzed Nesterov's AG method for non-convex optimization and appeared online on November 29, 2017; we emphasize, however, that our work is independent, inspired by our earlier work (NEON17), and based on a different, novel analysis], which converges to a second-order stationary point x such that ‖∇f(x)‖ ≤ ϵ and ∇²f(x) ⪰ -√(ϵ) I with O(1/ϵ^1.75) iteration complexity, improving on that of the gradient descent method by a factor of ϵ^-0.25 and matching the best iteration complexity of second-order Hessian-free methods for non-convex optimization.
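To make the negative-curvature extraction step concrete, here is a minimal Python sketch of the idea. It is an illustrative reconstruction, not the paper's exact NEON+ procedure: the function name neon_ag_negative_curvature and the parameters eta (step size), beta (momentum), radius (noise scale), and escape_norm (blow-up threshold) are all assumptions. The sketch runs Nesterov-style AG steps on the shifted function h(u) = f(x+u) - f(x) - ⟨∇f(x), u⟩, starting from small random noise; if ∇²f(x) has a sufficiently negative eigenvalue, the iterates grow fastest along the corresponding eigenvector, so a blow-up of ‖u‖ yields an approximate negative curvature direction.

import numpy as np

def neon_ag_negative_curvature(grad, x, eta=0.01, beta=0.9, radius=1e-3,
                               max_iters=200, escape_norm=1.0, rng=None):
    # h(u) = f(x+u) - f(x) - <grad f(x), u> has gradient
    # grad(x+u) - grad(x) and the same Hessian as f near x.
    rng = np.random.default_rng() if rng is None else rng
    g_x = grad(x)
    u = rng.standard_normal(x.shape)
    u *= radius / np.linalg.norm(u)          # small random perturbation
    u_prev = u.copy()
    for _ in range(max_iters):
        y = u + beta * (u - u_prev)          # Nesterov extrapolation
        u_prev = u
        u = y - eta * (grad(x + y) - g_x)    # gradient step on h at y
        if np.linalg.norm(u) >= escape_norm: # iterates blew up along
            return u / np.linalg.norm(u)     # a negative curvature direction
    return None  # no significant negative curvature detected at x

# Toy strict saddle: f(z) = 0.5*(z1^2 - z2^2), so the origin has
# negative curvature along the second coordinate.
grad = lambda z: np.array([z[0], -z[1]])
d = neon_ag_negative_curvature(grad, np.zeros(2), rng=np.random.default_rng(0))
print(d)  # approximately ±[0, 1]

The Heavy Ball variant analyzed in the paper differs mainly in where the gradient is evaluated: at the current iterate u rather than at the extrapolated point y.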

