Escape saddle points by a simple gradient-descent based algorithm

11/28/2021
by   Chenyi Zhang, et al.
2

Escaping saddle points is a central research topic in nonconvex optimization. In this paper, we propose a simple gradient-based algorithm such that for a smooth function fℝ^n→ℝ, it outputs an ϵ-approximate second-order stationary point in Õ(log n/ϵ^1.75) iterations. Compared to the previous state-of-the-art algorithms by Jin et al. with Õ((log n)^4/ϵ^2) or Õ((log n)^6/ϵ^1.75) iterations, our algorithm is polynomially better in terms of log n and matches their complexities in terms of 1/ϵ. For the stochastic setting, our algorithm outputs an ϵ-approximate second-order stationary point in Õ((log n)^2/ϵ^4) iterations. Technically, our main contribution is an idea of implementing a robust Hessian power method using only gradients, which can find negative curvature near saddle points and achieve the polynomial speedup in log n compared to the perturbed gradient descent methods. Finally, we also perform numerical experiments that support our results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2020

Quantum Algorithms for Escaping from Saddle Points

We initiate the study of quantum algorithms for escaping from saddle poi...
research
04/19/2019

SSRGD: Simple Stochastic Recursive Gradient Descent for Escaping Saddle Points

We analyze stochastic gradient algorithms for optimizing nonconvex probl...
research
02/13/2019

Stochastic Gradient Descent Escapes Saddle Points Efficiently

This paper considers the perturbed stochastic gradient descent algorithm...
research
03/02/2017

How to Escape Saddle Points Efficiently

This paper shows that a perturbed form of gradient descent converges to ...
research
02/08/2022

Efficiently Escaping Saddle Points in Bilevel Optimization

Bilevel optimization is one of the fundamental problems in machine learn...
research
09/06/2018

Escaping Saddle Points in Constrained Optimization

In this paper, we focus on escaping from saddle points in smooth nonconv...
research
07/01/2018

A polynomial time log barrier method for problems with nonconvex constraints

Interior point methods (IPMs) that handle nonconvex constraints such as ...

Please sign up or login with your details

Forgot password? Click here to reset