Local Saddle Point Optimization: A Curvature Exploitation Approach

05/15/2018
by   Leonard Adolphs, et al.
0

Gradient-based optimization methods are the most popular choice for finding local optima for classical minimization and saddle point problems. Here, we highlight a systemic issue of gradient dynamics that arise for saddle point problems, namely the presence of undesired stable stationary points that are no local optima. We propose a novel optimization approach that exploits curvature information in order to escape from these undesired stationary points. We prove that different optimization methods, including gradient method and adagrad, equipped with curvature exploitation can escape non-optimal stationary points. We also provide empirical results on common saddle point problems which confirm the advantage of using curvature exploitation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2018

KF-LAX: Kronecker-factored curvature estimation for control variate optimization in reinforcement learning

A key challenge for gradient based optimization methods in model-free re...
research
03/23/2023

Optimization Dynamics of Equivariant and Augmented Neural Networks

We investigate the optimization of multilayer perceptrons on symmetric d...
research
02/08/2022

Efficiently Escaping Saddle Points in Bilevel Optimization

Bilevel optimization is one of the fundamental problems in machine learn...
research
05/15/2020

Sobolev Gradients for the Möbius Energy

Aiming at optimizing the shape of closed embedded curves within prescrib...
research
12/24/2013

3D Interest Point Detection via Discriminative Learning

The task of detecting the interest points in 3D meshes has typically bee...
research
01/05/2014

Schatten-p Quasi-Norm Regularized Matrix Optimization via Iterative Reweighted Singular Value Minimization

In this paper we study general Schatten-p quasi-norm (SPQN) regularized ...
research
05/31/2018

On Curvature-aided Incremental Aggregated Gradient Methods

This paper studies an acceleration technique for incremental aggregated ...

Please sign up or login with your details

Forgot password? Click here to reset