On Complexity of Finding Stationary Points of Nonsmooth Nonconvex Functions

02/10/2020
by   Jingzhao Zhang, et al.
0

We provide the first non-asymptotic analysis for finding stationary points of nonsmooth, nonconvex functions. In particular, we study the class of Hadamard semi-differentiable functions, perhaps the largest class of nonsmooth functions for which the chain rule of calculus holds. This class contains important examples such as ReLU neural networks and others with non-differentiable activation functions. First, we show that finding an ϵ-stationary point with first-order methods is impossible in finite time. Therefore, we introduce the notion of (δ, ϵ)-stationarity, a generalization that allows for a point to be within distance δ of an ϵ-stationary point and reduces to ϵ-stationarity for smooth functions. We propose a series of randomized first-order methods and analyze their complexity of finding a (δ, ϵ)-stationary point. Furthermore, we provide a lower bound and show that our stochastic algorithm has min-max optimal dependence on δ. Empirically, our methods perform well for training ReLU neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/27/2020

Can We Find Near-Approximately-Stationary Points of Nonsmooth Nonconvex Functions?

It is well-known that given a bounded, smooth nonconvex function, standa...
research
04/19/2019

SSRGD: Simple Stochastic Recursive Gradient Descent for Escaping Saddle Points

We analyze stochastic gradient algorithms for optimizing nonconvex probl...
research
04/18/2021

Complexity Lower Bounds for Nonconvex-Strongly-Concave Min-Max Optimization

We provide a first-order oracle complexity lower bound for finding stati...
research
02/16/2023

Deterministic Nonsmooth Nonconvex Optimization

We study the complexity of optimizing nonsmooth nonconvex Lipschitz func...
research
10/31/2020

Efficient Methods for Structured Nonconvex-Nonconcave Min-Max Optimization

The use of min-max optimization in adversarial training of deep neural n...
research
09/06/2018

Determination of Stationary Points and Their Bindings in Dataset using RBF Methods

Stationary points of multivariable function which represents some surfac...
research
10/08/2021

Nonconvex-Nonconcave Min-Max Optimization with a Small Maximization Domain

We study the problem of finding approximate first-order stationary point...

Please sign up or login with your details

Forgot password? Click here to reset