Optimizing Deep Neural Networks via Discretization of Finite-Time Convergent Flows

10/06/2020
by   Mouhacine Benosman, et al.
0

In this paper, we investigate in the context of deep neural networks, the performance of several discretization algorithms for two first-order finite-time optimization flows. These flows are, namely, the rescaled-gradient flow (RGF) and the signed-gradient flow (SGF), and consist of non-Lipscthiz or discontinuous dynamical systems that converge locally in finite time to the minima of gradient-dominated functions. We introduce three discretization methods for these first-order finite-time flows, and provide convergence guarantees. We then apply the proposed algorithms in training neural networks and empirically test their performances on three standard datasets, namely, CIFAR10, SVHN, and MNIST. Our results show that our schemes demonstrate faster convergences against standard optimization alternatives, while achieving equivalent or better accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2017

Predictive networking and optimization for flow-based networks

Artificial Neural Networks (ANNs) were used to classify neural network f...
research
09/29/2022

From geodesic extrapolation to a variational BDF2 scheme for Wasserstein gradient flows

We introduce a time discretization for Wasserstein gradient flows based ...
research
06/19/2020

An Ode to an ODE

We present a new paradigm for Neural ODE algorithms, calledODEtoODE, whe...
research
11/11/2020

Gradient discretization of two-phase poro-mechanical models with discontinuous pressures at matrix fracture interfaces

We consider a two-phase Darcy flow in a fractured and deformable porous ...
research
12/02/2021

Breaking the Convergence Barrier: Optimization via Fixed-Time Convergent Flows

Accelerated gradient methods are the cornerstones of large-scale, data-d...
research
02/22/2023

From Optimization to Sampling Through Gradient Flows

This article overviews how gradient flows, and discretizations thereof, ...
research
07/27/2018

On the overfly algorithm in deep learning of neural networks

In this paper we investigate the supervised backpropagation training of ...

Please sign up or login with your details

Forgot password? Click here to reset