An Optimal Control Approach to Deep Learning and Applications to Discrete-Weight Neural Networks

03/04/2018
by   Qianxiao Li, et al.
0

Deep learning is formulated as a discrete-time optimal control problem. This allows one to characterize necessary conditions for optimality and develop training algorithms that do not rely on gradients with respect to the trainable parameters. In particular, we introduce the discrete-time method of successive approximations (MSA), which is based on the Pontryagin's maximum principle, for training neural networks. A rigorous error estimate for the discrete MSA is obtained, which sheds light on its dynamics and the means to stabilize the algorithm. The developed methods are applied to train, in a rather principled way, neural networks with weights that are constrained to take values in a discrete set. We obtain competitive performance and interestingly, very sparse weights in the case of ternary networks, which may be useful in model deployment in low-memory devices.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2019

Deep learning as optimal control problems: models and numerical methods

We consider recent work of Haber and Ruthotto 2017 and Chang et al. 2018...
research
10/26/2017

Maximum Principle Based Algorithms for Deep Learning

The continuous dynamical system approach to deep learning is explored in...
research
08/28/2019

Deep Learning Theory Review: An Optimal Control and Dynamical Systems Perspective

Attempts from different disciplines to provide a fundamental understandi...
research
12/15/2022

Morse index and determinant of block Jacobi matrices via optimal control

We describe the relation between block Jacobi matrices and minimization ...
research
05/31/2019

Updates of Equilibrium Prop Match Gradients of Backprop Through Time in an RNN with Static Input

Equilibrium Propagation (EP) is a biologically inspired learning algorit...
research
10/15/2018

An Optimal Control Approach to Sequential Machine Teaching

Given a sequential learning algorithm and a target model, sequential mac...
research
03/16/2023

Predicting discrete-time bifurcations with deep learning

Many natural and man-made systems are prone to critical transitions – ab...

Please sign up or login with your details

Forgot password? Click here to reset