Deep Learning Theory Review: An Optimal Control and Dynamical Systems Perspective

08/28/2019
by Guan-Horng Liu, et al.

Attempts from different disciplines to provide a fundamental understanding of deep learning have advanced rapidly in recent years, yet progress toward a unified framework remains relatively limited. In this article, we provide one possible way to align existing branches of deep learning theory through the lens of dynamical systems and optimal control. By viewing deep neural networks as discrete-time nonlinear dynamical systems, we can analyze how information propagates through layers using mean field theory. When optimization algorithms are further recast as controllers, the ultimate goal of the training process can be formulated as an optimal control problem. In addition, we can reveal convergence and generalization properties by studying the stochastic dynamics of optimization algorithms. This viewpoint covers a wide range of theoretical studies, from the information bottleneck to statistical physics. It also provides a principled way to tune hyper-parameters when optimal control theory is introduced. Our framework fits nicely with supervised learning and can be extended with little effort to other learning problems, such as Bayesian learning, adversarial training, and specific forms of meta learning. The review aims to shed light on the importance of dynamics and optimal control when developing deep learning theory.
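To make the dynamical-systems reading concrete, here is a minimal NumPy sketch, not taken from the paper. It treats each layer as one step of the discrete-time dynamics x_{t+1} = f(x_t, θ_t), treats the loss as a terminal cost, and treats plain gradient descent as the "controller" that updates the parameters θ_t. The tanh layer, the squared-error cost, and all function names (`layer_step`, `rollout`, `costate_gradients`, `train`, `make_problem`) are illustrative assumptions.

```python
import numpy as np

def layer_step(x, theta):
    """One layer as a dynamics step x_{t+1} = tanh(W x_t + b) (assumed form)."""
    W, b = theta
    return np.tanh(W @ x + b)

def rollout(x0, thetas):
    """Propagate the state through every layer and keep the whole trajectory."""
    traj = [x0]
    for theta in thetas:
        traj.append(layer_step(traj[-1], theta))
    return traj

def terminal_cost(x_T, target):
    """Squared-error cost on the final state (an illustrative choice)."""
    return 0.5 * float(np.sum((x_T - target) ** 2))

def costate_gradients(traj, thetas, target):
    """Backward recursion on the costate p_t = dCost/dx_t (backpropagation)."""
    p = traj[-1] - target                      # p_T = gradient of the terminal cost
    grads = []
    for t in reversed(range(len(thetas))):
        W, _ = thetas[t]
        delta = p * (1.0 - traj[t + 1] ** 2)   # pass p_{t+1} through tanh'
        grads.append((np.outer(delta, traj[t]), delta))  # (dCost/dW_t, dCost/db_t)
        p = W.T @ delta                        # p_t
    return grads[::-1]

def train(x0, target, thetas, lr=0.01, steps=20):
    """Gradient descent as a simple controller acting on the parameters."""
    costs = []
    for _ in range(steps):
        traj = rollout(x0, thetas)
        costs.append(terminal_cost(traj[-1], target))
        grads = costate_gradients(traj, thetas, target)
        thetas = [(W - lr * gW, b - lr * gb)
                  for (W, b), (gW, gb) in zip(thetas, grads)]
    costs.append(terminal_cost(rollout(x0, thetas)[-1], target))
    return thetas, costs

def make_problem(width=4, depth=3, seed=0):
    """Small random instance; sizes and seed are arbitrary assumptions."""
    rng = np.random.default_rng(seed)
    thetas = [(rng.normal(size=(width, width)) / np.sqrt(width), np.zeros(width))
              for _ in range(depth)]
    x0 = rng.normal(size=width)
    target = np.tanh(rng.normal(size=width))   # keep the target reachable by tanh
    return x0, target, thetas

if __name__ == "__main__":
    x0, target, thetas = make_problem()
    _, costs = train(x0, target, thetas)
    print("initial cost:", costs[0], "final cost:", costs[-1])
```

The backward pass is written as a costate recursion to mirror the optimal control vocabulary. It computes the same gradients as ordinary backpropagation for this toy network.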

