Differential Dynamic Programming Neural Optimizer

02/20/2020
by Guan-Horng Liu, et al.

Interpreting the training of Deep Neural Networks (DNNs) as an optimal control problem with nonlinear dynamical systems has received considerable attention recently, yet algorithmic development along this line remains relatively limited. In this work, we take a step in this direction by reformulating the training procedure from a trajectory optimization perspective. We first show that most widely used algorithms for training DNNs can be linked to Differential Dynamic Programming (DDP), a celebrated second-order trajectory optimization algorithm rooted in Approximate Dynamic Programming. In this vein, we propose a new variant of DDP that supports batch optimization for training feedforward networks while integrating naturally with recent progress in curvature approximation. The resulting algorithm features layer-wise feedback policies, which improve the convergence rate and reduce sensitivity to hyperparameters compared with existing methods. We show that the algorithm is competitive with state-of-the-art first- and second-order methods. Our work opens up new avenues for principled algorithmic design built upon optimal control theory.
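
For orientation, the optimal-control view described above can be written out in standard DDP notation. This is a minimal sketch, not the paper's own formulation: the symbols are assumptions introduced here ($x_t$ for the activation entering layer $t$, $u_t$ for that layer's weights, $f_t$ for the layer map, $\ell_t$ and $\phi$ for the per-layer and terminal costs, $V_t$ for the value function), and the batch handling and curvature approximation mentioned in the abstract are not shown.

```latex
\begin{aligned}
&\min_{u_0,\dots,u_{T-1}} \; \phi(x_T) + \sum_{t=0}^{T-1} \ell_t(x_t, u_t)
  \quad \text{s.t.} \quad x_{t+1} = f_t(x_t, u_t), \\[4pt]
&Q_t(x_t, u_t) = \ell_t(x_t, u_t) + V_{t+1}\bigl(f_t(x_t, u_t)\bigr),
  \qquad V_T(x_T) = \phi(x_T), \\[4pt]
&\delta u_t^{*}(\delta x_t) = \mathbf{k}_t + \mathbf{K}_t\,\delta x_t,
  \qquad
  \mathbf{k}_t = -\bigl(Q_{uu,t}\bigr)^{-1} Q_{u,t},
  \quad
  \mathbf{K}_t = -\bigl(Q_{uu,t}\bigr)^{-1} Q_{ux,t}.
\end{aligned}
```

Here $Q_{u,t}$, $Q_{uu,t}$ and $Q_{ux,t}$ denote the first and second derivatives of $Q_t$ evaluated along the nominal forward pass. The gain $\mathbf{K}_t$ is what makes the update a layer-wise feedback policy in this sketch: the change applied to layer $t$'s weights depends on the deviation $\delta x_t$ of the activation entering that layer from its nominal value.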


research
07/17/2020

A Differential Game Theoretic Neural Optimizer for Training Residual Networks

Connections between Deep Neural Networks (DNNs) training and optimal con...
research
04/07/2022

Parameterized Differential Dynamic Programming

Differential Dynamic Programming (DDP) is an efficient trajectory optimi...
research
05/08/2021

Dynamic Game Theoretic Neural Optimizer

The connection between training deep neural networks (DNNs) and optimal ...
research
09/29/2021

Second-Order Neural ODE Optimizer

We propose a novel second-order optimization framework for training the ...
research
09/14/2023

A Unified Perspective on Multiple Shooting In Differential Dynamic Programming

Differential Dynamic Programming (DDP) is an efficient computational too...
research
09/02/2022

Differentiable Optimal Control via Differential Dynamic Programming

Robot design optimization, imitation learning and system identification ...
research
03/08/2022

Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems

Optimal control (OC) algorithms such as Differential Dynamic Programming...
