A Differential Game Theoretic Neural Optimizer for Training Residual Networks

07/17/2020
by   Guan-Horng Liu, et al.
0

Connections between Deep Neural Networks (DNNs) training and optimal control theory has attracted considerable attention as a principled tool of algorithmic design. Differential Dynamic Programming (DDP) neural optimizer is a recently proposed method along this line. Despite its empirical success, the applicability has been limited to feedforward networks and whether such a trajectory-optimization inspired framework can be extended to modern architectures remains unclear. In this work, we derive a generalized DDP optimizer that accepts both residual connections and convolution layers. The resulting optimal control representation admits a game theoretic perspective, in which training residual networks can be interpreted as cooperative trajectory optimization on state-augmented dynamical systems. This Game Theoretic DDP (GT-DDP) optimizer enjoys the same theoretic connection in previous work, yet generates a much complex update rule that better leverages available information during network propagation. Evaluation on image classification datasets (e.g. MNIST and CIFAR100) shows an improvement in training convergence and variance reduction over existing methods. Our approach highlights the benefit gained from architecture-aware optimization.

READ FULL TEXT
research
05/08/2021

Dynamic Game Theoretic Neural Optimizer

The connection between training deep neural networks (DNNs) and optimal ...
research
02/20/2020

Differential Dynamic Programming Neural Optimizer

Interpretation of Deep Neural Networks (DNNs) training as an optimal con...
research
08/28/2019

Deep Learning Theory Review: An Optimal Control and Dynamical Systems Perspective

Attempts from different disciplines to provide a fundamental understandi...
research
02/07/2019

Predict Globally, Correct Locally: Parallel-in-Time Optimal Control of Neural Networks

The links between optimal control of dynamical systems and neural networ...
research
08/31/2023

On a Connection between Differential Games, Optimal Control, and Energy-based Models for Multi-Agent Interactions

Game theory offers an interpretable mathematical framework for modeling ...
research
12/30/2021

Parallel Network Flow Allocation in Repeated Routing Games via LQR Optimal Control

In this article, we study the repeated routing game problem on a paralle...
research
03/03/2020

Underactuated Waypoint Trajectory Optimization for Light Painting Photography

Despite their abundance in robotics and nature, underactuated systems re...

Please sign up or login with your details

Forgot password? Click here to reset