Multilevel Minimization for Deep Residual Networks

04/13/2020
by   Lisa Gaedke-Merzhäuser, et al.
0

We present a new multilevel minimization framework for the training of deep residual networks (ResNets), which has the potential to significantly reduce training time and effort. Our framework is based on the dynamical system's viewpoint, which formulates a ResNet as the discretization of an initial value problem. The training process is then formulated as a time-dependent optimal control problem, which we discretize using different time-discretization parameters, eventually generating multilevel-hierarchy of auxiliary networks with different resolutions. The training of the original ResNet is then enhanced by training the auxiliary networks with reduced resolutions. By design, our framework is conveniently independent of the choice of the training strategy chosen on each level of the multilevel hierarchy. By means of numerical examples, we analyze the convergence behavior of the proposed method and demonstrate its robustness. For our examples we employ a multilevel gradient-based methods. Comparisons with standard single level methods show a speedup of more than factor three while achieving the same validation accuracy.

READ FULL TEXT
research
12/11/2018

Layer-Parallel Training of Deep Residual Neural Networks

Residual neural networks (ResNets) are a promising class of deep neural ...
research
12/19/2019

Multilevel Initialization for Layer-Parallel Deep Neural Network Training

This paper investigates multilevel initialization strategies for trainin...
research
06/28/2020

A Multilevel Approach to Training

We propose a novel training method based on nonlinear multilevel minimiz...
research
03/01/2019

Recursive multilevel trust region method with application to fully monolithic phase-field models of brittle fracture

The simulation of crack initiation and propagation in an elastic materia...
research
10/15/2022

A multilevel reinforcement learning framework for PDE based control

Reinforcement learning (RL) is a promising method to solve control probl...
research
03/26/2021

Multilevel Active-Set Trust-Region (MASTR) Method for Bound Constrained Minimization

We introduce a novel variant of the recursive multilevel trust-region (R...
research
06/14/2018

Multilevel Artificial Neural Network Training for Spatially Correlated Learning

Multigrid modeling algorithms are a technique used to accelerate relaxat...

Please sign up or login with your details

Forgot password? Click here to reset