Variational formulations of ODE-Net as a mean-field optimal control problem and existence results

03/09/2023
by   Noboru Isobe, et al.
0

This paper presents a mathematical analysis of ODE-Net, a continuum model of deep neural networks (DNNs). In recent years, Machine Learning researchers have introduced ideas of replacing the deep structure of DNNs with ODEs as a continuum limit. These studies regard the "learning" of ODE-Net as the minimization of a "loss" constrained by a parametric ODE. Although the existence of a minimizer for this minimization problem needs to be assumed, only a few studies have investigated its existence analytically in detail. In the present paper, the existence of a minimizer is discussed based on a formulation of ODE-Net as a measure-theoretic mean-field optimal control problem. The existence result is proved when a neural network, which describes a vector field of ODE-Net, is linear with respect to learnable parameters. The proof employs the measure-theoretic formulation combined with the direct method of Calculus of Variations. Secondly, an idealized minimization problem is proposed to remove the above linearity assumption. Such a problem is inspired by a kinetic regularization associated with the Benamou–Brenier formula and universal approximation theorems for neural networks. The proofs of these existence results use variational methods, differential equations, and mean-field optimal control theory. They will stand for a new analytic way to investigate the learning process of deep neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2018

A Mean-Field Optimal Control Formulation of Deep Learning

Recent work linking deep neural networks and dynamical systems opened up...
research
05/17/2023

The mathematical theory of Hughes' model: a survey of results

We provide an overview of the results on Hughes' model for pedestrian mo...
research
07/02/2020

Optimal control of mean field equations with monotone coefficients and applications in neuroscience

We are interested in the optimal control problem associated with certain...
research
06/01/2019

A mean-field limit for certain deep neural networks

Understanding deep neural networks (DNNs) is a key challenge in the theo...
research
10/15/2020

Optimal control and stablilization for linear continuous-time mean-field systems with delay

This paper studies optimal control and stabilization problems for contin...
research
09/13/2021

On the regularized risk of distributionally robust learning over deep neural networks

In this paper we explore the relation between distributionally robust le...
research
04/07/2019

Information Bottleneck and its Applications in Deep Learning

Information Theory (IT) has been used in Machine Learning (ML) from earl...

Please sign up or login with your details

Forgot password? Click here to reset