A Mathematical Principle of Deep Learning: Learn the Geodesic Curve in the Wasserstein Space

02/18/2021
by   Kuo Gai, et al.

Recent studies have revealed a mathematical connection between deep neural networks (DNNs) and dynamical systems. However, the fundamental principles of DNNs have not been fully characterized from the dynamical-systems viewpoint in terms of optimization and generalization. To this end, we connect DNNs to the continuity equation, in which the measure is conserved, to model the forward propagation process of a DNN, which has not been addressed before. A DNN learns the transformation of the input distribution into the output distribution. However, in the measure space there are infinitely many curves connecting two distributions. Which one leads to good optimization and generalization for a DNN? Drawing on optimal transport theory, we find that a DNN with weight decay attempts to learn the geodesic curve in the Wasserstein space, which is induced by the optimal transport map. Compared with a plain network, ResNet is a better approximation to the geodesic curve, which explains why ResNet optimizes and generalizes better. Numerical experiments show that the data tracks of both plain networks and ResNet tend to be line-shaped in terms of the line-shape score (LSS), and that the map learned by ResNet is closer to the optimal transport map in terms of the optimal transport score (OTS). In short, we conclude that a mathematical principle of deep learning is to learn the geodesic curve in the Wasserstein space, and that deep learning is a great engineering realization of continuous transformation in high-dimensional space.
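As a rough illustration of what a geodesic curve in the Wasserstein space looks like, the sketch below works in one dimension, where the optimal transport map for quadratic cost between two equal-size samples is the monotone (sorted) matching. It is not the authors' code: the function names, the Gaussian samples, and the sample size are all assumptions made for illustration. Each particle moves along a straight line from x to T(x), and the distance travelled in the Wasserstein-2 metric grows linearly in t, which is the constant-speed property of a geodesic.

```python
import numpy as np


def ot_map_1d(source, target):
    """Monotone matching: the i-th smallest source point is sent to the
    i-th smallest target point (optimal for quadratic cost in 1D)."""
    order = np.argsort(source)
    mapped = np.empty_like(source)
    mapped[order] = np.sort(target)
    return mapped


def w2_1d(a, b):
    """Empirical Wasserstein-2 distance between equal-size 1D samples."""
    return float(np.sqrt(np.mean((np.sort(a) - np.sort(b)) ** 2)))


def displacement_interpolation(source, mapped, t):
    """Move every particle a fraction t along the straight line x -> T(x)."""
    return (1.0 - t) * source + t * mapped


# Hypothetical input and output distributions (illustrative values only).
rng = np.random.default_rng(0)
x0 = rng.normal(-2.0, 0.5, size=1000)
x1 = rng.normal(3.0, 1.5, size=1000)

mapped_x0 = ot_map_1d(x0, x1)
total = w2_1d(x0, x1)

# Fraction of the total W2 distance covered at each time t.
ratios = {}
for t in (0.25, 0.5, 0.75):
    xt = displacement_interpolation(x0, mapped_x0, t)
    ratios[t] = w2_1d(x0, xt) / total
    print(f"t={t:.2f}  W2(mu_0, mu_t) / W2(mu_0, mu_1) = {ratios[t]:.4f}")
```

In this toy setting the printed ratio matches t up to floating-point error, because the monotone map preserves the ordering of the particles. In higher dimensions the optimal transport map has no such closed form, which is where a learned map would come in.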

research
09/19/2022

The GenCol algorithm for high-dimensional optimal transport: general formulation and application to barycenters and Wasserstein splines

We extend the recently introduced genetic column generation algorithm fo...
research
07/04/2023

Fast Optimal Transport through Sliced Wasserstein Generalized Geodesics

Wasserstein distance (WD) and the associated optimal transport plan have...
research
05/21/2020

CPOT: Channel Pruning via Optimal Transport

Recent advances in deep neural networks (DNNs) lead to tremendously grow...
research
11/07/2017

Large-Scale Optimal Transport and Mapping Estimation

This paper presents a novel two-step approach for the fundamental proble...
research
05/16/2022

A scalable deep learning approach for solving high-dimensional dynamic optimal transport

The dynamic formulation of optimal transport has attracted growing inter...
research
02/09/2022

Online Learning to Transport via the Minimal Selection Principle

Motivated by robust dynamic resource allocation in operations research, ...
research
04/08/2023

Efficient Multimodal Sampling via Tempered Distribution Flow

Sampling from high-dimensional distributions is a fundamental problem in...
