Variational Characterizations of Local Entropy and Heat Regularization in Deep Learning

01/29/2019
by Nicolas Garcia Trillos, et al.

The aim of this paper is to provide new theoretical and computational understanding of two loss regularizations employed in deep learning, known as local entropy and heat regularization. For both regularized losses we introduce variational characterizations that naturally suggest a two-step optimization scheme, based on iteratively shifting a probability density and computing a best Gaussian approximation in Kullback-Leibler divergence. In this unified view, the optimization schemes for the local entropy and heat regularized losses differ only in which argument of the Kullback-Leibler divergence is minimized over to find the best Gaussian approximation. Local entropy corresponds to minimizing over the second argument, and the solution is given by moment matching. This makes it possible to replace the traditional back-propagation calculation of gradients with sampling algorithms, opening an avenue for gradient-free, parallelizable training of neural networks.
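
The two-step idea for local entropy can be illustrated with a minimal sketch. The abstract gives no formulas, so everything here is an assumption made for illustration: the shifted density is taken to be proportional to exp(-f(x') - |x' - x|^2 / (2 tau)), the Gaussian family is taken to have fixed covariance tau I so that moment matching reduces to matching the mean, and the mean is estimated by self-normalized importance sampling. The names `local_entropy_step`, `minimize_local_entropy`, `tau`, and `n_samples` are hypothetical and do not come from the paper.

```python
import numpy as np


def local_entropy_step(f, x, tau, n_samples, rng):
    """One gradient-free update: return an estimate of the mean of the
    density proportional to exp(-f(x') - |x' - x|^2 / (2 * tau)).

    Assumption: the mean is estimated by self-normalized importance
    sampling with proposal N(x, tau * I); the paper may use other samplers.
    """
    # Step 1: sample from the Gaussian centered at the current iterate.
    samples = x + np.sqrt(tau) * rng.standard_normal((n_samples, x.size))
    # Loss evaluations are independent, so this loop could run in parallel.
    losses = np.array([f(s) for s in samples])
    # Weights proportional to exp(-f); subtracting the minimum loss avoids
    # underflow and cancels after normalization.
    weights = np.exp(-(losses - losses.min()))
    weights /= weights.sum()
    # Step 2: moment matching with fixed covariance, i.e. the weighted mean.
    return weights @ samples


def minimize_local_entropy(f, x0, tau=0.5, n_samples=2000, n_iters=50, seed=0):
    """Iterate the update above starting from x0, using only loss values."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    for _ in range(n_iters):
        x = local_entropy_step(f, x, tau, n_samples, rng)
    return x


if __name__ == "__main__":
    # Toy loss with minimizer at (1, 1); no gradients of f are ever computed.
    def quadratic(z):
        return 0.5 * np.sum((z - 1.0) ** 2)

    print(minimize_local_entropy(quadratic, x0=[3.0, -2.0]))
```

In this sketch the loss is only ever evaluated, never differentiated, and the evaluations within a step are independent of one another, which is the sense in which such a scheme is gradient-free and parallelizable. Plain importance sampling is expected to degrade in high dimensions, so this should be read as an illustration of the structure, not as a practical training method.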


