Learning Generative Models with Sinkhorn Divergences

06/01/2017
by   Aude Genevay, et al.

Comparing two degenerate probability distributions (i.e., two probability distributions supported on two distinct low-dimensional manifolds living in a much higher-dimensional space) is a crucial problem in the estimation of generative models for high-dimensional observations, such as those arising in computer vision or natural language. Optimal transport (OT) metrics are known to offer a cure for this problem, since they were specifically designed as an alternative to information divergences to handle such problematic scenarios. Unfortunately, training generative machines using OT raises formidable computational and statistical challenges, because of (i) the computational burden of evaluating OT losses, (ii) the instability and lack of smoothness of these losses, and (iii) the difficulty of robustly estimating these losses and their gradients in high dimension. This paper presents the first tractable computational method to train large-scale generative models using an optimal transport loss, and tackles these three issues by relying on two key ideas: (a) entropic smoothing, which turns the original OT loss into one that can be computed using Sinkhorn fixed-point iterations; (b) algorithmic (automatic) differentiation of these iterations. Together, these two ideas yield a robust and differentiable approximation of the OT loss with streamlined GPU execution. Entropic smoothing generates a family of losses interpolating between Wasserstein (OT) and Maximum Mean Discrepancy (MMD), making it possible to find a sweet spot that leverages the geometry of OT and the favorable high-dimensional sample complexity of MMD, which comes with unbiased gradient estimates. The resulting computational architecture nicely complements standard deep network generative models with a stack of extra layers implementing the loss function.
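To make idea (a) concrete, here is a minimal sketch of Sinkhorn fixed-point iterations for an entropy-smoothed OT cost. It is an illustration under stated assumptions, not the authors' implementation: the function name `sinkhorn_cost`, the 1-D point clouds, the uniform weights, the squared-distance ground cost, and the values of `epsilon` and `n_iters` are all choices made for this example. It uses plain Python so that it runs without dependencies; differentiating through the loop, as in idea (b), would require writing the same steps in an automatic differentiation framework.

```python
import math


def sinkhorn_cost(x, y, epsilon=0.5, n_iters=200):
    """Entropy-smoothed OT cost between two 1-D point clouds (illustrative).

    Assumptions of this sketch: uniform weights on both clouds and a
    squared-distance ground cost. Returns the transport cost <P, C> and
    the approximate transport plan P.
    """
    n, m = len(x), len(y)
    a = [1.0 / n] * n  # uniform weights on the points of x
    b = [1.0 / m] * m  # uniform weights on the points of y

    # Ground cost C[i][j] and Gibbs kernel K = exp(-C / epsilon).
    C = [[(xi - yj) ** 2 for yj in y] for xi in x]
    K = [[math.exp(-c / epsilon) for c in row] for row in C]

    # Sinkhorn fixed-point iterations on the scaling vectors u and v.
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iters):
        u = [a[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        v = [b[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]

    # Transport plan P = diag(u) K diag(v) and its cost <P, C>.
    P = [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]
    cost = sum(P[i][j] * C[i][j] for i in range(n) for j in range(m))
    return cost, P


# Usage: two small point clouds, the second slightly shifted.
x = [0.0, 0.5, 1.0]
y = [0.1, 0.6, 1.1]
cost, P = sinkhorn_cost(x, y)
print(cost)
```

Each iteration only involves products with the kernel `K` and elementwise divisions, which is why the loop maps well onto GPU matrix operations. For small `epsilon` the entries of `K` can underflow; a log-domain variant is the usual remedy and is not shown here.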


research
11/16/2018

Entropy-regularized Optimal Transport Generative Models

We investigate the use of entropy-regularized optimal transport (EOT) co...
research
08/26/2023

SyMOT-Flow: Learning optimal transport flow for two arbitrary distributions with maximum mean discrepancy

Finding a transformation between two unknown probability distributions f...
research
11/08/2021

Efficient estimates of optimal transport via low-dimensional embeddings

Optimal transport distances (OT) have been widely used in recent work in...
research
11/16/2022

Unbalanced Optimal Transport, from Theory to Numerics

Optimal Transport (OT) has recently emerged as a central tool in data sc...
research
06/11/2023

On Kinetic Optimal Probability Paths for Generative Models

Recent successful generative models are trained by fitting a neural netw...
research
06/15/2022

Rethinking Initialization of the Sinkhorn Algorithm

Computing an optimal transport (OT) coupling between distributions plays...
research
10/29/2021

Learning generative models for valid knockoffs using novel multivariate-rank based statistics

We consider the problem of generating valid knockoffs for knockoff filte...
