Linear Time Sinkhorn Divergences using Positive Features

06/12/2020
by   Meyer Scetbon, et al.
0

Although Sinkhorn divergences are now routinely used in data sciences to compare probability distributions, the computational effort required to compute them remains expensive, growing in general quadratically in the size n of the support of these distributions. Indeed, solving optimal transport (OT) with an entropic regularization requires computing a n× n kernel matrix (the neg-exponential of a n× n pairwise ground cost matrix) that is repeatedly applied to a vector. We propose to use instead ground costs of the form c(x,y)=-logφ(x)φ(y) where φ is a map from the ground space onto the positive orthant ^r_+, with r≪ n. This choice yields, equivalently, a kernel k(x,y)=φ(x)φ(y), and ensures that the cost of Sinkhorn iterations scales as O(nr). We show that usual cost functions can be approximated using this form. Additionaly, we take advantage of the fact that our approach yields approximation that remain fully differentiable with respect to input distributions, as opposed to previously proposed adaptive low-rank approximations of the kernel matrix, to train a faster variant of OT-GAN <cit.>.

READ FULL TEXT
research
06/07/2021

Scalable Computation of Monge Maps with General Costs

Monge map refers to the optimal transport map between two probability di...
research
03/08/2021

Low-Rank Sinkhorn Factorization

Several recent applications of optimal transport (OT) theory to machine ...
research
06/01/2017

Krylov Subspace Recycling for Fast Iterative Least-Squares in Machine Learning

Solving symmetric positive definite linear problems is a fundamental com...
research
07/14/2021

Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More

The current best practice for computing optimal transport (OT) is via en...
research
05/24/2022

Low-rank Optimal Transport: Approximation, Statistics and Debiasing

The matching principles behind optimal transport (OT) play an increasing...
research
02/08/2022

Linear Time Kernel Matrix Approximation via Hyperspherical Harmonics

We propose a new technique for constructing low-rank approximations of m...
research
06/20/2023

Learning Costs for Structured Monge Displacements

Optimal transport theory has provided machine learning with several tool...

Please sign up or login with your details

Forgot password? Click here to reset