Rethinking Initialization of the Sinkhorn Algorithm

06/15/2022
by   James Thornton, et al.
26

Computing an optimal transport (OT) coupling between distributions plays an increasingly important role in machine learning. While OT problems can be solved as linear programs, adding an entropic smoothing term is known to result in solvers that are faster and more robust to outliers, differentiable and easier to parallelize. The Sinkhorn fixed point algorithm is the cornerstone of these approaches, and, as a result, multiple attempts have been made to shorten its runtime using, for instance, annealing, momentum or acceleration. The premise of this paper is that initialization of the Sinkhorn algorithm has received comparatively little attention, possibly due to two preconceptions: as the regularized OT problem is convex, it may not be worth crafting a tailored initialization as any is guaranteed to work; secondly, because the Sinkhorn algorithm is often differentiated in end-to-end pipelines, data-dependent initializations could potentially bias gradient estimates obtained by unrolling iterations. We challenge this conventional wisdom and show that carefully chosen initializations can result in dramatic speed-ups, and will not bias gradients which are computed with implicit differentiation. We detail how initializations can be recovered from closed-form or approximate OT solutions, using known results in the 1D or Gaussian settings. We show empirically that these initializations can be used off-the-shelf, with little to no tuning, and result in consistent speed-ups for a variety of OT problems.

READ FULL TEXT

page 13

page 17

research
02/19/2021

On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent

Recent work has highlighted the role of initialization scale in determin...
research
01/28/2022

Optimal Transport Tools (OTT): A JAX Toolbox for all things Wasserstein

Optimal transport tools (OTT-JAX) is a Python toolbox that can solve opt...
research
05/24/2022

Low-rank Optimal Transport: Approximation, Statistics and Debiasing

The matching principles behind optimal transport (OT) play an increasing...
research
11/30/2022

Generative Adversarial Learning of Sinkhorn Algorithm Initializations

The Sinkhorn algorithm (arXiv:1306.0895) is the state-of-the-art to comp...
research
06/01/2017

Learning Generative Models with Sinkhorn Divergences

The ability to compare two degenerate probability distributions (i.e. tw...
research
12/26/2022

Robust computation of optimal transport by β-potential regularization

Optimal transport (OT) has become a widely used tool in the machine lear...
research
09/30/2022

Sparsity-Constrained Optimal Transport

Regularized optimal transport (OT) is now increasingly used as a loss or...

Please sign up or login with your details

Forgot password? Click here to reset