A Unified Framework for Implicit Sinkhorn Differentiation

05/13/2022
by Marvin Eisenberger, et al.

The Sinkhorn operator has recently surged in popularity in computer vision and related fields, in large part because it is easy to integrate into deep learning frameworks. To train the resulting neural networks efficiently, we propose an algorithm that obtains analytical gradients of a Sinkhorn layer via implicit differentiation. In comparison to prior work, our framework is based on the most general formulation of the Sinkhorn operator: it allows for any type of loss function, and both the target capacities and the cost matrices are differentiated jointly. We further construct error bounds of the resulting algorithm for approximate inputs. Finally, we demonstrate that, for a number of applications, simply replacing automatic differentiation with our algorithm directly improves the stability and accuracy of the obtained gradients. Moreover, we show that it is computationally more efficient, particularly when resources like GPU memory are scarce.
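For readers unfamiliar with the operator the abstract refers to, the following is a minimal sketch of the standard Sinkhorn forward pass (alternating row and column scaling of a Gibbs kernel). It is not the authors' implicit gradient algorithm, which the abstract does not spell out. The function name `sinkhorn`, the parameter names, and the default values are assumptions chosen for illustration.

```python
import math


def sinkhorn(cost, a, b, reg=0.1, n_iters=500):
    """Illustrative entropy-regularized transport plan (hypothetical helper).

    cost: m x n cost matrix as a list of lists
    a:    source marginal of length m (positive, sums to 1)
    b:    target marginal of length n (positive, sums to 1)
    reg:  entropy regularization strength (assumed default)
    """
    m, n = len(a), len(b)
    # Gibbs kernel K_ij = exp(-C_ij / reg)
    K = [[math.exp(-cost[i][j] / reg) for j in range(n)] for i in range(m)]
    u = [1.0] * m
    v = [1.0] * n
    for _ in range(n_iters):
        # Rescale rows so that the plan's row sums match a
        u = [a[i] / sum(K[i][j] * v[j] for j in range(n)) for i in range(m)]
        # Rescale columns so that the plan's column sums match b
        v = [b[j] / sum(K[i][j] * u[i] for i in range(m)) for j in range(n)]
    # Transport plan P = diag(u) K diag(v)
    return [[u[i] * K[i][j] * v[j] for j in range(n)] for i in range(m)]


# Example: a 2 x 2 problem with non-uniform marginals
plan = sinkhorn([[0.0, 1.0], [2.0, 0.5]], [0.3, 0.7], [0.6, 0.4], reg=0.5)
```

Differentiating a loss through this loop with automatic differentiation requires storing every iterate, which is the memory cost the abstract contrasts with implicit differentiation at the converged solution.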
