Pathwise Derivatives Beyond the Reparameterization Trick

06/05/2018
by   Martin Jankowiak, et al.
0

We observe that gradients computed via the reparameterization trick are in direct correspondence with solutions of the transport equation in the formalism of optimal transport. We use this perspective to compute (approximate) pathwise gradients for probability distributions not directly amenable to the reparameterization trick: Gamma, Beta, and Dirichlet. We further observe that when the reparameterization trick is applied to the Cholesky-factorized multivariate Normal distribution, the resulting gradients are suboptimal in the sense of optimal transport. We derive the optimal gradients and show that they have reduced variance in a Gaussian Process regression task. We demonstrate with a variety of synthetic experiments and stochastic variational inference tasks that our pathwise gradients are competitive with other methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2018

Pathwise Derivatives for Multivariate Distributions

We exploit the link between the transport equation and derivatives of ex...
research
07/17/2023

Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients

We design a novel algorithm for optimal transport by drawing from the en...
research
01/05/2021

Minibatch optimal transport distances; analysis and applications

Optimal transport distances have become a classic tool to compare probab...
research
06/26/2020

Computing Light Transport Gradients using the Adjoint Method

This paper proposes a new equation from continuous adjoint theory to com...
research
01/25/2023

Learning Gradients of Convex Functions with Monotone Gradient Networks

While much effort has been devoted to deriving and studying effective co...
research
12/28/2020

Comparing Probability Distributions with Conditional Transport

To measure the difference between two probability distributions, we prop...
research
05/22/2018

Implicit Reparameterization Gradients

By providing a simple and efficient way of computing low-variance gradie...

Please sign up or login with your details

Forgot password? Click here to reset