Frank-Wolfe Stein Sampling

05/21/2018
by   Futoshi Futami, et al.
0

In Bayesian inference, the posterior distributions are difficult to obtain analytically for complex models such as neural networks. Variational inference usually uses a parametric distribution to approximate, from which we can easily draw samples. Recently discrete approximation by particles has attracted attention because of its expressive ability. An example is Stein variational gradient descent (SVGD), which iteratively optimizes particles. Although SVGD has been shown to be computationally efficient empirically, its theoretical properties have not been clarified yet and no finite sample bound of a convergence rate is known. Another example is Stein points (SP), which minimizes kernelized Stein discrepancy directly. The finite sample bound of SP is O(√(N/N)) for N particles, which is computationally inefficient empirically, especially in high-dimensional problems. In this paper, we propose a novel method named Frank-Wolfe Stein sampling, which minimizes the maximum mean discrepancy in a greedy way. Our method is computationally efficient empirically and theoretically achieves a faster convergence rate, O(e^-N). Numerical experiments show the superiority of our method.

READ FULL TEXT
research
05/27/2023

Provably Fast Finite Particle Variants of SVGD via Virtual Particle Stochastic Approximation

Stein Variational Gradient Descent (SVGD) is a popular variational infer...
research
01/24/2021

Annealed Stein Variational Gradient Descent

Particle based optimization algorithms have recently been developed as s...
research
11/17/2022

A Finite-Particle Convergence Rate for Stein Variational Gradient Descent

We provide a first finite-particle convergence rate for Stein variationa...
research
04/09/2019

Kernelized Complete Conditional Stein Discrepancy

Much of machine learning relies on comparing distributions with discrepa...
research
05/18/2023

Augmented Message Passing Stein Variational Gradient Descent

Stein Variational Gradient Descent (SVGD) is a popular particle-based me...
research
06/21/2021

Local convexity of the TAP free energy and AMP convergence for Z2-synchronization

We study mean-field variational Bayesian inference using the TAP approac...
research
06/27/2023

Coupling parameter and particle dynamics for adaptive sampling in Neural Galerkin schemes

Training nonlinear parametrizations such as deep neural networks to nume...

Please sign up or login with your details

Forgot password? Click here to reset