A New Unbiased and Efficient Class of LSH-Based Samplers and Estimators for Partition Function Computation in Log-Linear Models

03/15/2017
by Ryan Spring, et al.
Log-linear models are arguably the most successful class of graphical models for large-scale applications because of their simplicity and tractability. Learning and inference with these models require calculating the partition function, which is a major bottleneck and is intractable for large state spaces. Importance Sampling (IS) and MCMC-based approaches are attractive alternatives to exact computation. However, they depend on having a "good" proposal distribution, a condition that is often not satisfied in practice. In this paper, we add a new dimension to efficient estimation via sampling. We propose a new sampling scheme and an unbiased estimator that estimates the partition function accurately in sub-linear time. Our samples are generated in near-constant time using locality-sensitive hashing (LSH), and are therefore correlated and unnormalized. We demonstrate the effectiveness of our proposed approach by comparing the accuracy and speed of estimating the partition function against other state-of-the-art estimation techniques, including IS and the efficient variant of Gumbel-Max sampling. With our efficient sampling scheme, we accurately train real-world language models using only 1-2% of the computations.
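To make the quantities in the abstract concrete, the following is a minimal sketch of the baseline the paper compares against: plain importance sampling of the partition function Z = sum_y exp(theta_y . x) with a uniform proposal. It is not the LSH-based sampler proposed in the paper. The function names, the uniform proposal, and the toy data are assumptions made for illustration only.

```python
import math
import random


def exact_partition(weights, x):
    """Exact Z = sum_y exp(theta_y . x); cost is linear in the number of states."""
    return sum(
        math.exp(sum(w * xi for w, xi in zip(theta, x))) for theta in weights
    )


def is_partition_estimate(weights, x, num_samples, rng):
    """Importance sampling estimate of Z with a uniform proposal q(y) = 1/N.

    Each term exp(theta_y . x) / q(y) is an unbiased estimate of Z, so the
    average is unbiased too. Its variance depends on how well q matches
    the target distribution (the "good proposal" condition).
    """
    n = len(weights)
    total = 0.0
    for _ in range(num_samples):
        y = rng.randrange(n)  # draw a state uniformly at random
        score = sum(w * xi for w, xi in zip(weights[y], x))
        total += math.exp(score) * n  # dividing by q(y) = 1/n
    return total / num_samples


# Toy data (hypothetical): 2000 states, 8-dimensional feature vectors.
rng = random.Random(0)
dim, n_states = 8, 2000
weights = [[rng.gauss(0, 0.3) for _ in range(dim)] for _ in range(n_states)]
x = [rng.gauss(0, 1) for _ in range(dim)]

z_exact = exact_partition(weights, x)                 # touches all 2000 states
z_hat = is_partition_estimate(weights, x, 200, rng)   # touches only 200 states
print(z_exact, z_hat)
```

The estimate uses a tenth of the score evaluations in this toy setting. A uniform proposal becomes a poor choice when a few states dominate Z, which is the situation that motivates better-adapted samplers such as the one the abstract describes.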

research
11/07/2017

Neural Variational Inference and Learning in Undirected Graphical Models

Many problems in machine learning are naturally expressed in the languag...
research
07/23/2020

Efficient Evaluation of the Partition Function of RBMs with Annealed Importance Sampling

Probabilistic models based on Restricted Boltzmann Machines (RBMs) imply...
research
10/30/2019

Lsh-sampling Breaks the Computation Chicken-and-egg Loop in Adaptive Stochastic Gradient Estimation

Stochastic Gradient Descent or SGD is the most popular optimization algo...
research
10/15/2016

Markov Chain Truncation for Doubly-Intractable Inference

Computing partition functions, the normalizing constants of probability ...
research
03/07/2016

Partition Functions from Rao-Blackwellized Tempered Sampling

Partition functions of probability distributions are important quantitie...
research
04/08/2020

Incidence weighting estimation under bipartite incidence graph sampling

Bipartite incidence graph sampling provides a unified representation of ...
research
08/18/2019

Revisiting the balance heuristic for estimating normalising constants

Multiple importance sampling estimators are widely used for computing in...
