Optimal Approximate Sampling from Discrete Probability Distributions

01/13/2020
by   Feras A. Saad, et al.
0

This paper addresses a fundamental problem in random variate generation: given access to a random source that emits a stream of independent fair bits, what is the most accurate and entropy-efficient algorithm for sampling from a discrete probability distribution (p_1, ..., p_n), where the probabilities of the output distribution (p̂_1, ..., p̂_n) of the sampling algorithm must be specified using at most k bits of precision? We present a theoretical framework for formulating this problem and provide new techniques for finding sampling algorithms that are optimal both statistically (in the sense of sampling accuracy) and information-theoretically (in the sense of entropy consumption). We leverage these results to build a system that, for a broad family of measures of statistical accuracy, delivers a sampling algorithm whose expected entropy usage is minimal among those that induce the same distribution (i.e., is "entropy-optimal") and whose output distribution (p̂_1, ..., p̂_n) is a closest approximation to the target distribution (p_1, ..., p_n) among all entropy-optimal sampling algorithms that operate within the specified k-bit precision. This optimal approximate sampler is also a closer approximation than any (possibly entropy-suboptimal) sampler that consumes a bounded amount of entropy with the specified precision, a class which includes floating-point implementations of inversion sampling and related methods found in many software libraries. We evaluate the accuracy, entropy consumption, precision requirements, and wall-clock runtime of our optimal approximate sampling algorithms on a broad set of distributions, demonstrating the ways that they are superior to existing approximate samplers and establishing that they often consume significantly fewer resources than are needed by exact samplers.

READ FULL TEXT

page 1

page 24

page 32

research
03/08/2020

The Fast Loaded Dice Roller: A Near-Optimal Exact Sampler for Discrete Probability Distributions

This paper introduces a new algorithm for the fundamental problem of gen...
research
01/19/2019

Minimum--Entropy Couplings and their Applications

Given two discrete random variables X and Y, with probability distributi...
research
05/20/2023

Low-Entropy Latent Variables Hurt Out-of-Distribution Performance

We study the relationship between the entropy of intermediate representa...
research
04/22/2018

Maximum entropy priors with derived parameters in a specified distribution

We propose a method for transforming probability distributions so that p...
research
01/30/2022

Fast Relative Entropy Coding with A* coding

Relative entropy coding (REC) algorithms encode a sample from a target d...
research
02/23/2023

Minimum-Entropy Coupling Approximation Guarantees Beyond the Majorization Barrier

Given a set of discrete probability distributions, the minimum entropy c...
research
03/10/2022

A Tighter Approximation Guarantee for Greedy Minimum Entropy Coupling

We examine the minimum entropy coupling problem, where one must find the...

Please sign up or login with your details

Forgot password? Click here to reset