Regenerative Particle Thompson Sampling

03/15/2022
by   Zeyu Zhou, et al.
0

This paper proposes regenerative particle Thompson sampling (RPTS), a flexible variation of Thompson sampling. Thompson sampling itself is a Bayesian heuristic for solving stochastic bandit problems, but it is hard to implement in practice due to the intractability of maintaining a continuous posterior distribution. Particle Thompson sampling (PTS) is an approximation of Thompson sampling obtained by simply replacing the continuous distribution by a discrete distribution supported at a set of weighted static particles. We observe that in PTS, the weights of all but a few fit particles converge to zero. RPTS is based on the heuristic: delete the decaying unfit particles and regenerate new particles in the vicinity of fit surviving particles. Empirical evidence shows uniform improvement from PTS to RPTS and flexibility and efficacy of RPTS across a set of representative bandit problems, including an application to 5G network slicing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2014

Thompson sampling with the online bootstrap

Thompson sampling provides a solution to bandit problems in which new ob...
research
09/05/2018

Stochastic Particle-Optimization Sampling and the Non-Asymptotic Convergence Theory

Particle-optimization sampling (POS) is a recently developed technique t...
research
12/08/2022

The Lifebelt Particle Filter for robust estimation from low-valued count data

Particle filtering methods are well developed for continuous state-space...
research
05/04/2023

Stereological determination of particle size distributions for similar convex bodies

Consider an opaque medium which contains 3D particles. All particles are...
research
10/06/2021

Unrolling Particles: Unsupervised Learning of Sampling Distributions

Particle filtering is used to compute good nonlinear estimates of comple...
research
09/01/2020

A heuristic independent particle approximation to determinantal point processes

A determinantal point process is a stochastic point process that is comm...
research
03/04/2023

Progressive Bayesian Particle Flows based on Optimal Transport Map Sequences

We propose a method for optimal Bayesian filtering with deterministic pa...

Please sign up or login with your details

Forgot password? Click here to reset