Hypermodels for Exploration

06/12/2020
by   Vikranth Dwaracherla, et al.
0

We study the use of hypermodels to represent epistemic uncertainty and guide exploration. This generalizes and extends the use of ensembles to approximate Thompson sampling. The computational cost of training an ensemble grows with its size, and as such, prior work has typically been limited to ensembles with tens of elements. We show that alternative hypermodels can enjoy dramatic efficiency gains, enabling behavior that would otherwise require hundreds or thousands of elements, and even succeed in situations where ensemble methods fail to learn regardless of size. This allows more accurate approximation of Thompson sampling as well as use of more sophisticated exploration schemes. In particular, we consider an approximate form of information-directed sampling and demonstrate performance gains relative to Thompson sampling. As alternatives to ensembles, we consider linear and neural network hypermodels, also known as hypernetworks. We prove that, with neural network base models, a linear hypermodel can represent essentially any distribution over functions, and as such, hypernetworks are no more expressive.

READ FULL TEXT
research
04/30/2019

Ensemble Distribution Distillation

Ensemble of Neural Network (NN) models are known to yield improvements i...
research
06/05/2017

UCB Exploration via Q-Ensembles

We show how an ensemble of Q^*-functions can be leveraged for more effec...
research
09/12/2023

Epistemic Modeling Uncertainty of Rapid Neural Network Ensembles for Adaptive Learning

Emulator embedded neural networks, which are a type of physics informed ...
research
12/30/2021

SAE: Sequential Anchored Ensembles

Computing the Bayesian posterior of a neural network is a challenging ta...
research
09/20/2023

You can have your ensemble and run it too – Deep Ensembles Spread Over Time

Ensembles of independently trained deep neural networks yield uncertaint...
research
06/12/2023

Diverse Projection Ensembles for Distributional Reinforcement Learning

In contrast to classical reinforcement learning, distributional reinforc...
research
10/19/2017

A Space-Efficient Method for Navigable Ensemble Analysis and Visualization

Scientists increasingly rely on simulation runs of complex models in lie...

Please sign up or login with your details

Forgot password? Click here to reset