A Short Note on the Relationship of Information Gain and Eluder Dimension

07/06/2021
by   Kaixuan Huang, et al.
5

Eluder dimension and information gain are two widely used methods of complexity measures in bandit and reinforcement learning. Eluder dimension was originally proposed as a general complexity measure of function classes, but the common examples of where it is known to be small are function spaces (vector spaces). In these cases, the primary tool to upper bound the eluder dimension is the elliptic potential lemma. Interestingly, the elliptic potential lemma also features prominently in the analysis of linear bandits/reinforcement learning and their nonparametric generalization, the information gain. We show that this is not a coincidence – eluder dimension and information gain are equivalent in a precise sense for reproducing kernel Hilbert spaces.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/18/2020

Provably adaptive reinforcement learning in metric spaces

We study reinforcement learning in continuous state and action spaces en...
research
04/15/2019

A Linear Upper Bound on the Weisfeiler-Leman Dimension of Graphs of Bounded Genus

The Weisfeiler-Leman (WL) dimension of a graph is a measure for the inhe...
research
01/08/2020

On Thompson Sampling for Smoother-than-Lipschitz Bandits

Thompson Sampling is a well established approach to bandit and reinforce...
research
08/31/2023

Training Neural Networks Using Reproducing Kernel Space Interpolation and Model Reduction

We introduce and study the theory of training neural networks using inte...
research
01/30/2019

Private Q-Learning with Functional Noise in Continuous Spaces

We consider privacy-preserving algorithms for deep reinforcement learnin...
research
03/01/2022

On the Generalization of Representations in Reinforcement Learning

In reinforcement learning, state representations are used to tractably d...
research
05/19/2023

From Random Search to Bandit Learning in Metric Measure Spaces

Random Search is one of the most widely-used method for Hyperparameter O...

Please sign up or login with your details

Forgot password? Click here to reset