A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian Bandits

01/05/2020
by   Vrettos Moulos, et al.
0

This paper develops a Hoeffding inequality for the partial sums ∑_k=1^n f (X_k), where {X_k}_k ∈Z_> 0 is an irreducible Markov chain on a finite state space S, and f : S → [a, b] is a real-valued function. Our bound is simple, general, since it only assumes irreducibility and finiteness of the state space, and powerful. In order to demonstrate its usefulness we provide two applications in multi-armed bandit problems. The first is about identifying an approximately best Markovian arm, while the second is concerned with regret minimization in the context of Markovian bandits.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/01/2018

Hoeffding's lemma for Markov Chains and its applications to statistical learning

We establish the counterpart of Hoeffding's lemma for Markov dependent r...
research
07/10/2019

Optimal Chernoff and Hoeffding Bounds for Finite Markov Chains

This paper develops an optimal Chernoff type bound for the probabilities...
research
01/05/2018

Nonparametric Stochastic Contextual Bandits

We analyze the K-armed bandit problem where the reward for each arm is a...
research
08/09/2021

Whittle Index for A Class of Restless Bandits with Imperfect Observations

We consider a class of restless bandit problems that finds a broad appli...
research
07/06/2023

PCL-Indexability and Whittle Index for Restless Bandits with General Observation Models

In this paper, we consider a general observation model for restless mult...
research
05/24/2017

Boundary Crossing Probabilities for General Exponential Families

We consider parametric exponential families of dimension K on the real l...
research
06/30/2020

Bounded Rationality in Las Vegas: Probabilistic Finite Automata PlayMulti-Armed Bandits

While traditional economics assumes that humans are fully rational agent...

Please sign up or login with your details

Forgot password? Click here to reset