Learning on the Edge: Online Learning with Stochastic Feedback Graphs

10/09/2022
by   Emmanuel Esposito, et al.
0

The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback. In this work, we study an extension where the directed feedback graph is stochastic, following a distribution similar to the classical Erdős-Rényi model. Specifically, in each round every edge in the graph is either realized or not with a distinct probability for each edge. We prove nearly optimal regret bounds of order min{min_ε√((α_ε/ε) T), min_ε (δ_ε/ε)^1/3 T^2/3} (ignoring logarithmic factors), where α_ε and δ_ε are graph-theoretic quantities measured on the support of the stochastic feedback graph 𝒢 with edge probabilities thresholded at ε. Our result, which holds without any preliminary knowledge about 𝒢, requires the learner to observe only the realized out-neighborhood of the chosen action. When the learner is allowed to observe the realization of the entire graph (but only the losses in the out-neighborhood of the chosen action), we derive a more efficient algorithm featuring a dependence on weighted versions of the independence and weak domination numbers that exhibits improved bounds for some special cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2016

Online Learning with Feedback Graphs Without the Graphs

We study an online learning framework introduced by Mannor and Shamir (2...
research
06/01/2022

A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs

We consider online learning with feedback graphs, a sequential decision-...
research
05/23/2018

Analysis of Thompson Sampling for Graphical Bandits Without the Graphs

We study multi-armed bandit problems with graph feedback, in which the d...
research
06/07/2021

Beyond Bandit Feedback in Online Multiclass Classification

We study the problem of online multiclass classification in a setting wh...
research
06/15/2021

Online Learning with Uncertain Feedback Graphs

Online learning with expert advice is widely used in various machine lea...
research
03/04/2019

Stochastic Online Learning with Probabilistic Graph Feedback

We consider a problem of stochastic online learning with general probabi...
research
09/03/2022

Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning

We determine sharp bounds on the price of bandit feedback for several va...

Please sign up or login with your details

Forgot password? Click here to reset