An Empirical Evaluation of Federated Contextual Bandit Algorithms

03/17/2023
by   Alekh Agarwal, et al.
0

As the adoption of federated learning increases for learning from sensitive data local to user devices, it is natural to ask if the learning can be done using implicit signals generated as users interact with the applications of interest, rather than requiring access to explicit labels which can be difficult to acquire in many tasks. We approach such problems with the framework of federated contextual bandits, and develop variants of prominent contextual bandit algorithms from the centralized seting for the federated setting. We carefully evaluate these algorithms in a range of scenarios simulated using publicly available datasets. Our simulations model typical setups encountered in the real-world, such as various misalignments between an initial pre-trained model and the subsequent user interactions due to non-stationarity in the data and/or heterogeneity across clients. Our experiments reveal the surprising effectiveness of the simple and commonly used softmax heuristic in balancing the well-know exploration-exploitation tradeoff across the breadth of our settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/04/2021

Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits

Linear contextual bandit is a popular online learning problem. It has be...
research
10/14/2022

Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning

An oft-cited challenge of federated learning is the presence of heteroge...
research
06/30/2022

Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated Learning

An oft-cited challenge of federated learning is the presence of data het...
research
10/20/2022

Vertical Federated Linear Contextual Bandits

In this paper, we investigate a novel problem of building contextual ban...
research
10/12/2022

Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets

We propose a method for generating simulated contextual bandit environme...
research
09/21/2023

Incentivized Communication for Federated Bandits

Most existing works on federated bandits take it for granted that all cl...
research
09/06/2023

Federated Learning Over Images: Vertical Decompositions and Pre-Trained Backbones Are Difficult to Beat

We carefully evaluate a number of algorithms for learning in a federated...

Please sign up or login with your details

Forgot password? Click here to reset