An Efficient Algorithm for Fair Multi-Agent Multi-Armed Bandit with Low Regret

09/23/2022
by   Matthew Jones, et al.
0

Recently a multi-agent variant of the classical multi-armed bandit was proposed to tackle fairness issues in online learning. Inspired by a long line of work in social choice and economics, the goal is to optimize the Nash social welfare instead of the total utility. Unfortunately previous algorithms either are not efficient or achieve sub-optimal regret in terms of the number of rounds T. We propose a new efficient algorithm with lower regret than even previous inefficient ones. For N agents, K arms, and T rounds, our approach has a regret bound of Õ(√(NKT) + NK). This is an improvement to the previous approach, which has regret bound of Õ( min(NK, √(N) K^3/2)√(T)). We also complement our efficient algorithm with an inefficient approach with Õ(√(KT) + N^2K) regret. The experimental findings confirm the effectiveness of our efficient algorithm compared to the previous approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2020

Fair Algorithms for Multi-Agent Multi-Armed Bandits

We propose a multi-agent variant of the classical multi-armed bandit pro...
research
05/27/2022

Fairness and Welfare Quantification for Regret in Multi-Armed Bandits

We extend the notion of regret with a welfarist perspective. Focussing o...
research
11/30/2022

On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits

We consider the nonstochastic multi-agent multi-armed bandit problem wit...
research
07/23/2019

Achieving Fairness in the Stochastic Multi-armed Bandit Problem

We study an interesting variant of the stochastic multi-armed bandit pro...
research
11/10/2021

Multi-Agent Learning for Iterative Dominance Elimination: Formal Barriers and New Algorithms

Dominated actions are natural (and perhaps the simplest possible) multi-...
research
02/07/2023

Leveraging Demonstrations to Improve Online Learning: Quality Matters

We investigate the extent to which offline demonstration data can improv...
research
11/05/2019

Response Prediction for Low-Regret Agents

Companies like Google and Microsoft run billions of auctions every day t...

Please sign up or login with your details

Forgot password? Click here to reset