Thompson Sampling for Factored Multi-Agent Bandits

11/22/2019
by   Timothy Verstraeten, et al.
0

Multi-agent coordination is prevalent in many real-world applications. However, such coordination is challenging due to its combinatorial nature. An important observation in this regard is that agents in the real world often only directly affect a limited set of neighboring agents. Leveraging such loose couplings among agents is key to making coordination in multi-agent systems feasible. In this work, we focus on learning to coordinate. Specifically, we consider the multi-agent multi-armed bandit framework, in which fully cooperative loosely-coupled agents must learn to coordinate their decisions to optimize a common objective. As opposed to in the planning setting, for learning methods it is challenging to establish theoretical guarantees. We propose multi-agent Thompson sampling (MATS), a new Bayesian exploration-exploitation algorithm that leverages loose couplings. We provide a regret bound that is sublinear in time and low-order polynomial in the highest number of actions of a single agent for sparse coordination graphs. Finally, we empirically show that MATS outperforms the state-of-the-art algorithm, MAUCE, on two synthetic benchmarks, a realistic wind farm control task, and a novel benchmark with Poisson distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2019

Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures

Multi-agent coordination is prevalent in many real-world applications. H...
research
01/01/2014

Design of a GIS-based Assistant Software Agent for the Incident Commander to Coordinate Emergency Response Operations

Problem: This paper addresses the design of an intelligent software syst...
research
09/18/2023

MindAgent: Emergent Gaming Interaction

Large Language Models (LLMs) have the capacity of performing complex sch...
research
05/26/2023

A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem

Training multiple agents to coordinate is an important problem with appl...
research
10/04/2022

Meta Navigation Functions: Adaptive Associations for Coordination of Multi-Agent Systems

In this paper, we introduce a new class of potential fields, i.e., meta ...
research
05/25/2020

Non-cooperative Multi-agent Systems with Exploring Agents

Multi-agent learning is a challenging problem in machine learning that h...
research
04/12/2018

Automatic Generation of Communication Requirements for Enforcing Multi-Agent Safety

Distributed controllers are often necessary for a multi-agent system to ...

Please sign up or login with your details

Forgot password? Click here to reset