Event-Based Communication in Distributed Q-Learning

09/03/2021
by   Daniel Jarne Ornia, et al.
0

We present an approach to reduce the communication of information needed on a Distributed Q-Learning system inspired by Event Triggered Control (ETC) techniques. We consider a baseline scenario of a distributed Q-learning problem on a Markov Decision Process (MDP). Following an event-based approach, N agents explore the MDP and communicate experiences to a central learner only when necessary, which performs updates of the actor Q functions. We design an Event Based distributed Q learning system (EBd-Q), and derive convergence guarantees with respect to a vanilla Q-learning algorithm. We present experimental results showing that event-based communication results in a substantial reduction of data transmission rates in such distributed systems. Additionally, we discuss what effects (desired and undesired) these event-based approaches have on the learning processes studied, and how they can be applied to more complex multi-agent systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2022

Robust Event-Driven Interactions in Cooperative Multi-Agent Learning

We present an approach to reduce the communication required between agen...
research
02/22/2022

Event-Triggered Tracking Control of Networked Multi-Agent Systems

This paper studies the tracking control problem of networked multi-agent...
research
03/05/2018

Event-triggered Learning for Resource-efficient Networked Control

Common event-triggered state estimation (ETSE) algorithms save communica...
research
11/16/2022

Asynchronous Bayesian Learning over a Network

We present a practical asynchronous data fusion model for networked agen...
research
09/08/2019

Distributed Deep Learning with Event-Triggered Communication

We develop a Distributed Event-Triggered Stochastic GRAdient Descent (DE...
research
03/12/2019

Self-triggered distributed k-order coverage control

A k-order coverage control problem is studied where a network of agents ...
research
04/02/2023

On the trade-off between event-based and periodic state estimation under bandwidth constraints

Event-based methods carefully select when to transmit information to ena...

Please sign up or login with your details

Forgot password? Click here to reset