Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling

06/21/2022
by   Sajad Khodadadian, et al.
2

Since reinforcement learning algorithms are notoriously data-intensive, the task of sampling observations from the environment is usually split across multiple agents. However, transferring these observations from the agents to a central location can be prohibitively expensive in terms of the communication cost, and it can also compromise the privacy of each agent's local behavior policy. In this paper, we consider a federated reinforcement learning framework where multiple agents collaboratively learn a global model, without sharing their individual data and policies. Each agent maintains a local copy of the model and updates it using locally sampled data. Although having N agents enables the sampling of N times more data, it is not clear if it leads to proportional convergence speedup. We propose federated versions of on-policy TD, off-policy TD and Q-learning, and analyze their convergence. For all these algorithms, to the best of our knowledge, we are the first to consider Markovian noise and multiple local updates, and prove a linear convergence speedup with respect to the number of agents. To obtain these results, we show that federated TD and Q-learning are special cases of a general framework for federated stochastic approximation with Markovian noise, and we leverage this framework to provide a unified convergence analysis that applies to all the algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/14/2023

Federated TD Learning over Finite-Rate Erasure Channels: Linear Speedup under Markovian Sampling

Federated learning (FL) has recently gained much attention due to its ef...
research
01/24/2019

Federated Reinforcement Learning

In reinforcement learning, building policies of high-quality is challeng...
research
04/06/2022

Federated Reinforcement Learning with Environment Heterogeneity

We study a Federated Reinforcement Learning (FedRL) problem in which n a...
research
05/18/2023

The Blessing of Heterogeneity in Federated Q-learning: Linear Speedup and Beyond

When the data used for reinforcement learning (RL) are collected by mult...
research
11/03/2020

Federated LQR: Learning through Sharing

In many multi-agent reinforcement learning applications such as flocking...
research
05/27/2022

FedFormer: Contextual Federation with Attention in Reinforcement Learning

A core issue in federated reinforcement learning is defining how to aggr...
research
05/20/2018

Knowledge Aggregation via Epsilon Model Spaces

In many practical applications, machine learning is divided over multipl...

Please sign up or login with your details

Forgot password? Click here to reset