Federated Ensemble-Directed Offline Reinforcement Learning

05/04/2023
by Desik Rengarajan, et al.

We consider the problem of federated offline reinforcement learning (RL), a setting in which distributed learning agents must collaboratively learn a high-quality control policy using only small pre-collected datasets generated according to different unknown behavior policies. Naively combining a standard offline RL approach with a standard federated learning approach to solve this problem can lead to poorly performing policies. In response, we develop the Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which distills the collective wisdom of the clients using an ensemble learning approach. We develop the FEDORA codebase to utilize distributed compute resources on a federated learning platform. We show that FEDORA significantly outperforms other approaches, including offline RL over the combined data pool, in various complex continuous control environments and on real-world datasets. Finally, we demonstrate the performance of FEDORA in the real world on a mobile robot.
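To make the contrast in the abstract concrete, the sketch below compares uniform federated averaging of client parameters (a standard federated learning step) with an average weighted by a per-client quality score. It is an illustration only and is not the FEDORA algorithm, whose details the abstract does not give. The function names, the `scores` input, and the `temperature` parameter are hypothetical.

```python
import math


def fedavg(client_params):
    """Uniform federated averaging: element-wise mean of client parameter vectors."""
    n = len(client_params)
    return [sum(vals) / n for vals in zip(*client_params)]


def score_weighted_average(client_params, scores, temperature=1.0):
    """Softmax-weighted average of client parameter vectors.

    `scores` is a hypothetical per-client quality estimate (for example,
    an estimated policy return). Higher scores receive more weight.
    """
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp((s - top) / temperature) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    return [
        sum(w * v for w, v in zip(weights, vals))
        for vals in zip(*client_params)
    ]


# Two clients, each holding a two-parameter "policy".
params = [[0.0, 2.0], [2.0, 4.0]]
uniform = fedavg(params)
skewed = score_weighted_average(params, scores=[0.0, 100.0])
```

With equal scores the weighted average reduces to the uniform one. When one client scores far higher, the aggregate moves toward that client's parameters, which is one simple way heterogeneous data quality could be taken into account.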

research
11/08/2021

Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning

In real world, affecting the environment by a weak policy can be expensi...
research
09/12/2021

Federated Ensemble Model-based Reinforcement Learning

Federated learning (FL) is a privacy-preserving machine learning paradig...
research
01/26/2023

FedHQL: Federated Heterogeneous Q-Learning

Federated Reinforcement Learning (FedRL) encourages distributed agents t...
research
07/28/2023

Benchmarking Offline Reinforcement Learning on Real-Robot Hardware

Learning policies from previously recorded data is a promising direction...
research
10/13/2022

Personalized Federated Hypernetworks for Privacy Preservation in Multi-Task Reinforcement Learning

Multi-Agent Reinforcement Learning currently focuses on implementations ...
research
06/11/2022

Federated Offline Reinforcement Learning

Evidence-based or data-driven dynamic treatment regimes are essential fo...
research
01/24/2019

Federated Reinforcement Learning

In reinforcement learning, building policies of high-quality is challeng...
