TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow

09/08/2017
by   Danijar Hafner, et al.
0

We introduce TensorFlow Agents, an efficient infrastructure paradigm for building parallel reinforcement learning algorithms in TensorFlow. We simulate multiple environments in parallel, and group them to perform the neural network computation on a batch rather than individual observations. This allows the TensorFlow execution engine to parallelize computation, without the need for manual synchronization. Environments are stepped in separate Python processes to progress them in parallel without interference of the global interpreter lock. As part of this project, we introduce BatchPPO, an efficient implementation of the proximal policy optimization algorithm. By open sourcing TensorFlow Agents, we hope to provide a flexible starting point for future projects that accelerates future research in the field.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/31/2018

APES: a Python toolbox for simulating reinforcement learning environments

Assisted by neural networks, reinforcement learning agents have been abl...
research
02/27/2019

TensorFlow Eager: A Multi-Stage, Python-Embedded DSL for Machine Learning

TensorFlow Eager is a multi-stage, Python-embedded domain-specific langu...
research
04/10/2020

Efficient Sampled Softmax for Tensorflow

This short paper discusses an efficient implementation of sampled softma...
research
03/12/2021

TensorGP – Genetic Programming Engine in TensorFlow

In this paper, we resort to the TensorFlow framework to investigate the ...
research
03/08/2019

Auto-Vectorizing TensorFlow Graphs: Jacobians, Auto-Batching And Beyond

We propose a static loop vectorization optimization on top of high level...
research
01/11/2021

From Tinkering to Engineering: Measurements in Tensorflow Playground

In this article, we present an extension of the Tensorflow Playground, c...
research
05/23/2022

Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization

At first sight it may seem straightforward to use recurrent layers in De...

Please sign up or login with your details

Forgot password? Click here to reset