
ModelBased Reinforcement Learning via MetaPolicy Optimization
Modelbased reinforcement learning approaches carry the promise of being...
read it

Quantifying Generalization in Reinforcement Learning
In this paper, we investigate the problem of overfitting in deep reinfor...
read it

Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Monte Carlo Tree Search (MCTS) algorithms perform simulationbased searc...
read it

Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Dexterous multifingered hands are extremely versatile and provide a gen...
read it

TeacherStudent Curriculum Learning
We propose TeacherStudent Curriculum Learning (TSCL), a framework for a...
read it

UCB Exploration via QEnsembles
We show how an ensemble of Q^*functions can be leveraged for more effec...
read it

#Exploration: A Study of CountBased Exploration for Deep Reinforcement Learning
Countbased exploration algorithms are known to perform nearoptimally w...
read it

Concrete Problems in AI Safety
Rapid progress in machine learning and artificial intelligence (AI) has ...
read it

OpenAI Gym
OpenAI Gym is a toolkit for reinforcement learning research. It includes...
read it

RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning
Deep reinforcement learning (deep RL) has been successful in learning so...
read it

Variational Lossy Autoencoder
Representation learning seeks to expose certain aspects of observed data...
read it

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
This paper describes InfoGAN, an informationtheoretic extension to the ...
read it

VIME: Variational Information Maximizing Exploration
Scalable and effective exploration remains a key challenge in reinforcem...
read it

Reptile: a Scalable Metalearning Algorithm
This paper considers metalearning problems, where there is a distributio...
read it

Gotta Learn Fast: A New Benchmark for Generalization in RL
In this report, we present a new reinforcement learning (RL) benchmark b...
read it

Theano: A Python framework for fast computation of mathematical expressions
Theano is a Python library that allows to define, optimize, and evaluate...
read it

SemiSupervised Learning by Label Gradient Alignment
We present label gradient alignment, a novel algorithm for semisupervis...
read it

Leveraging Procedural Generation to Benchmark Reinforcement Learning
In this report, we introduce Procgen Benchmark, a suite of 16 procedural...
read it