b'Tom Le Paine'

research

∙ 08/17/2023

Reinforced Self-Training (ReST) for Language Modeling

Reinforcement learning from human feedback (RLHF) can improve the qualit...

0 Caglar Gulcehre, et al. ∙

research

∙ 08/07/2023

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

StarCraft II is one of the most challenging simulated reinforcement lear...

0 Michael Mathieu, et al. ∙

research

∙ 06/16/2023

π2vec: Policy Representations with Successor Features

This paper describes π2vec, a method for representing behaviors of black...

0 Gianluca Scarpellini, et al. ∙

research

∙ 05/21/2021

On Instrumental Variable Regression for Deep Offline Policy Evaluation

We show that the popular reinforcement learning (RL) strategy of estimat...

26 Yutian Chen, et al. ∙

research

∙ 04/28/2021

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization

Standard dynamics models for continuous control make use of feedforward ...

5 Michael R. Zhang, et al. ∙

research

∙ 03/30/2021

Benchmarks for Deep Off-Policy Evaluation

Off-policy evaluation (OPE) holds the promise of being able to leverage ...

13 Justin Fu, et al. ∙

research

∙ 07/17/2020

Hyperparameter Selection for Offline Reinforcement Learning

Offline reinforcement learning (RL purely from logged data) is an import...

30 Tom Le Paine, et al. ∙

research

∙ 06/24/2020

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Offline methods for reinforcement learning have the potential to help br...

10 Caglar Gulcehre, et al. ∙

research

∙ 06/01/2020

Acme: A Research Framework for Distributed Reinforcement Learning

Deep reinforcement learning has led to many recent-and groundbreaking-ad...

22 Matt Hoffman, et al. ∙

research

∙ 10/22/2019

Improving the Gating Mechanism of Recurrent Neural Networks

Gating mechanisms are widely used in neural network models, where they a...

30 Albert Gu, et al. ∙

research

∙ 09/03/2019

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

This paper introduces R2D3, an agent that makes efficient use of demonst...

10 Tom Le Paine, et al. ∙

research

∙ 10/11/2018

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL

Humans are experts at high-fidelity imitation -- closely mimicking a dem...

4 Tom Le Paine, et al. ∙

research

∙ 05/29/2018

Playing hard exploration games by watching YouTube

Deep reinforcement learning methods traditionally struggle with tasks wh...

2 Yusuf Aytar, et al. ∙

research

∙ 04/20/2017

Fast Generation for Convolutional Autoregressive Models

Convolutional autoregressive models have recently demonstrated state-of-...

0 Prajit Ramachandran, et al. ∙

research

∙ 02/26/2016

Seq-NMS for Video Object Detection

Video object detection is challenging because objects that are easily de...

0 Wei Han, et al. ∙

research

∙ 02/24/2016

How Deep Neural Networks Can Improve Emotion Recognition on Video Data

We consider the task of dimensional emotion recognition on video data us...

0 Pooya Khorrami, et al. ∙

research

∙ 10/10/2015

Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition?

Despite being the appearance-based classifier of choice in recent years,...

0 Pooya Khorrami, et al. ∙

research

∙ 12/20/2014

An Analysis of Unsupervised Pre-training in Light of Recent Advances

Convolutional neural networks perform well on object recognition because...

0 Tom Le Paine, et al. ∙

Tom Le Paine

Featured Co-authors

Sign in with Google

Consider DeepAI Pro