Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

11/20/2019
by   Vibhavari Dasagi, et al.
8

Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many of these algorithms, but generating robot experience in the real world is expensive, especially when each task requires a lengthy online training procedure. Off-policy algorithms can in principle learn arbitrary tasks from a diverse enough fixed dataset. In this work, we evaluate popular exploration methods by generating robotics datasets for the purpose of learning to solve tasks completely offline without any further interaction in the real world. We present results on three popular continuous control tasks in simulation, as well as continuous control of a high-dimensional real robot arm. Code documenting all algorithms, experiments, and hyper-parameters is available at https://github.com/qutrobotlearning/batchlearning.

READ FULL TEXT
research
05/12/2020

Generalized State-Dependent Exploration for Deep Reinforcement Learning in Robotics

Reinforcement learning (RL) enables robots to learn skills from interact...
research
03/21/2022

Quad2Plane: An Intermediate Training Procedure for Online Exploration in Aerial Robotics via Receding Horizon Control

Data driven robotics relies upon accurate real-world representations to ...
research
03/03/2023

Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning

Reinforcement learning has shown great potential in solving complex task...
research
11/14/2020

PLAS: Latent Action Space for Offline Reinforcement Learning

The goal of offline reinforcement learning is to learn a policy from a f...
research
03/24/2023

Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition

Deep reinforcement learning (DRL) frameworks are increasingly used to so...
research
04/19/2023

Torque-based Deep Reinforcement Learning for Task-and-Robot Agnostic Learning on Bipedal Robots Using Sim-to-Real Transfer

In this paper, we review the question of which action space is best suit...
research
06/22/2020

dm_control: Software and Tasks for Continuous Control

The dm_control software package is a collection of Python libraries and ...

Please sign up or login with your details

Forgot password? Click here to reset