
Data-efficient Hindsight Off-policy Option Learning
Solutions to most complex tasks can be decomposed into simpler, intermed...

Acme: A Research Framework for Distributed Reinforcement Learning
Deep reinforcement learning has led to many recent and groundbreaking ad...

A Distributional View on Multi-Objective Policy Optimization
Many real-world problems require trading off multiple competing objectiv...

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Off-policy reinforcement learning algorithms promise to be applicable in...

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Many real-world control problems involve both discrete decision variable...

Quinoa: a Q-function You Infer Normalized Over Actions
We present an algorithm for learning an approximate action-value soft Q...

Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer
Learning robotic control policies in the real world gives rise to challe...

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Humans are masters at quickly learning many complex tasks, relying on an...

Augmenting learning using symmetry in a biologically-inspired domain
Invariances to translation, rotation and other spatial transformations a...

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Some of the most successful applications of deep reinforcement learning ...

Regularized Hierarchical Policies for Compositional Transfer in Robotics
The successful application of flexible, general learning algorithms  s...

Robust Reinforcement Learning for Continuous Control with Model Misspecification
We provide a framework for incorporating robustness  to perturbations ...

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
We present a method for fast training of vision based control policies o...

Value constrained model-free continuous control
The naive application of Reinforcement Learning algorithms to continuous...

Relative Entropy Regularized Policy Iteration
We present an off-policy actor-critic algorithm for Reinforcement Learni...

Maximum a Posteriori Policy Optimisation
We introduce a new algorithm for reinforcement learning called Maximum a...

DeepMind Control Suite
The DeepMind Control Suite is a set of continuous control tasks with a s...

Deep Reinforcement Learning with Relative Entropy Stochastic Search
Many reinforcement learning methods for continuous control tasks are bas...
Abbas Abdolmaleki