
Quinoa: a Q-function You Infer Normalized Over Actions
We present an algorithm for learning an approximate action-value soft Q...
Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Many real-world control problems involve both discrete decision variable...
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Off-policy reinforcement learning algorithms promise to be applicable in...
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models
Humans are masters at quickly learning many complex tasks, relying on an...
Relative Entropy Regularized Policy Iteration
We present an off-policy actor-critic algorithm for Reinforcement Learni...
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
We present a method for fast training of vision based control policies o...
Regularized Hierarchical Policies for Compositional Transfer in Robotics
The successful application of flexible, general learning algorithms s...
Deep Reinforcement Learning with Relative Entropy Stochastic Search
Many reinforcement learning methods for continuous control tasks are bas...
DeepMind Control Suite
The DeepMind Control Suite is a set of continuous control tasks with a s...
Maximum a Posteriori Policy Optimisation
We introduce a new algorithm for reinforcement learning called Maximum a...
Value constrained model-free continuous control
The naive application of Reinforcement Learning algorithms to continuous...
Robust Reinforcement Learning for Continuous Control with Model Misspecification
We provide a framework for incorporating robustness to perturbations ...
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Some of the most successful applications of deep reinforcement learning ...
Augmenting learning using symmetry in a biologically-inspired domain
Invariances to translation, rotation and other spatial transformations a...
Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer
Learning robotic control policies in the real world gives rise to challe...
A Distributional View on Multi-Objective Policy Optimization
Many real-world problems require trading off multiple competing objectiv...
Abbas Abdolmaleki
