
Rethinking Exploration for SampleEfficient Policy Learning
Offpolicy reinforcement learning for control has made great strides in ...
read it

Training Generative Adversarial Networks by Solving Ordinary Differential Equations
The instability of Generative Adversarial Network (GAN) training has fre...
read it

Learning Dexterous Manipulation from Suboptimal Experts
Learning dexterous manipulation in highdimensional stateaction spaces ...
read it

Local Search for Policy Iteration in Continuous Control
We present an algorithm for local, regularized, policy improvement in re...
read it

Critic Regularized Regression
Offline reinforcement learning (RL), also known as batch RL, offers the ...
read it

Simple Sensor Intentions for Exploration
Modern reinforcement learning algorithms can learn solutions to increasi...
read it

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Offpolicy reinforcement learning algorithms promise to be applicable in...
read it

ContinuousDiscrete Reinforcement Learning for Hybrid Control in Robotics
Many realworld control problems involve both discrete decision variable...
read it

Quinoa: a Qfunction You Infer Normalized Over Actions
We present an algorithm for learning an approximate actionvalue soft Q...
read it

Imagined Value Gradients: ModelBased Policy Optimization with Transferable Latent Dynamics Models
Humans are masters at quickly learning many complex tasks, relying on an...
read it

VMPO: OnPolicy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
Some of the most successful applications of deep reinforcement learning ...
read it

Regularized Hierarchical Policies for Compositional Transfer in Robotics
The successful application of flexible, general learning algorithms  s...
read it

Robust Reinforcement Learning for Continuous Control with Model Misspecification
We provide a framework for incorporating robustness  to perturbations ...
read it

Selfsupervised Learning of Image Embedding for Continuous Control
Operating directly from raw high dimensional sensory inputs like images ...
read it

Relative Entropy Regularized Policy Iteration
We present an offpolicy actorcritic algorithm for Reinforcement Learni...
read it

Maximum a Posteriori Policy Optimisation
We introduce a new algorithm for reinforcement learning called Maximum a...
read it

Graph networks as learnable physics engines for inference and control
Understanding and interacting with everyday physical scenes requires ric...
read it

Learning by Playing  Solving Sparse Reward Tasks from Scratch
We propose Scheduled Auxiliary Control (SACX), a new learning paradigm ...
read it

Deep learning with convolutional neural networks for EEG decoding and visualization
A revised version of this article is now available at Human Brain Mappin...
read it

Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
In this paper we consider the problem of robot navigation in simple maze...
read it

Asynchronous Stochastic Gradient MCMC with Elastic Coupling
We consider parallel asynchronous Markov Chain Monte Carlo (MCMC) sampli...
read it

Unsupervised and Semisupervised Learning with Categorical Generative Adversarial Networks
In this paper we present a method for learning a discriminative classifi...
read it

Multimodal Deep Learning for Robust RGBD Object Recognition
Robust object recognition is a crucial ingredient of many, if not all, r...
read it

Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
We introduce Embed to Control (E2C), a method for model learning and con...
read it

Striving for Simplicity: The All Convolutional Net
Most modern convolutional neural networks (CNNs) used for object recogni...
read it

Learning to Generate Chairs, Tables and Cars with Convolutional Networks
We train generative 'upconvolutional' neural networks which are able to...
read it

Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks
Deep convolutional networks have proven to be very successful in learnin...
read it

Improving Deep Neural Networks with Probabilistic Maxout Units
We present a probabilistic variant of the recently introduced maxout uni...
read it

Unsupervised feature learning by augmenting single images
When deep learning is applied to visual object recognition, data augment...
read it
Jost Tobias Springenberg
is this you? claim profile
Staff Research Scientist at AlbertLudwigsUniversity Freiburg im Breisgau