
-
Experimental design for MRI by greedy policy search
In today's clinical practice, magnetic resonance imaging (MRI) is routin...
read it
-
Doubly Stochastic Variational Inference for Neural Processes with Hierarchical Latent Variables
Neural processes (NPs) constitute a family of variational approximate mo...
read it
-
An Autonomous Free Airspace En-route Controller using Deep Reinforcement Learning Techniques
Air traffic control is becoming a more and more complex task due to the ...
read it
-
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
This paper introduces MDP homomorphic networks for deep reinforcement le...
read it
-
Social navigation with human empowerment driven reinforcement learning
The next generation of mobile robots needs to be socially-compliant to b...
read it
-
Estimating Gradients for Discrete Random Variables by Sampling without Replacement
We derive an unbiased estimator for expectations over discrete random va...
read it
-
Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales
Neural Network based controllers hold enormous potential to learn comple...
read it
-
Reinforcement Learning with Non-uniform State Representations for Adaptive Search
Efficient spatial exploration is a key aspect of search and rescue. In t...
read it
-
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
The well-known Gumbel-Max trick for sampling from a categorical distribu...
read it
-
Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks
Diversity of environments is a key challenge that causes learned robotic...
read it
-
Deep Generative Modeling of LiDAR Data
Building models capable of generating structured output is a key challen...
read it
-
BanditSum: Extractive Summarization as a Contextual Bandit
In this work, we propose a novel method for training neural networks to ...
read it
-
Addressing Function Approximation Error in Actor-Critic Methods
In value-based reinforcement learning methods such as deep Q-learning, f...
read it
-
Policy Search with High-Dimensional Context Variables
Direct contextual policy search methods learn to improve policy paramete...
read it