Pool-based active learning (AL) is a promising technology for increasing...
A natural solution concept for many multiagent settings is the Stackelbe...
Hierarchical methods in reinforcement learning have the potential to red...
Exposure bias is a well-known issue in recommender systems where items a...
In reinforcement learning, different reward functions can be equivalent ...
Recent works on machine learning for combinatorial optimization have sho...
Hex is a turn-based two-player connection game with a high branching fac...
We consider multi-agent reinforcement learning (MARL) for cooperative
co...
We present Nonparametric Approximation of Inter-Trace returns (NAIT), a
...
Enabling reinforcement learning (RL) agents to leverage a knowledge base...
This paper introduces Multi-Agent MDP Homomorphic Networks, a class of
n...
Solving robotic navigation tasks via reinforcement learning (RL) is
chal...
Exploration is an essential component of reinforcement learning algorith...
Given two sources of evidence about a latent variable, one can combine t...
Routing problems are a class of combinatorial problems with many practic...
Reinforcement learning is a promising paradigm for solving sequential
de...
In today's clinical practice, magnetic resonance imaging (MRI) is routin...
Neural processes (NPs) constitute a family of variational approximate mo...
Air traffic control is becoming a more and more complex task due to the
...
This paper introduces MDP homomorphic networks for deep reinforcement
le...
The next generation of mobile robots needs to be socially-compliant to b...
We derive an unbiased estimator for expectations over discrete random
va...
Neural Network based controllers hold enormous potential to learn comple...
Efficient spatial exploration is a key aspect of search and rescue. In t...
The well-known Gumbel-Max trick for sampling from a categorical distribu...
Diversity of environments is a key challenge that causes learned robotic...
Building models capable of generating structured output is a key challen...
In this work, we propose a novel method for training neural networks to
...
In value-based reinforcement learning methods such as deep Q-learning,
f...
Direct contextual policy search methods learn to improve policy paramete...