We study the problem of temporal-difference-based policy evaluation in
r...
Plasticity, the ability of a neural network to quickly change its predic...
A machine learning (ML) system must learn not only to match the output o...
We study the learning dynamics of self-predictive learning for reinforce...
Solving a reinforcement learning (RL) problem poses two competing challe...
The reinforcement learning (RL) problem is rife with sources of
non-stat...
The success of neural architecture search (NAS) has historically been li...
We challenge a common assumption underlying most supervised deep learnin...
Robustness of decision rules to shifts in the data-generating process is...
Existing generalization measures that aim to capture a model's simplicit...
While auxiliary tasks play a key role in shaping the representations lea...
We study reinforcement learning (RL) with no-reward demonstrations, a se...
We take a Bayesian perspective to illustrate a connection between traini...
Reliable yet efficient evaluation of generalisation performance of a pro...
Many real world data analysis problems exhibit invariant structure, and
...
The information bottleneck (IB) principle offers both a mechanism to exp...
Generalization across environments is critical to the successful applica...
This paper proposes a new approach to representation learning based on
g...
Since their introduction a year ago, distributional approaches to
reinfo...
Distributional reinforcement learning (distributional RL) has seen empir...
This report surveys the landscape of potential security threats from
mal...