Knowledge distillation is commonly used for compressing neural networks ...
In reinforcement learning (RL), state representations are key to dealing...
We introduce a value-based RL agent, which we call BBF, that achieves
su...
Auxiliary tasks improve the representations learned by deep reinforcemen...
In this work we identify the dormant neuron phenomenon in deep reinforce...
Offline model selection (OMS), that is, choosing the best policy from a ...
Many machine learning problems encode their data as a matrix with a poss...
The potential of offline reinforcement learning (RL) is that high-capaci...
Learning tabula rasa, that is without any prior knowledge, is the preval...
In reinforcement learning, state representations are used to tractably d...
Despite overparameterization, deep networks trained via supervised learn...
Deep reinforcement learning (RL) algorithms are predominantly evaluated ...
The shortcomings of maximum likelihood estimation in the context of
mode...
Reinforcement learning methods trained on few environments rarely learn
...
We identify an implicit under-parameterization phenomenon in value-based...
This paper describes the system proposed for addressing the research pro...
Experience replay is central to off-policy algorithms in deep reinforcem...
Offline methods for reinforcement learning have the potential to help br...
Deep neural networks (DNNs) are powerful black-box predictors that have
...
Reflecting on the advances of off-policy deep reinforcement learning (RL...
We consider the problem of learning from sparse and underspecified rewar...
The current state-of-the-art Scrabble agents are not learning-based but
...