We analyse quantile temporal-difference learning (QTD), a distributional...
Deep neural networks are the most commonly used function approximators i...
Learning to act from observational data without active environmental
int...
Exploration remains a central challenge for reinforcement learning (RL)....
Scaling issues are mundane yet irritating for practitioners of reinforce...
While auxiliary tasks play a key role in shaping the representations lea...
Determining what experience to generate to best facilitate learning (i.e...
We introduce autoregressive implicit quantile networks (AIQN), a
fundame...
In this work, we build on recent advances in distributional reinforcemen...
We introduce new theoretical insights into two-population asymmetric gam...
The deep reinforcement learning community has made several independent
i...
We consider an agent's uncertainty about its environment and the problem...
This paper introduces new optimality-preserving operators on Q-functions...