Planning with a learned model is arguably a key component of intelligenc...
How much credit (or blame) should an action taken in a state get for a f...
In this work, we study auxiliary prediction tasks defined by temporal-di...
We present a method for learning intrinsic reward functions to drive the...
Arguably, intelligent agents ought to be able to discover their own ques...
The exploration of novel chemical spaces is one of the most important ta...
Motivated by vision-based reinforcement learning (RL) problems, in parti...