The Languini Kitchen serves as both a research collective and codebase d...
Goal-conditioned Reinforcement Learning (RL) aims at learning optimal po...
Learning to evaluate and improve policies is a core problem of Reinforce...
Neural ordinary differential equations (ODEs) have attracted much attent...
Upside-Down Reinforcement Learning (UDRL) is an approach for solving RL ...
Reward-Weighted Regression (RWR) belongs to a family of widely known ite...
Under the Bayesian brain hypothesis, behavioural variations can be attri...
Learning value functions off-policy is at the core of modern Reinforceme...
Policy optimization is an effective reinforcement learning approach to s...