
Pointer Graph Networks
Graph neural networks (GNNs) are typically applied to static graphs that...
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning
Standard planners for sequential decision making (including Monte Carlo ...
Value-driven Hindsight Modelling
Value estimation is a critical component of the reinforcement learning (...
Causally Correct Partial Models for Reinforcement Learning
In reinforcement learning, we can learn a model of future observations a...
Combining Q-Learning and Search with Amortized Value Estimates
We introduce "Search with Amortized Value Estimates" (SAVE), an approach...
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions
A plethora of problems in AI, engineering and the sciences are naturally...
Credit Assignment Techniques in Stochastic Computation Graphs
Stochastic computation graphs (SCGs) provide a formalism to represent st...
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Learning policies on data synthesized by models can in principle quench ...
Learning and Querying Fast Generative Models for Reinforcement Learning
A key challenge in model-based reinforcement learning (RL) is to synthes...
Fast amortized inference of neural activity from calcium imaging data with variational autoencoders
Calcium imaging permits optical measurement of neural activity. Since in...
Imagination-Augmented Agents for Deep Reinforcement Learning
We introduce Imagination-Augmented Agents (I2As), a novel architecture f...
Learning model-based planning from scratch
Conventional wisdom holds that model-based planning is a powerful approa...
Black box variational inference for state space models
Latent variable time-series models are among the most heavily used tools...
Bayesian Manifold Learning: The Locally Linear Latent Variable Model (LL-LVM)
We introduce the Locally Linear Latent Variable Model (LL-LVM), a probab...
Lars Buesing
