
DivideandConquer Monte Carlo Tree Search For GoalDirected Planning
Standard planners for sequential decision making (including Monte Carlo ...
read it

Valuedriven Hindsight Modelling
Value estimation is a critical component of the reinforcement learning (...
read it

Causally Correct Partial Models for Reinforcement Learning
In reinforcement learning, we can learn a model of future observations a...
read it

Unsupervised Doodling and Painting with Improved SPIRAL
We investigate using reinforcement learning agents as generative models ...
read it

An investigation of modelfree planning
The field of reinforcement learning (RL) is facing increasingly challeng...
read it

Relational recurrent neural networks
Memorybased neural networks model temporal data by leveraging an abilit...
read it

ImaginationAugmented Agents for Deep Reinforcement Learning
We introduce ImaginationAugmented Agents (I2As), a novel architecture f...
read it

Learning modelbased planning from scratch
Conventional wisdom holds that modelbased planning is a powerful approa...
read it

Visual Interaction Networks
From just a glance, humans can make rich predictions about the future st...
read it

Deep Reinforcement Learning in Large Discrete Action Spaces
Being able to reason in an environment with a large number of discrete a...
read it

Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
We present a framework for efficient inference in structured image model...
read it

Automated Variational Inference in Probabilistic Programming
We present a new algorithm for approximate inference in probabilistic pr...
read it

Learning and Querying Fast Generative Models for Reinforcement Learning
A key challenge in modelbased reinforcement learning (RL) is to synthes...
read it

Learning to Search with MCTSnets
Planning problems are among the most important and wellstudied problems...
read it

Woulda, Coulda, Shoulda: CounterfactuallyGuided Policy Search
Learning policies on data synthesized by models can in principle quench ...
read it

SingleAgent Policy Tree Search With Guarantees
We introduce two novel tree search algorithms that use a policy to guide...
read it

Credit Assignment Techniques in Stochastic Computation Graphs
Stochastic computation graphs (SCGs) provide a formalism to represent st...
read it

Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions
A plethora of problems in AI, engineering and the sciences are naturally...
read it

Combining QLearning and Search with Amortized Value Estimates
We introduce "Search with Amortized Value Estimates" (SAVE), an approach...
read it
Theophane Weber
is this you? claim profile