
Muesli: Combining Improvements in Policy Optimization
We propose a novel policy update that combines regularized policy optimi...

Synthetic Returns for Long-Term Credit Assignment
Since the earliest days of reinforcement learning, the workhorse method ...

Neural Recursive Belief States in Multi-Agent Reinforcement Learning
In multi-agent reinforcement learning, the problem of learning to act is...

A case for new neural network smoothness constraints
How sensitive should machine learning models be to input changes? We tac...

Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Credit assignment in reinforcement learning is the problem of measuring ...

On the role of planning in model-based deep reinforcement learning
Model-based planning is often thought to be necessary for deep, careful ...

Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban
Intelligent robots need to achieve abstract objectives using concrete, s...

Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
Recent work in deep reinforcement learning (RL) has produced algorithms ...

Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning
Standard planners for sequential decision making (including Monte Carlo ...

Value-driven Hindsight Modelling
Value estimation is a critical component of the reinforcement learning (...

Causally Correct Partial Models for Reinforcement Learning
In reinforcement learning, we can learn a model of future observations a...

Combining Q-Learning and Search with Amortized Value Estimates
We introduce "Search with Amortized Value Estimates" (SAVE), an approach...

Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions
A plethora of problems in AI, engineering and the sciences are naturally...

Unsupervised Doodling and Painting with Improved SPIRAL
We investigate using reinforcement learning agents as generative models ...

An investigation of model-free planning
The field of reinforcement learning (RL) is facing increasingly challeng...

Credit Assignment Techniques in Stochastic Computation Graphs
Stochastic computation graphs (SCGs) provide a formalism to represent st...

Single-Agent Policy Tree Search With Guarantees
We introduce two novel tree search algorithms that use a policy to guide...

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search
Learning policies on data synthesized by models can in principle quench ...

Relational recurrent neural networks
Memory-based neural networks model temporal data by leveraging an abilit...

Learning to Search with MCTSnets
Planning problems are among the most important and well-studied problems...

Learning and Querying Fast Generative Models for Reinforcement Learning
A key challenge in model-based reinforcement learning (RL) is to synthes...

Imagination-Augmented Agents for Deep Reinforcement Learning
We introduce Imagination-Augmented Agents (I2As), a novel architecture f...

Learning model-based planning from scratch
Conventional wisdom holds that model-based planning is a powerful approa...

Visual Interaction Networks
From just a glance, humans can make rich predictions about the future st...

Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
We present a framework for efficient inference in structured image model...

Deep Reinforcement Learning in Large Discrete Action Spaces
Being able to reason in an environment with a large number of discrete a...

Automated Variational Inference in Probabilistic Programming
We present a new algorithm for approximate inference in probabilistic pr...
Theophane Weber