Planning with Diffusion for Flexible Behavior Synthesis

05/20/2022
by   Michael Janner, et al.
2

Model-based reinforcement learning methods often use learning only for the purpose of estimating an approximate dynamics model, offloading the rest of the decision-making work to classical trajectory optimizers. While conceptually simple, this combination has a number of empirical shortcomings, suggesting that learned models may not be well-suited to standard trajectory optimization. In this paper, we consider what it would look like to fold as much of the trajectory optimization pipeline as possible into the modeling problem, such that sampling from the model and planning with it become nearly identical. The core of our technical approach lies in a diffusion probabilistic model that plans by iteratively denoising trajectories. We show how classifier-guided sampling and image inpainting can be reinterpreted as coherent planning strategies, explore the unusual and useful properties of diffusion-based planning methods, and demonstrate the effectiveness of our framework in control settings that emphasize long-horizon decision-making and test-time flexibility.

READ FULL TEXT

page 2

page 3

page 4

page 7

page 9

page 10

page 11

page 12

research
06/15/2023

Deep Generative Models for Decision-Making and Control

Deep model-based reinforcement learning methods offer a conceptually sim...
research
06/07/2023

Professional Basketball Player Behavior Synthesis via Planning with Diffusion

Dynamically planning in multi-agent systems has been explored to improve...
research
06/27/2023

Trajectory Generation, Control, and Safety with Denoising Diffusion Probabilistic Models

We present a framework for safety-critical optimal control of physical s...
research
03/28/2019

Regularizing Trajectory Optimization with Denoising Autoencoders

Trajectory optimization with learned dynamics models can often suffer fr...
research
06/25/2021

Predictive Control Using Learned State Space Models via Rolling Horizon Evolution

A large part of the interest in model-based reinforcement learning deriv...
research
05/22/2023

Training Diffusion Models with Reinforcement Learning

Diffusion models are a class of flexible generative models trained with ...
research
04/22/2020

Flexible and Efficient Long-Range Planning Through Curious Exploration

Identifying algorithms that flexibly and efficiently discover temporally...

Please sign up or login with your details

Forgot password? Click here to reset